- Issue created by @lammensj
- ๐ฉ๐ชGermany jurgenhaas Gottmadingen
Thanks @lammensj for creating this issue about our Slack thread. To the maintainers, I suggest removing the
handle_multiples = TRUE
from the encode plugin. - ๐ง๐ชBelgium lammensj
Another (non-breaking) solution might be to create a specific Decode-plugin that doesn't handle multiples. Different options are possible :-)
- ๐ณ๐ฟNew Zealand ericgsmith
Hello!
Yes, this does look like a valid issue with this plugin - and I agree with you @lammensj that probably these should be 2 different things.
Posting my comment from slack for reference:
Yes - that does look like an issue with the plugin. I'm going to be honest, I'd not see that plugin before! It is doing 2 things and feels like it should really be 2 different plugins. It allows encoding and decoding. Encoding functions can handle multiple values, but the decoding functions don't - they expect the format of whatever function is configured - json_decone, unserialize and base64_decode all require a string input.
I think removing handles multiple is going to break things for people encoding things - I'll move my thoughts to the Drupal issue, but I suspect maybe 2 plugins (encode allows multiple inputs / produces single value, decode allows single input/ can produce multiple values) and then we might need to provide some backwards compatibility for people using the existing encode plugin to decode this?
https://drupal.slack.com/archives/C34CECZAL/p1696530311707069?thread_ts=...
- last update
over 1 year ago 531 pass, 4 fail - @lammensj opened merge request.
- last update
over 1 year ago 531 pass, 4 fail - last update
over 1 year ago 542 pass, 2 fail - last update
over 1 year ago 543 pass - Status changed to Needs review
over 1 year ago 7:07pm 6 October 2023 - last update
over 1 year ago 545 pass - last update
over 1 year ago 545 pass - ๐ณ๐ฟNew Zealand ericgsmith
Thank you so much for the contribution!
I've updated the MR with a coding standard fix + added functional test to ensure the decode plugin is submitting configuration as expected.
I've then pushed a further commit as an approach to deprecate using the decode methods in the encode plugin. I think even though we only have an alpha release, we have enough modules and people relying on stability to not just remove them outright. The trouble is its hard for tamper to do any kind of update hook given we don't really know how all people are using this module.
I'm keen for feedback / discussion around the deprecation - specifically if we should be removing these options from the UI at this point but allowing existing config to be re saved.
- ๐ณ๐ฟNew Zealand ericgsmith
Ah, introduced a couple coding standards in that last push, will tidy up on Monday
- Status changed to RTBC
over 1 year ago 9:28am 13 October 2023 - ๐ฉ๐ชGermany jurgenhaas Gottmadingen
This looks pretty good to me, nice move. The deprecation path sounds solid, although the former version of the plugin wasn't really usable in the first place, if implementations were to take the
handle_multiples
serious because it should have always received an array as data, and that should have already thrown an exception. So, you could also assume, the decode plugins had not been used, or they had been used by some other module in the wrong way. - ๐ณ๐ฟNew Zealand ericgsmith
Thank you for the review.
The multiple flag indicates that the plugin can handle multiple values, not that it only handles multiples. E.g with encode you can still pass in a string or an array. So it's possible to still have used the encode plugin with decode options and string source values.
- Status changed to Needs work
12 months ago 8:48am 8 April 2024 - ๐ฎ๐นItaly kopeboy Milan
I tested the patch with both the stable and the latest-dev versions of ECA & ECA Tamper modules and I'm still getting the "must be of type string" error when I run the action to encode in base64 (the input to be encoded is a simple string):
Before applying MR 20 to latest dev version of tamper module:
TypeError: base64_encode(): Argument #1 ($string) must be of type string, array given in base64_encode() (line 79 of /var/www/html/web/modules/contrib/tamper/src/Plugin/Tamper/Encode.php
After:
TypeError: base64_encode(): Argument #1 ($string) must be of type string, array given in /var/www/html/web/modules/contrib/tamper/src/Plugin/Tamper/Encode.php on line 31 #0 [internal function]: base64_encode(Array)
After, with stable versions of ECA & ECA tamper (note the array is missing from the inside of last parentheses):
TypeError: base64_encode(): Argument #1 ($string) must be of type string, array given in /var/www/html/web/modules/contrib/tamper/src/Plugin/Tamper/Encode.php on line 31 #0 [internal function]: base64_encode()
The PHP Serialize and JSON encode options work fine instead, in both stable & dev versions (after applying MR20), but I noticed that even if the data to encode was a simple string, the encoded result was an array, eg:
- JSON encoded = a:1:{i:0;s:15:"ciao, che bello";}
- PHP Serialized = ["ciao, che bello"]
- ๐ง๐ชBelgium lammensj
For reference: the code responsible for always providing an array to the Encode-plugin, when using Tamper via ECA, can be found here: https://git.drupalcode.org/project/eca_tamper/-/blob/1.0.x/src/Plugin/Ta.... Be aware that this is a separate module.
- ๐ง๐ชBelgium lammensj
@kopeboy, after further investigation I believe we should shift the problem that you're encountering to the eca_tamper-module. The last comment from @ericgsmith explains what the annotion-property "handle_multiples" means. This was not clear when eca_tamper was created and thus its adaption should be altered.
I've create https://www.drupal.org/project/eca_tamper/issues/3439502 โจ Refactor usage of "handle_multiples" for Encode-plugin Active for this.
- Status changed to RTBC
12 months ago 8:39pm 8 April 2024 - ๐ฎ๐นItaly kopeboy Milan
I kind of agree but I'm not an expert coder ๐ I'll follow up there, Thank you.
- ๐ณ๐ฟNew Zealand ericgsmith
Just wanted to sanity check the deprecation removal - https://drupal.slack.com/archives/C34CECZAL/p1714710131685999
As discussed above - there has only been an issue with ECA using this plugin, or with feed tamper if decode options were called with a multiple value - calling with decode and a single value via feeds tamper is working - so feeds would be impacted by removal. I'm happy with the change.
- Status changed to Needs work
11 months ago 4:43am 3 May 2024 - ๐ณ๐ฟNew Zealand ericgsmith
Actually reviewing #11 - while ECA should not be converting it to an array there is still the same issue here.
base64_encode only accepts strings, but serialize and json_encode can handle multiple values.
Maybe we should split them all out?
- ๐ณ๐ฑNetherlands megachriz
Related issue: ๐ Encode plugin: only call function from the available options Active .
Perhaps we would need test coverage too to check if multiple values are handled well for those modes that can.
- ๐ฎ๐นItaly kopeboy Milan
FYI there was a recent MR at https://www.drupal.org/project/eca_tamper/issues/3439502 โจ Refactor usage of "handle_multiples" for Encode-plugin Active , but that didn't solve the issue on the Base64 encode action.
- ๐ณ๐ฑNetherlands megachriz
I've added a table to the issue summary that gives an overview of all modes and whether or not they can work with arrays.
Keeping it in a single plugin is doable
Adding a separate Decode-plugin and then deprecating the decode options for the encode plugin sounds like a nice idea. But at the same time I think we could fix the issue in the Encode plugin itself by just checking whether or not we receive an array or string. If it is an array, we could just iterate across all values. The Keyword Filter plugin does this as well. This would be less disruptive for existing users of the plugin.
Return value can be multiple values sometimes
Additionally, the plugin does need to report whether the result is multiple values or not. 'Unserialize', 'JSON decode' and 'YAML decode' can return multiple values, so for these modes the method
multiple()
should returnTRUE
if the result is an array. The plugin 'Required' does this as well. The Keyword Filter plugin does not, but I think that is a bug. I will create a new issue for this later.Test coverage
If we keep it all in a single plugin, we need additional test coverage to ensure that each mode can handle multiple values. We also need test coverage for the method
multiple()
returning the right value:TRUE
when an array is the result,FALSE
otherwise. I've added tasks for these to the remaining tasks list in the issue summary.Since ๐ Encode plugin: only call function from the available options Active touches the same code, I think it would be better to first finalize that one and then work on a fix for this one. Test coverage could already be written however. I think I'll make a start on that.
Let me know your thoughts! :)
- ๐ณ๐ฑNetherlands megachriz
๐ Encode plugin: only call function from the available options Active is now merged.
From #20:
the method
multiple()
should return TRUE if the result is an array. The plugin 'Required' does this as well. The Keyword Filter plugin does not, but I think that is a bug. I will create a new issue for this later.Created ๐ Keyword Filter should report that multivalued data remains multivalued Active for this.
- ๐ณ๐ฑNetherlands megachriz
I've now implemented a fix that makes the plugin handle multiples themselves. If an array is passed, but the selected encode method does not accept arrays, then the plugin iterates over all values.
I think that this approach is less desruptive for existing users of the plugin.
Let me know what you think of my approach to fix this issue and tests or code reviews are welcome! I'm aiming for a new Tamper release mid/end February, so it would be cool if you are able to test or review before that.
- ๐ณ๐ฑNetherlands megachriz
Updated issue title to reflect the current approach that is implemented.
- ๐ฉ๐ชGermany jurgenhaas Gottmadingen
@megachriz I've now had a chance to look at your table in the IS and the MR!42. I like the code clean-up a lot and it seems to go the right direction. But I'm uncertain that the modes serialize, json_encode , and yaml_encode should handle multiples. Imagine, you have an array of data, and you want to serialize that array into a string so that it can be stored and later be unserialized, you want to pass the array in and receive one serialized string back, The current implementation seems to serialize each element in the array individually and pass back an array or serialized objects. That's not what people expect, I guess.
- ๐ณ๐ฑNetherlands megachriz
@jurgenhaas
Thanks for the review! Not for every mode iteration happens. Only for the modes where'handle_multiples'
is set toFALSE
. SeeEncode::getModes()
.The following modes have set
'handle_multiples'
toTRUE
. These modes accept arrays as input:- serialize
- json_encode
- yaml_encode
When for these modes an array is passed, the array will be passed directly to the function and not be iterated over.
The following modes have set
'handle_multiples'
toFALSE
. These modes do not accept arrays as input.- unserialize
- json_decode
- base64_encode
- base64_decode
- yaml_decode
When for these modes an array is passed, there will be iterated over the array and each value of the array will passed to the function.
Demonstration
Given the following input:
$input = [ [ 'title' => 'foo', 'body' => 'footsie', ], [ 'title' => 'bar', 'body' => 'barkruk', ], [ 'title' => 'qux', 'body' => 'quxious', ], ];
When executing
var_dump($plugin->tamper($input));
:- 'serialize' will return
string(177) "a:3:{i:0;a:2:{s:5:"title";s:3:"foo";s:4:"body";s:7:"footsie";}i:1;a:2:{s:5:"title";s:3:"bar";s:4:"body";s:7:"barkruk";}i:2;a:2:{s:5:"title";s:3:"qux";s:4:"body";s:7:"quxious";}}"
- 'json_encode' will return
string(100) "[{"title":"foo","body":"footsie"},{"title":"bar","body":"barkruk"},{"title":"qux","body":"quxious"}]"
- 'yaml_encode' will return
string(105) "- title: foo body: footsie - title: bar body: barkruk - title: qux body: quxious "
base64_encode actually fails on the above, because it cannot handle multidimensional arrays, but for the following input:
$input = [ 'title' => 'foo', 'body' => 'footsie', ];
It returns:
array(2) { ["title"]=> string(4) "Zm9v" ["body"]=> string(12) "Zm9vdHNpZQ==" }
But the point is, modes 'serialize', 'json_encode' and 'yaml_encode' will always result into a string if an array, a scalar value or null is passed (classed objects might throw errors, depending on their implementation).
- ๐ฉ๐ชGermany jurgenhaas Gottmadingen
Not for every mode iteration happens. Only for the modes where 'handle_multiples' is set to FALSE. See Encode::getModes().
Ah, my bad. I missed the negation in the condition
if (!$handles_multiples && is_array($data)) {
. Now, with a closer look I can confirm this is perfect. Setting to RTBC. -
megachriz โ
committed 8c50bb58 on 8.x-1.x
Issue #3391096 by lammensj, megachriz, ericgsmith, jurgenhaas, kopeboy:...
-
megachriz โ
committed 8c50bb58 on 8.x-1.x
- ๐ณ๐ฑNetherlands megachriz
I merged the code! Thanks all who contributed to this issue.
- Status changed to Fixed
29 days ago 1:19pm 8 March 2025 Automatically closed - issue fixed for 2 weeks with no activity.