Uppercase JSON cannot be parsed

Created on 21 April 2024, 12 months ago
Updated 11 May 2024, 11 months ago

Problem/Motivation

JSON is case-sensitive. The jmespath parser (and perhaps other JSON parsers, not tested yet) produce a data array with all property names lowercased, which breaks tamper code that expects the JSON to have uppercase characters in the property name.

I have been unable to find the exact cause of this issue; the problem might lie in Feeds Tamper or Feeds itself.

Steps to reproduce

1. Create a feed type that uses the jmespath parser.
2. For the mappings, use some lowerCamelCase property names, like metaDescription and urlSource.
3. Use Feeds Tamper to process one of the mappings with a custom feeds import plugin.
4. In my_feeds_tamper_plugin->tamper($data, $item), item will have the JSON property mappings, but they will all be lowercase.

I tried to debug where this lowercasing is happening. In JmesPathParser.php, ->executeContext() and ->executeSourceExpression() have lowerCamelCase properties as expected (the $machine_name is all lowercase, and $expression is lowerCamelCase as expected, so I am guessing that the bug is caused by swapping the $machine_name and $expression somewhere). So at this point, the data is OK.

However, in Feeds Tamper's FeedsSubscriber->afterParse(), the data in the event contains only lowercase strings for the JSON properties. So at this point, the data is incorrect.

What happens between the parsing in JmesPathParser.php and FeedsSubscriber->afterParse()? Any hints about how to debug this further would be much appreciated.

🐛 Bug report
Status

Needs work

Version

1.0

Component

Code

Created by

🇯🇵Japan ptmkenny

Live updates comments and jobs are added and updated live.
  • Needs tests

    The change is currently missing an automated test that fails when run with the original code, and succeeds when the bug has been fixed.

Sign in to follow issues

Merge Requests

Comments & Activities

  • Issue created by @ptmkenny
  • 🇯🇵Japan ptmkenny

    As usual, immediately after posting the issue, I find the bug. The issue is ParserBase->executeSources(). Will debug further.

  • Merge request !39make json property names case-sensitive → (Open) created by ptmkenny
  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 12 months ago
    88 pass, 12 fail
  • Status changed to Needs review 12 months ago
  • 🇯🇵Japan ptmkenny

    Ok, so here's a tentative fix for this. This makes the JSON property names case-sensitive by using $source['value'] instead of $source['machine_name'].

    However, this bug is not limited to JSON; I noticed that one of my Feeds Tamper plugins has a workaround for CSV names as well. So I think that perhaps the use of machine names rather than the value should be changed globally.

  • Pipeline finished with Failed
    12 months ago
    Total: 417s
    #152662
  • 🇳🇱Netherlands megachriz

    Thanks for finding the bug (although I haven't tried to reproduce it yet). I remember I had made a change to the CSV parser in Feeds for a similar issue in the past, but there was a reason that I needed to revert that fix.

    Anyway, it would be useful to have test coverage here.

  • 🇳🇱Netherlands megachriz

    I don't remember all the details anymore, but it was this issue that I reverted later: #2997951: Custom mapping source should use value as item getter instead of machine name? .

  • Status changed to Needs work 12 months ago
  • 🇳🇱Netherlands megachriz

    It is failing tests, so it at least needs work. Although I'm not certain (because it was long ago), I would think #2997951: Custom mapping source should use value as item getter instead of machine name? and #3015786-8: Changes for latest dev version of Feeds contain useful information for dealing with this issue.

  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 12 months ago
    88 pass, 12 fail
  • 🇯🇵Japan ptmkenny

    Thank you for the links; those are helpful.

    I agree this needs tests. I'm having a little trouble with debugging the tests in ddev at the moment, so once I get that figured out, I'll try to fix this.

    I think it would be good to test both lowerCamelCase and snake_case JSON properties, since those are the two common ways to write JSON property names.

  • Pipeline finished with Failed
    12 months ago
    Total: 383s
    #153355
  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 11 months ago
    88 pass, 12 fail
  • Pipeline finished with Failed
    11 months ago
    #158952
  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 11 months ago
    88 pass, 12 fail
  • Pipeline finished with Failed
    11 months ago
    #158955
  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 11 months ago
    88 pass, 12 fail
  • Pipeline finished with Failed
    11 months ago
    #158961
  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 11 months ago
    94 pass, 10 fail
  • Pipeline finished with Failed
    11 months ago
    #158969
  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 11 months ago
    88 pass, 12 fail
  • Pipeline finished with Running
    11 months ago
    #158974
  • Open in Jenkins → Open on Drupal.org →
    Core: 9.5.5 + Environment: PHP 7.4 & MySQL 5.7
    last update 11 months ago
    88 pass, 12 fail
  • Pipeline finished with Failed
    11 months ago
    #158976
  • 🇯🇵Japan ptmkenny

    Added tests, but I still need to fix the existing tests. I'm having trouble figuring out what is wrong.

  • 🇯🇵Japan ptmkenny

    Another problem: even with this fix, if you map the Feeds item (feeds_item): Item GUID to a lowerCamelCase property, it will fail (the property is not read). Setting it to an all lowercase property name does work.

    So it seems the item GUID is read differently than other properties, and perhaps should be tested separately.

  • 🇯🇵Japan ptmkenny

    Another issue with the current MR is that the Mapping UI displays the machine name if the casing differs from the input name.

    For example,

    metaDescription (metadescription)

    This is probably to indicate that the () value must be used (because Feeds doesn't currently handling casing), but in this case, since the MR adds support for casing, it is confusing to show the machine name and it should be removed.

  • 🇯🇵Japan ptmkenny

    And another problem: My testing was all through Feeds Tamper, where this MR helps out.

    But if you map a field and do not tamper it, even with this MR, Feeds will expect lowercase, not uppercase.

    TL;DR: This is going to be a hard problem to fix, so for now, I suggest putting a clear message on the project page: Uppercase characters are not currently supported in JSON keys (for example, { "uppercaseDoesNotWorkHere": "You can use uppercase here" }).

  • Pipeline finished with Failed
    7 months ago
    Total: 312s
    #277225
Production build 0.71.5 2024