Make external entities translatable

Issue created by @rp7
Merge request !57Draft: Make external entities translatable → (Open) created by rp7
Comment 6 months ago →
🇨🇦Canada colan Toronto 🇨🇦
Adding ✨ Translation support in vertical data aggregation Active as related.
Comment 6 months ago →
🇨🇦Canada colan Toronto 🇨🇦
@rp7: Would you kindly let us know what's left to do here? Thanks.
Comment 6 months ago →
🇧🇪Belgium rp7
Been using this on a quite busy project in production for a few months now, for 3 separate annotatable external entity types. No issues reported so far.

But it now appears to be in conflict with ✨ Translation support in vertical data aggregation Active , which is tailored to a different kind of external API format.

Besides that, test coverage is non-existent yet.
Comment 6 months ago →
🇨🇦Canada colan Toronto 🇨🇦
Thanks for the update!

Okay, so we need to figure out how to incorporate both approaches without collision.
Comment 4 months ago →
🇨🇦Canada colan Toronto 🇨🇦
Updated title so I stop getting it confused with ✨ Translation support in vertical data aggregation Active .
Comment 3 months ago →
🇫🇷France guignonv Montpellier
With #3506455, we can (I did not succeed yet though...) merge several language sources into a single external entity with sub-arrays keyed by __external_entity_translation__<LANG_CODE>.
For instance, example 1:

[ 'id' => 42, 'title' => 'English title', 'somefield' => 'other text', 'someotherfield' => 501, '__external_entity_translation__fr' => [ 'id' => 42, 'title' => 'Titre français', 'somefield' => 'autre texte', 'someotherfield' => 501, ], ]
@rp7, in your current approach, you expect raw data to be an array keyed by languages.
For instance, example 2:

[ 'en' => [ 'id' => 42, 'title' => 'English title', 'somefield' => 'other text', 'someotherfield' => 501, ], 'fr' => [ 'id' => 42, 'title' => 'Titre français', 'somefield' => 'autre texte', 'someotherfield' => 501, ], ]
What I don't like with example 1 is that untranslated fields can be duplicated (and it could cost a lot of memory, depending what is stored) and what I don't like in example 2 is that some object may be untranslated but have a key corresponding to a language code which may lead to an incorrect "language availability guessing". If you systematically change the raw structure of all external entities to add a language layer, then there would be no "incorrect guessing" but it would add a layer to raw data and make things more complicated.

I agree I don't see better/ideal solutions at the moment, but we should discuss how things could be handled here. From my side, my main concern is not how the final translatable raw structure will look like but rather how that structure will be generated.

For instance, you could consider consuming a REST API where the language is selected in the URL: you would use a vertical aggregator with as many source as languages you support. Each source provides values for a given language.
Now, if you consider a TSV file (Excel-like file) where there is a "lang_code" column telling the language of the record. You could find multiple time the same ID but with a different value for its "lang_code" column. How to aggregate that? Or should we consider each translated record as independent, but resolve the record to use using both its identifier and its lang_code? It can be interesting to do so to only load the appropriate translation.

So far, ideally, I would prefer the system to be able to only load the appropriate translation, without querying multiple sources (or multiple time the same source). For REST clients, it would mean to select the appropriate URL according to the selected language, for TSV clients or an SQL client to filter on a language column, for a file system client maybe to select the directory scheme according to a language code, etc. I think language should be managed by the storage clients because they know how to filter by language. Otherwise, they would always return all the translations and it would not be efficient. Now, how could that be achieved? I believe there should be a language parameter in the storage client ::load()/::loadMultiple() methods but it's an API change (fortunately, we're still in beta) or a service could be used to get the language instead? What if we want to edit and save another language while the current language is different? I need to think more about it but if you have ideas, please share! :)

What I would prefer to get is example 3:

Query storage client to get entity 42 in "en":

[ 'id' => 42, 'title' => 'English title', 'somefield' => 'other text', 'someotherfield' => 501, ]

Query storage client to get entity 42 in "fr":

[ 'id' => 42, 'title' => 'Titre français', 'somefield' => 'autre texte', 'someotherfield' => 501, ]
Comment 3 months ago →
🇫🇷France guignonv Montpellier
Here is my new proposal to manage languages:

It will be managed by ExternalEntityType class. On the external entity type edit form, there will be a new horizontal tab "Language settings", just below the "Storage" tab. On that tab, the user will be able to select a language and it will display current storage configuration form with language specific overrides. In other words, we would have a base storage config (data aggregator config) with possible overrides by languages. For instance, for a REST client, the REST service URLs could be overridden according to the language; for TSV clients, the source file could be changed or a filter could be added; for SQL clients, the queries could be adapted. Since it's the aggregator config that is overridden, it would also be possible to even change the storage client (ie. provide translations from another storage client). The idea would be to provide a checkbox to allow a "language" storage config override and highlight what is overridden (on client side through Javascript). Unchecking+re-checking the checkbox would reset "language" storage config for update. It will be possible to improve this user interface later.

On the run time, the config to load according to current language would be selected by the external entity type when ExternalEntityType::getDataAggregatorConfig() is called.

No change is required to storage clients (including base). No change is required to ExternalEntityStorage.
Modified: ExternalEntityType, ExternalEntityTypeForm and external entity type config schema (...and ExternalEntity for ->setTranslatable(TRUE) ;-) ).

That could be enough but we could get a step further to simplify user config management: we can take profit of the Token module recently added as a dependency to use it in storage clients where it could be pertinent. For instance, a REST service URL could use the token [language:langcode] and may not need to override storage config for any language.

What do you think?
Comment 3 months ago →
🇨🇦Canada colan Toronto 🇨🇦
Sounds reasonable.
Comment 2 months ago →
🇫🇷France guignonv Montpellier
UPDATE: I'll also include filed mapping override by language to support the use case of this issue: data in multiple languages.
I'll create a branch.
Comment 2 months ago →
🇫🇷France guignonv Montpellier
Investigation update:

Drupal ContentEntityBase expects to have translations stored in member ->translations[$langcode]['entity'].
Furthermore, translated field values need to be stored in ->values[$field_name][$langcode] (provided by the mapped raw data array to the constructor).

I want to support those use cases:
- data aggregator providing all the translated data at once in the raw data array (ie. not possible or not efficient to separate translations)
- data aggregator only return one (or a reduced set of) translated data but can fetch the requested translated data on demand
- fetched translated data may not be keyed using Drupal language codes

New approach, closer to @rp7 initial one:
- A new "Language settings" tab on external entity type configuration (ExternalEntityTypeForm) where:
- translations can be enabled/disabled
- for each non-default language, a checkbox to enable field mapping override and another one for storage (data aggregator) override
- for both overrides, the same sub-forms used for the default field mapping and storage settings. When override becomes checked, it will duplicate original settings and user will be able to completely change the settings (ie. remove a field mapping or even change the data aggregator for instance).
- on xntt loading, ExternalEntityStorage::extractEntityValuesFromRawData() will populate ->values[$field_name][$langcode] with the data it has
- to manage on-demand translation, ExternalEntity::getTranslation() will be overriden to fetch additional translations when needed
- to know if a translation is available for a given langcode, either their is a corresponding storage config override (meaning new data needs to be fetched) or a corresponding field mapping override (meaning data is already there but translated fields need to be mapped)

-This post may be updated later according to new investigations-
Comment about 2 months ago →
🇫🇷France guignonv Montpellier
I've added branch "3.0.x-translation-support" to main repos. Work in progress...
Comment about 1 month ago →
🇫🇷France guignonv Montpellier
I just committed (143a9d5e236ae1b6f9442b6e6711867427b2871b) a new working implementation for multi-lingual support on branch "3.0.x-translation-support".

Now, when you have external sources that provide multiple languages, you can override storage settings and field mapping accordingly. No need to have a specific raw structure keyed by Drupal language codes, and no need to use vertical aggregator with "language" type aggregation!

I've removed the "language" field mapping as it is automatically set according to the language overrides. However, I wonder if there is a use case where people would get external data without knowing their language and would need to force-map that field to an external value providing the language. I'll think more about it and may change things a little.

I'll need to try to write an update to turn vertical aggregators using the 'language' mode into language field mapping overrides.

I'll also need to add translation tests before asking for community tests and merging to core.
Comment about 1 month ago →
🇫🇷France guignonv Montpellier
Fixed by the new multi-lingual and translation support for external entities added since commit #a0d7c58e503c597242e5493971d44ee5375bd22e.
Comment about 1 month ago →
🇫🇷France guignonv Montpellier
Comment about 1 month ago →
🇫🇷France guignonv Montpellier
Quick translation support setup when a single request returns all the translations, each under a sub-key. Ex.:

[ 'id' => 42, 'en' => [ 'title' => 'some title', 'body' => 'some content', ], 'fr' => [ 'title' => 'un titre', 'body' => 'du contenu', ], ];

I assume your site has multi-lingual support enabled

Edit your external entity type and go to the new "Language settings" tab below the "Storage" tab

Check "Enable translations"

For each supported language, check "Override field mapping"

Now, for each mapped field, prefix your raw field name (or adapt your field path) with the corresponding language prefix.
For instance, a title field initially mapped to "en.title" should be mapped to "fr.title" for the French language field mapping override. If you know some external fields are not translated, you may keep the original mapping (for instance id remains "id"). You may also remove field mapping (ie. "Not mapped") of some fields if needed as well as map language-specific fields that are not mapped by default.

If you don't always have a translation for a given language, you may map its ID field property using the "Conditional mapping" property mapper and check if another translated field is set. For instance, you can check if the raw field "fr.title" is not empty to map the French ID field to the raw field "id". When the French title is not set or empty, the French ID field will not be mapped, resulting in a non-existent translation instance.
Issue was unassigned.
Status changed to Fixed 17 days ago2:29pm 31 July 2025
Comment 17 days ago →
System Message
Automatically closed - issue fixed for 2 weeks with no activity.

Make external entities translatable

Merge Requests

!57Make external entities translatable
Open

Comments & Activities

Make external entities translatable

Merge Requests

!57Make external entities translatableOpen

Comments & Activities

!57Make external entities translatable
Open