Spike: Explore storing component inputs in separate columns (aka field union)

Issue created by @larowlan

Comment about 2 months ago →

🇦🇺Australia larowlan 🇦🇺🏝.au GMT+10

Comment about 2 months ago →

🇦🇺Australia larowlan 🇦🇺🏝.au GMT+10

Comment about 2 months ago →

🇧🇪Belgium wim leers Ghent 🇧🇪🇪🇺

This potential direction is why I prioritized 📌 [later phase] Support matching `{type: array, …}` prop shapes Postponed and got it finished. Because I have a hard time seeing how this work with multi-value fields. Especially because of #3052670: Support multi-valued "field union"s → .

So, to avoid us adopting this and potentially losing multi-value support, I made sure 📌 [later phase] Support matching `{type: array, …}` prop shapes Postponed was working, and proves that multi-value scalars (type: array, items: { type: integer } — see the sparkline test SDC) and multi-value object shapes (see the image-gallery test SDC) can work in the current architecture.

(I'm not fundamentally opposed to this — just concerned we'd forget about that, and now we can't! 👍)

Related: I tried to push #3467890 forward and assigned it to you at #3467890-13: [later phase] Support `{type: object, …}` prop shapes with single level that require *multiple* field types: use `field_union`? — OUT OF SCOPE: nested components/component reuse → for feedback, @larowlan 😄

Comment about 2 months ago →

🇧🇪Belgium wim leers Ghent 🇧🇪🇪🇺

So for example if a block changes its settings, we have to loop over every revision and search for components and then update the whole blob.

Indeed. And for that, we have 📌 [SPIKE] Prove that it's possible to apply block settings update paths to stored XB component trees Active .

Comment about 2 months ago →

🇧🇪Belgium wim leers Ghent 🇧🇪🇪🇺

we would have one table per component version (set of fields)

Try to do it in a storage layer that supports one table per component (version), not one per component version per entity type (as is the case with fields in core)

🤯 That could easily be hundreds of DB tables: a site can easily have a 100 components (the issue summary assumes 50), and for each of those multiple versions. (Note that "version" here is a massively overloaded term — there can be very different reasons. See #3523841-6: Version component prop definitions for SDC and Code components → .)

⚠️ Concern: this would make 📌 [later phase] When the field type for a PropShape changes, the Content Creator must be able to upgrade Postponed much harder. What if a site with a million existing revisions decides to implement hook_storage_prop_shape_alter() to change the field type for a prop of an SDC that is present in all of them (to improve the authoring experience, or to switch from plain images to Media Library or $REASON).
This architecture would require rows to be removed from one table and moved into another!

Although I think it could be argued that that would be much clearer. It'd also allow dropping tables for older "component versions" that don't have any remaining rows anymore, and would also allow removing the corresponding entries in the Component config entity that 📌 Version component prop definitions for SDC and Code components Active would've added.

🤔 Not sure yet, but for sure interesting 😄

Explore what views integration would look like

I don't see yet how that'd be meaningful. Views lists things of the same type in a single list/grid/table/…. But here those same things (instances of the same component version) are spread across many entities and bear no relation to one another. Unless you're thinking about listing all the different component instances of a single entity? Or something else still? But listing the first or fifth or Nth instance of some component still is not meaningful?

I struggle to follow your thinking here 😇

Explore what this would like for e.g. Block settings that whilst modeled using config schema (and therefore typed data) are arbitrary in shape and would traditionally be stored in a serialized column

I'm really curious about this part 🧐

Comment about 2 months ago →

🇬🇧United Kingdom catch

@Wim 📌 [PP-1] Consider not storing the ComponentTreeStructure data type as a JSON blob Postponed is (I think, still catching up on the latest issues a bit) a row-per-component with a single JSON column for the values in a single table, so it would be mutually exclusive with this issue.

For me, having multiple tables, or multiple rows for a single delta, feels like it would be incredibly complex both from the point of view of having to adapt all SQL storage backends to support it, and also for views integration.

However row-per-component with a JSON column would simplify dependency checking, updates, potentially things like revision compression etc. and might well be useful for 🌱 [META] Support alternative renderings of prop data added for the 'full' view mode such as for search indexing or newsletters Active too. Views integration feels like a very low priority because the data is arbitrary as you say.

I have on occasion added listing filters with CONTAINS on the body field or similar on sites that otherwise don't use the search module, when the dataset is small enough that it won't kill the database. There might be the odd case like that but don't think there will be many.

I could see wanting to list entities that are using component x - that would be easy to do with row-per-component because it doesn't rely on the values. e.g. you could list all articles that have an image gallery in them, things like that.

A JSON column would make views integration (at least for the values if not other things like component) dependent on ✨ Add "json" as core data type Active , but that feels like a reasonable limitation to me. No matter how complicated it might be, it is almost going to be less complicated than views integration for the current JSON blob with everything in it, and it might even be less complicated than supporting a fully relational schema here.

So for me personally, I would postpone this issue on 📌 [PP-1] Consider not storing the ComponentTreeStructure data type as a JSON blob Postponed , and if that one works out, then this might not be very necessary to explore.

Comment about 2 months ago →

🇧🇪Belgium wim leers Ghent 🇧🇪🇪🇺

Agreed! Reflecting that in the issue metadata 👍

Comment about 1 month ago →

🇧🇪Belgium wim leers Ghent 🇧🇪🇪🇺

📌 [PP-1] Consider not storing the ComponentTreeStructure data type as a JSON blob Postponed is in.

📌 Version component prop definitions for SDC and Code components Active is actively being worked on and will pave the path for this.

So keeping the issue status the same. 👍

Comment about 1 month ago →

🇧🇪Belgium wim leers Ghent 🇧🇪🇪🇺

I'm working to update 🌱 [META] Production-ready data storage Active to be comprehensive. But this isn't linked yet from there. So I needed to dig deeper than #9.

So on second thought, I wondered how this was still relevant after 📌 [PP-1] Consider not storing the ComponentTreeStructure data type as a JSON blob Postponed . @catch wrote in #7:

#3468272 is (I think, still catching up on the latest issues a bit) a row-per-component with a single JSON column for the values in a single table, so it would be mutually exclusive with this issue.

+

So for me personally, I would postpone this issue on #3468272, and if that one works out, then this might not be very necessary to explore.

Beautiful. That's exactly what I think. And the "field union metadata" aspects of this proposed spike are actually covered by 📌 Version component prop definitions for SDC and Code components Active , as I wrote in #9.

So: closing :)

Spike: Explore storing component inputs in separate columns (aka field union)

Problem/Motivation

Proposed resolution

User interface changes

Comments & Activities