Optimize joins and table selection in SQL entity query implementation

echo "\nQuery 1:\n"; $ids = \Drupal::entityQuery('node') ->execute(); print_r($ids); echo "\nQuery 2:\n"; $ids = \Drupal::entityQuery('node') ->condition('title', 'First node') ->execute(); print_r($ids); echo "\nQuery 3:\n"; $ids = \Drupal::entityQuery('node') ->condition('uuid', '20478baa-64e4-4b01-bf68-5ea34e3db78b') ->execute(); print_r($ids); echo "\nQuery 4:\n"; $ids = \Drupal::entityQuery('node') ->condition('uid.entity.uid', 1) ->execute(); print_r($ids);

Query 1: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node} base_table Array ( [1] => 1 [2] => 2 [3] => 3 [4] => 4 ) Query 2: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node} base_table INNER JOIN {node_field_data} node_field_data ON node_field_data.nid = base_table.nid WHERE node_field_data.title LIKE :db_condition_placeholder_0 ESCAPE '\\' Array ( [3] => 3 ) Query 3: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node} base_table INNER JOIN {node} node ON node.nid = base_table.nid WHERE node.uuid LIKE :db_condition_placeholder_0 ESCAPE '\\' Array ( [1] => 1 ) Query 4: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node} base_table INNER JOIN {node_field_data} node_field_data ON node_field_data.nid = base_table.nid LEFT OUTER JOIN {users} users ON users.uid = node_field_data.uid INNER JOIN {users_field_data} users_field_data ON users_field_data.uid = users.uid WHERE users_field_data.uid = :db_condition_placeholder_0 Array ( [4] => 4 )

Query 1: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node_field_data} base_table Array ( [1] => 1 [2] => 2 [3] => 3 [4] => 4 ) Query 2: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node_field_data} base_table WHERE base_table.title LIKE :db_condition_placeholder_0 ESCAPE '\\' Array ( [3] => 3 ) Query 3: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node_field_data} base_table INNER JOIN {node} node ON node.nid = base_table.nid WHERE node.uuid LIKE :db_condition_placeholder_0 ESCAPE '\\' Array ( [1] => 1 ) Query 4: SELECT base_table.vid AS vid, base_table.nid AS nid FROM {node_field_data} base_table INNER JOIN {users_field_data} users_field_data ON users_field_data.uid = base_table.uid WHERE users_field_data.uid = :db_condition_placeholder_0 Array ( [4] => 4 )

Comments & Activities

Not all content is available!

It's likely this issue predates Contrib.social: some issue and comment data are missing.

Merge request !2378Issue #2875033: Optimize joins and table selection in SQL entity query implementation → (Open) created by rlmumford
Comment over 2 years ago →
🇺🇸United States smustgrave
Seems there are still some open questions to answer before review.

#39 and #41 should be answered (added to remaining tasks)
Comment about 2 years ago →
🇷🇺Russia Chi
Faced this issue with "single-table" entity type. Entity query joined base table to itself which caused bad performance. Patch #16 works well on Drupal 10.0.
Comment about 2 years ago →
🇷🇺Russia Chi
Patch #16 works well on Drupal 10.0.

Actually it does not. EFQ with entity references produces wrong SQL join. See comment #30.
Comment 10 months ago →
🇳🇱Netherlands spadxiii
We have been using the mr in #37 for a while, but there are some issues with it: when using multiple conditions on the same column in an entity-query, the same table is joined several times.

I've fixed this with another if-statement in the patch that checks if the table is already joined (with the same type).
Comment 10 months ago →
solideogloria
@spadxiii Please make the change to the merge request, rather than submitting a patch.
Comment 10 months ago →
🇳🇱Netherlands spadxiii
I seem to have attached the wrong patch. Here's the correct one, that works.
Comment 10 months ago →
🇳🇱Netherlands spadxiii
@solideogloria the mr is quite old and needs to be rebased :\

and when I push, I get an error that: remote: You are not allowed to push code to this project.
So I cannot update the mr.
Comment 10 months ago →
solideogloria
You have to click the "Get Push Access" button at the top of this page.
First commit to issue fork.
Pipeline finished with Failed
9 months ago
Total: 698s
#293174
Comment 9 months ago →
🇮🇳India arunkumark Coimbatore
arunkumark → changed the visibility of the branch 2875033-optimize-joins-and to hidden.
Comment 9 months ago →
🇮🇳India arunkumark Coimbatore
arunkumark → changed the visibility of the branch 2875033-optimize-joins-and to hidden.
Comment 9 months ago →
🇮🇳India arunkumark Coimbatore
Comment 9 months ago →
🇮🇳India arunkumark Coimbatore
Comment 9 months ago →
🇮🇳India mrinalini9 New Delhi
Hi,

I have tried to create MR for the changes mentioned in patch #48 but was unable to do so because the MR points to branch 9.5.x instead of 11.x. Also, I have tried to create a new branch from 11.x but getting the below error:

Thanks & Regards,
Mrinalini
Comment 9 months ago →
solideogloria
@mrinalini9 This should be helpful for you: https://www.drupal.org/docs/develop/git/using-gitlab-to-contribute-to-dr... →
First commit to issue fork.
Merge request !10783Optimize joins → (Open) created by ptmkenny
Comment 6 months ago →
🇯🇵Japan ptmkenny
To run the tests, I created an MR of patch #48.
Pipeline finished with Failed
6 months ago
Total: 481s
#384838
Comment 6 months ago →
🇫🇷France Nixou Toulon
Thanks for this !

Attach is the patch from #48 (2875033-46.patch) rerolled for Drupal 10.3.x and 10.4.x
Comment 6 months ago →
solideogloria
The changes need to be applied to the merge request.
Comment 4 months ago →
🇺🇸United States pwolanin
patch #61 is failing for my colleague when filtering with jsonapi on the value of a referenced entity referenced by the main entity.

It's writing the WHERE clause such that it's filtering the main entity to the node ID of the referenced entity.

example:

main entity house.

house references a city node

city references a state node

if I filter houses in jsonapi by state, the SQL where clause is filtering the house node ID by the state node ID.
Comment 3 months ago →
🇨🇦Canada Charlie ChX Negyesi 🍁Canada
This issue and 📌 \Drupal\Core\Entity\Query\Sql\Tables causes extremely poor performance when using MariaDB and filtering on multiple relationships in JSON:API Active IMO needs to be consolidated.
Comment 3 months ago →
solideogloria
Comment 29 days ago →
🇺🇦Ukraine HitchShock Ukraine
Hi all.
First of all, I want to thank everyone who is working on this task. It solves the performance problem for big data entities in certain cases.

But I also found a possible way to make it better for some scenarios.
We can remove `$type === 'INNER'` from the condition.
If the table is the same, then it doesn't matter which type of join is used. Anyway, the same table will be used.
Removing this condition can be useful for big data queries with base fields of the entity, which are stored in the same table if the data table is the same as the base table for an entity.

For example,
- we have a `custom_entity`
- we send a simple entity query to get IDs sorted by uuid
- the default query will be

SELECT base_table.id AS id, base_table.id AS base_table_id, custom_entity.uuid AS uuid FROM custom_entity base_table LEFT JOIN custom_entity custom_entity ON custom_entity.id = base_table.id ORDER BY custom_entity.uuid ASC LIMIT 20 OFFSET 0
What is the problem? If custom_entity is a big data entity, then we are trying to join big data to big data, which will take much more time than without 'join'. This impacts performance a lot

If we remove `$type === 'INNER'` part of the condition, the issue will be solved, because the query will be generated like

SELECT base_table.id AS id, base_table.id AS base_table_id, base_table.uuid AS uuid FROM custom_entity base_table ORDER BY base_table.uuid ASC LIMIT 20 OFFSET 0
I added a hidden patch with this fix.

P.S. Please let me know if my opinion is correct or if it has obvious flaws in the context of Drupal core

Optimize joins and table selection in SQL entity query implementation

Problem/Motivation

Proposed resolution

Remaining tasks

User interface changes

API changes

Data model changes

Merge Requests

!10783Optimize joins and table selection in SQL entity query implementation
Open

!2378Optimize joins and table selection in SQL entity query implementation
Open

Comments & Activities

Optimize joins and table selection in SQL entity query implementation

Problem/Motivation

Proposed resolution

Remaining tasks

User interface changes

API changes

Data model changes

Merge Requests

!10783Optimize joins and table selection in SQL entity query implementationOpen

!2378Optimize joins and table selection in SQL entity query implementationOpen

Comments & Activities

!10783Optimize joins and table selection in SQL entity query implementation
Open

!2378Optimize joins and table selection in SQL entity query implementation
Open