sync might be too strict during id comparison; rolls back everything

Comment over 2 years ago →
🇩🇪Germany berliner
As mentioned by @byrond, this seems to have been fixed in 6.0.x by this commit in "sync option doesnt work with track_changes" (Fixed).
Comment about 2 years ago →
🇫🇷France pacproduct
Version 6.0.1 does seem to fix the issue as long as the ids' "type" property is set correctly.

In my case migrate_tools was rolling back all my items because the source ID was an integer, and I had defined:

ids:
  source_id:
    type: string
    max_length: 100

Fixing my configuration to type "integer" fixed everything:

ids:
  source_id:
    type: integer

I overlooked it in the first place, so I'm wondering if this subtlety is documented somewhere?
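
To illustrate why the declared type matters, here is a minimal Python sketch. It is not the module's PHP code. It assumes the sync step compares stored IDs against source IDs strictly, as the issue title suggests; the function name `ids_to_roll_back` and the sample data are hypothetical.

```python
def ids_to_roll_back(mapped_ids, source_ids):
    """Return mapped IDs that a strict comparison fails to find in the source."""
    # Python's `in` never treats 1 and "1" as equal, which mimics a strict check.
    return [mapped for mapped in mapped_ids if mapped not in source_ids]


# IDs kept as strings because the configuration declared `type: string`.
mapped_as_strings = ["1", "2", "3"]
# The source actually yields integers.
source_as_ints = [1, 2, 3]

# Every item looks "missing from the source", so everything would be rolled back.
mismatched = ids_to_roll_back(mapped_as_strings, source_as_ints)

# With matching types nothing is flagged for rollback.
matched = ids_to_roll_back([1, 2, 3], source_as_ints)
```

With the types aligned, `matched` is empty, which corresponds to the behaviour described above after switching the configuration to `type: integer`.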
Core: 9.5.x + Environment: PHP 8.0 & MySQL 5.7
last update about 2 years ago
Patch Failed to Apply
Comment about 2 years ago →
🇮🇹Italy robertom
Hi, sorry for my bad English.

This issue is still valid.

I have many d7 to d9 migrations suffering from this because, for example, d7_taxonomy_term_entity_translation states that entity_id will be integer, but the data coming from the db returns it as a string.

The approach used in patch #4/#17 might be incorrect because, if the id is an md5, the conversion might assume it's a numeric value when it doesn't contain letters and the strict in_array check will fail.

Attached a "workaround" that enforces the type of $source_id_values as stated by $source->getIds().
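
The idea behind this workaround can be sketched in Python as follows. This is not the attached patch, only an illustration of casting each ID value to its declared type rather than guessing from the value's content. The names `CASTERS` and `enforce_id_types` are hypothetical, and the definitions dictionary only mimics the shape of what `$source->getIds()` is described as returning.

```python
# Map declared ID types to Python casts (only the two types discussed here).
CASTERS = {"integer": int, "string": str}


def enforce_id_types(id_values, id_definitions):
    """Cast each source ID value to the type declared for its key."""
    typed = {}
    for key, value in id_values.items():
        declared = id_definitions[key]["type"]
        typed[key] = CASTERS[declared](value)
    return typed


# The database returns "42" as a string, but the migration declares an integer.
term_ids = enforce_id_types({"entity_id": "42"}, {"entity_id": {"type": "integer"}})

# An md5-like hash made only of digits stays a string because it is declared
# as one; guessing from the content would have turned it into a number.
hash_ids = enforce_id_types(
    {"hash": "12345678901234567890123456789012"},
    {"hash": {"type": "string"}},
)
```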
Core: 9.5.x + Environment: PHP 8.0 & MySQL 5.7
last update about 2 years ago
Patch Failed to Apply
Comment about 2 years ago →
🇮🇹Italy robertom
sorry, there is no point in checking repeatedly...

new patch attached
Core: 9.5.x + Environment: PHP 8.1 & MySQL 8
last update about 2 years ago
36 pass
Core: 10.1.x + Environment: PHP 8.1 & MySQL 8
last update about 2 years ago
29 pass, 3 fail
Comment about 2 years ago →
🇮🇹Italy robertom
Even with patch #21 I get some unwanted rollbacks when I run a migration group with the --group=... option.

The problem is due to the way the data is stored in the state variable. We need to add the migration id as the key of the array.

Attached a proposed patch and interdiff with #21
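
A minimal Python sketch of the keying idea, assuming the state is a single shared structure that several migrations write to during one grouped run. The names `record_source_id`, `sync_state`, and the migration IDs are hypothetical and do not come from the patch.

```python
def record_source_id(state, migration_id, source_id):
    """Store a source ID under its own migration so groups do not mix IDs."""
    state.setdefault(migration_id, []).append(source_id)


sync_state = {}
record_source_id(sync_state, "articles", 1)
record_source_id(sync_state, "articles", 2)
record_source_id(sync_state, "tags", 1)

# Each migration only sees the IDs collected from its own source.
article_ids = sync_state["articles"]
tag_ids = sync_state["tags"]
```

With a flat, unkeyed list the IDs of `articles` and `tags` would end up in the same collection, so one migration's sync check could be evaluated against another migration's source IDs.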
Comment about 2 years ago →
System Message
The last submitted patch, 22: sync-id-hash-3104268-22.patch, failed testing. View results →
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.
Core: 9.5.x + Environment: PHP 8.1 & MySQL 8
last update about 2 years ago
36 pass
First commit to issue fork.
Merge request !40 Issue #3104268: Factor into method, clear source state every time → (Closed) created by jamsilver
Core: 10.1.x + Environment: PHP 8.1 & MySQL 8
last update about 2 years ago
29 pass, 3 fail
Comment about 2 years ago →
🇬🇧United Kingdom jamsilver West Midlands, UK
I had a situation with a unit test that failed because it executed the same migration twice in the same page request. It wasn't enough for migrate_tools_sync state to be reset in the constructor.

I've created a Merge Request that adds the following changes to the patch in 22 (interdiff attached):

Reset migrate_tools_sync at the point of scanning through source rows to collect the IDs. This fixes my unit test

Factor that bit of code to a new method for readability.

I've also raised this as a Merge Request, rather than as a patch, since that is the workflow in place on this ticket originally. I raised a new one because things seem to have moved along quite a lot since the original was raised.

Tests fail, but the first error I'm seeing looks unrelated to my change. Is there a problem with the tests at the moment?
Comment about 2 years ago →
🇷🇴Romania bbu23
Unfortunately the MR from #26 cannot be applied to the latest version of the module.
Comment about 2 years ago →
🇨🇦Canada jigarius Montréal
While testing this behavior, I realized that it makes many, many write queries on the key_value table. This in turn results in huge MySQL bin logs which makes the server run out of memory at some point. Is anyone else facing this issue? Is it safe to say that there is an efficiency/memory optimization issue with the --sync option?
Status changed to Needs work about 2 years ago, 2:27pm 17 July 2023
Comment about 2 years ago →
heddn Nicaragua
I'm afraid that the change in #26 to move the set into the point of scanning might have led to #28. Can we revert that change please?
Pipeline finished with Failed
almost 2 years ago
#36787
First commit to issue fork.
Pipeline finished with Failed
almost 2 years ago
#36788
Core: 10.1.x + Environment: PHP 8.1 & MySQL 8
last update almost 2 years ago
36 pass
Status changed to Needs review almost 2 years ago, 3:34pm 23 October 2023
Comment almost 2 years ago →
🇬🇧United Kingdom scott_euser
I updated the MR with a merge from 6.0.x and created a separate issue for the pipeline failure: https://www.drupal.org/project/migrate_tools/issues/3396130 ("Switch GitLab CI configuration to the template developed by the DA", Needs review).
Comment over 1 year ago →
🇷🇴Romania bbu23
Thank you, tested the patch from MR !40. It seems to have fixed the rolling back everything situation for a migration that uses track_changes. Though the migration is much slower now, quite a difference. Track changes works pretty fast when not used in combination with sync. I'm not sure if this can be improved or not.
Comment over 1 year ago →
🇫🇷France pacproduct
@bbu23 Could you be facing the issue referenced in https://www.drupal.org/project/migrate_tools/issues/3378047 ("drush migratation with --sync has suboptimal performance (v6)", Active)?

More generally, I have the feeling that this thread should take into consideration the other thread I posted above, because the way migrate_tools_migrate_prepare_row currently works gets exponentially slower with time and should be fixed first, in my opinion.
Comment over 1 year ago →
🇷🇴Romania bbu23
@pacproduct Yes, thank you, that could be it! One of my migrations took hours with sync option on.

So what is your suggestion? Should I try the patch from the referenced issue that has a dedicated new table to sync or do I need to combine both patches?
Also, is your comment #19 from current issue still valid? I feel that it's a bit strange to have to manually specify a type for each migration. Most of the migrations have the ID coming in as string, that means modifications everywhere. Thanks
Comment over 1 year ago →
🇫🇷France pacproduct
@bbu23 I believe my comment in #19 is still valid, but the best thing is to try it to be sure.

I feel like the global tendency in programming is to favor strict typing more and more as it tends to reduce issues and bugs.
Thus personally, I'm kind of okay with the idea of configuring what is the type of data you're expecting from the source.
Although maybe `migrate_tools` could warn you when the type of a source ID does not match the configured type, so you know something is dubious, instead of converting it silently, which causes it to roll back everything later...

Not everyone's situation is the same, and there might be cases where strict source ID typing wouldn't fit one's needs. Off the top of my head, the only problematic case I can think of is a source system that returns a mix of different types as the primary ID (a mix of strings and ints, for instance). In that particular case, having a "loose type" for source IDs could be useful.
Hard for me to say which approach is better...
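
The warning idea could look roughly like the following Python sketch. It is purely illustrative: `migrate_tools` is not known to offer such a check, and the names `PYTHON_TYPES` and `check_id_type` are hypothetical.

```python
import warnings

# Declared ID types mapped to the Python types used in this illustration.
PYTHON_TYPES = {"integer": int, "string": str}


def check_id_type(key, value, declared_type):
    """Warn, rather than convert silently, when a source ID has the wrong type."""
    expected = PYTHON_TYPES[declared_type]
    # bool is a subclass of int in Python, so exclude it explicitly.
    matches = isinstance(value, expected) and not isinstance(value, bool)
    if not matches:
        warnings.warn(
            f"Source ID '{key}' is {type(value).__name__}, "
            f"but the configuration declares {declared_type}."
        )
    return matches


# Matching type: no warning is emitted.
string_ok = check_id_type("source_id", "5", "string")
# Mismatch: a warning is emitted and the value is left untouched.
int_as_string = check_id_type("source_id", 5, "string")
```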

-- --

With regard to your case: if properly setting the source ID types in your migration configurations solves your rollback problem without patching `migrate_tools`, that feels like a quick and healthy fix in my opinion. In that case, you may apply the patch from the referenced issue if it fixes your performance problem.
Otherwise, you may have to merge the two patches together. Not sure how tricky this is going to be. YMMV.
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
scott_euser → changed the visibility of the branch 3104268-sync-might-be to hidden.
Pipeline finished with Failed
9 months ago
#335990
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
Things have changed too much to safely just resolve conflicts. This will need a more in-depth review and re-application of the MR.
Pipeline finished with Failed
9 months ago
#337084
Merge request !74 Resolve #3104268 "Sync id too strict2" → (Closed) created by scott_euser
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
scott_euser → changed the visibility of the branch 3104268-sync-might-be3 to hidden.
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
scott_euser → changed the visibility of the branch 3104268-sync-id-too-strict to hidden.
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
Pipeline finished with Success
9 months ago
Total: 229s
#337156
Pipeline finished with Failed
9 months ago
Total: 216s
#337253
Pipeline finished with Success
9 months ago
Total: 217s
#337257
Comment 9 months ago →
🇬🇧United Kingdom scott_euser

Updated code to work with new code from "drush migratation with --sync has suboptimal performance (v6)" (Active)

Added an example source to the test coverage

Added additional test coverage method to prove the issue and demonstrate the fix: test only fails.

Updated issue summary to standard template with details and clear steps to reproduce
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
Pipeline finished with Success
9 months ago
Total: 392s
#337259
Comment 9 months ago →
🇧🇪Belgium Jonasanne
Added a diff from merge request for ease of use as patch.

Thank you @scott_euser for updating this.
Comment 9 months ago →
🇧🇪Belgium Jonasanne
I made a small mistake with the previous patch.
Good morning :D
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
Hiding patch; you should download the patch locally as per https://www.drupal.org/docs/develop/git/using-gitlab-to-contribute-to-dr... This avoids confusion for maintainers and further development as to what to work on.

Thanks!
Comment 9 months ago →
🇬🇧United Kingdom scott_euser
What would be useful though is RTBC'ing it if you have reviewed (+explaining what you did to test/review). Thanks!
Comment 8 months ago →
🇺🇦Ukraine nnevill Lutsk
Looks good and works!
Thanks!
Comment 7 months ago →
🇺🇸United States nicxvan
Does not apply
Merge request !77 Resolve #3104268 "Sync id too strict3" → (Merged) created by nicxvan
Comment 7 months ago →
🇺🇸United States nicxvan
nicxvan → changed the visibility of the branch 3104268-sync-id-too-strict2 to hidden.
Comment 7 months ago →
🇺🇸United States nicxvan
nicxvan → changed the visibility of the branch 6.0.x to hidden.
Comment 7 months ago →
🇺🇸United States nicxvan
Rebased on 6.0.x
Pipeline finished with Failed
7 months ago
Total: 508s
#391268
Comment 7 months ago →
🇬🇧United Kingdom scott_euser
Thanks for sorting out the conflict! Still working well! Phpcs errors are unrelated and due to changes in standards I believe, will need a separate issue to solve that, but not a blocker here.
Comment 7 months ago →
🇬🇧United Kingdom scott_euser
Created a separate issue for phpcs: "PHPCS fixes for MigrateToolsCommands" (Active)
Comment 7 months ago →
🇬🇧United Kingdom scott_euser
Status changed to RTBC 5 months ago, 2:11pm 13 February 2025
Comment 5 months ago →
🇺🇸United States sonfd Portland, ME
Here's a patch that applies against 6.0.5 (does not include tests)
Comment 5 months ago →
🇺🇸United States mikelutz Michigan, USA
Pipeline finished with Success
5 months ago
Total: 235s
#426211
Pipeline finished with Failed
5 months ago
Total: 263s
#426270
Pipeline finished with Canceled
5 months ago
Total: 89s
#426286
Pipeline finished with Failed
5 months ago
Total: 215s
#426288
Comment 5 months ago →
🇬🇧United Kingdom scott_euser

Added change record: https://www.drupal.org/node/3507073

Created follow-up to remove deprecation: https://www.drupal.org/project/migrate_tools/issues/3507077 ("Remove deprecation trigger at MigrateTools::addToSyncSourceIds in 7.x", Postponed)
Pipeline finished with Failed
5 months ago
Total: 225s
#426299
Pipeline finished with Success
5 months ago
Total: 227s
#426304
Comment 5 months ago →
🇬🇧United Kingdom scott_euser
Okay tests for deprecation now passing

🇫🇷France MacSim

Tested it on a migration where I have bigint as source and string as destination since integer can't be the solution (SQLSTATE[22003]: Numeric value out of range: 1264 Out of range value for column 'sourceid1' at row 1)

Before patch:

vendor/bin/drush mim my_migration --sync
 [notice] Rolled back 74 items - done with 'my_migration'
 [notice] Processed 74 items (74 created, 0 updated, 0 failed, 0 ignored) - done with 'my_migration'

After patch:

vendor/bin/drush mim my_migration --sync
 [notice] No item has been rolled back - done with 'my_migration'
 [notice] Processed 1 item (0 created, 0 updated, 0 failed, 0 ignored) - done with 'my_migration'

Seems good to me
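
The out-of-range error above can be illustrated with a small Python sketch. It assumes that `type: integer` ends up in a 32-bit signed integer column for `sourceid1`, which this thread does not state explicitly; the helper names `fits_signed_int32` and `choose_id_type` are hypothetical.

```python
# Bounds of a 32-bit signed integer (assumed size of the ID map column).
INT32_MIN = -(2**31)
INT32_MAX = 2**31 - 1


def fits_signed_int32(value):
    """Return True when the value can be stored in a 32-bit signed column."""
    return INT32_MIN <= value <= INT32_MAX


def choose_id_type(sample_values):
    """Suggest `integer` only when every sampled ID fits; otherwise `string`."""
    if all(fits_signed_int32(value) for value in sample_values):
        return "integer"
    return "string"


small_ids = choose_id_type([1, 74, 2**31 - 1])
bigint_ids = choose_id_type([1, 9007199254740993])
```

Under that assumption, a bigint source ID forces the `string` type, which is the situation where the source values and the configured type differ and the strict comparison used to roll back every row.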

Comment 5 months ago →
🇫🇷France MacSim
Whoops, sorry for the status change; for me it's RTBC.
I'll leave it at "Needs review" until a few more people test it.
Comment 5 months ago →
🇺🇸United States jrockowitz Brooklyn, NY
Status changed to Needs review 3 months ago, 12:51am 24 April 2025
Comment 3 months ago →
🇺🇸United States ramzypro
I'm successfully using the patch from #63 on a sync from an external MySQL data source with no issues.
Comment 3 months ago →
🇬🇧United Kingdom scott_euser
Hiding patch to avoid confusion.

I can't change to RTBC since I've been doing the work, so the next step is for someone here in the community to confirm they've reviewed and tested, and to change the status so we can get this back in front of the maintainers now that feedback is resolved and test coverage is in place.
Comment 3 months ago →
🇺🇸United States ramzypro
Apologies; I had to read up more on Drupal etiquette about changing status - I have used the patch in #63 on large datasets without issue, many of them mixing string values that start with "0" which are impossible to handle correctly without it.
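
A short Python sketch of why string IDs with leading zeros cannot survive a numeric conversion. The sample values are hypothetical and only illustrate the kind of data described above.

```python
# Three distinct string IDs, two of which start with "0".
source_ids = ["007", "7", "0123"]

# Converting to integers drops the leading zeros.
as_integers = [int(value) for value in source_ids]

# Distinct IDs before and after the conversion.
distinct_strings = len(set(source_ids))
distinct_integers = len(set(as_integers))

# Converting back does not restore the original values either.
round_trip = [str(value) for value in as_integers]
```

Here "007" and "7" collapse into the same integer, so the IDs can only be tracked reliably when they are kept and compared as strings.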
Comment 3 months ago →
🇬🇧United Kingdom scott_euser
Sorry wasn't aimed at you necessarily, just the notification reminded me of this issue and wanted to comment what is needed to move forward :) Thanks for confirming RTBC in any case!
Pipeline finished with Success
3 months ago
Total: 397s
#487859
Pipeline finished with Skipped
23 days ago
#539468
Pipeline finished with Skipped
23 days ago
#539471
Comment 23 days ago →
System Message

heddn → committed 1264f3e3 on 6.0.x authored by nicxvan →
Issue #3104268 by scott_euser, heddn, robertom, ahebrank, nicxvan,...
Status changed to Fixed 23 days ago, 8:47pm 4 July 2025
Comment 23 days ago →
heddn Nicaragua
Hopefully I caught everyone who contributed to this issue. Thanks for all your work on it.
Comment 23 days ago →
🇬🇧United Kingdom scott_euser
Thank you!
Comment 19 days ago →
System Message
heddn → closed merge request !40
Comment 19 days ago →
System Message
heddn → closed merge request !74
Comment 5 days ago →
System Message
Automatically closed - issue fixed for 2 weeks with no activity.

sync might be too strict during id comparison; rolls back everything

Merge Requests

!74 sync might be too strict during id comparison; rolls back everything (Closed)

!40 sync might be too strict during id comparison; rolls back everything (Closed)

!77 sync might be too strict during id comparison; rolls back everything (Merged)