[random test failure] ImageStylesPathAndUrlTest

Issue created by @dww

This use of randomString() looks suspicious.

    $this->drupalGet(\Drupal::service('file_url_generator')->generateAbsoluteString($directory . '/' . $this->randomString()));

Merge request !10626Draft: DEBUG: #3487371 Debug failing test → (Open) created by donquixote
Pipeline finished with Success
7 months ago
Total: 3063s
#374161
Comment 7 months ago →
🇳🇱Netherlands spokje
Comment 7 months ago →
🇳🇱Netherlands spokje
spokje → changed the visibility of the branch 3487371-debug-failing-test to hidden.
Comment 7 months ago →
🇳🇱Netherlands spokje
spokje → changed the visibility of the branch 3487371-random-test-failure to hidden.
Merge request !10701Draft: Resolve #3487371 "2500x imagestylespathandurltest as is" → (Closed) created by spokje
Merge request !10702Draft: Resolve #3487371 "2500x imagestylespathandurltest key value" → (Closed) created by spokje
Pipeline finished with Canceled
7 months ago
Total: 99s
#379763
Pipeline finished with Canceled
7 months ago
Total: 3608s
#379726
Pipeline finished with Canceled
7 months ago
Total: 94s
#379768
Pipeline finished with Failed
7 months ago
Total: 3360s
#379767
Pipeline finished with Failed
7 months ago
Total: 3391s
#379773
Comment 7 months ago →
🇳🇱Netherlands spokje
MR !10701 (which is Drupal\Tests\image\Functional\ImageStylesPathAndUrlTest and some pipeline changes to make it, and only it, run 1750 times) gives us 13 fails out of the 1750 runs.

These 13 fails breaks down to:
2x testImageStyleUrlAndPathPrivateUnclean()
10x testImageStyleUrlAndPathPrivateLanguage()
1x testImageStyleUrlAndPathPrivate()

MR !10702 shows that changing \Drupal::state()->set()/get() to Drupal::keyValue()->set()/get(); as suggested in the Slack thread passes a 1750 times run without errors.
Merge request !10703Resolve #3487371 "Imagestylespathandurltest fix" → (Closed) created by spokje
Comment 7 months ago →
🇳🇱Netherlands spokje
MR!10703 contains the changes that should be committed after review.
Comment 7 months ago →
🇳🇱Netherlands spokje
Comment 7 months ago →
🇺🇸United States smustgrave
Based on the findings in #9, and I re-ran 10703 a few times and never got these random failures, when before I probably least got 1 each time.
Pipeline finished with Success
7 months ago
Total: 7096s
#379841
Comment 7 months ago →
🇺🇸United States dww
Fantastic work! Great find.

Agree that the changes to ImageStylesPathAndUrlTest look good. That much is indeed RTBC.

However, grep is finding more \Drupal::state() in core/modules/image/tests/src/* and I'm wondering if we want to expand the scope here to fix ImageStyleFlushTest and ImageEffectsTest while we're at it. I don't remember seeing those randomly fail as often, but it seems like we shouldn't just do this conversion when we happen to be nailed by the trouble. I'm in favor of preemptively fixing all the tests to stop using state(). So we should probably fix more than 1 test class per issue.
Comment 7 months ago →
🇳🇱Netherlands spokje
Strongly disagree with bundling more issues together, but feel free to add-n-fix whatever you like.

Problem here IMHO is where the scope-creep ends, also more tests to test 2500x, or at least as-many-times-fit-in-an-hour makes things very slow, distinct tests run lower and fail rate/ success prove drop.

Doubt that it's OK to just create an MR that changes state to keyvalue, without the above test-runs, since the mostly (very) low failure rate will probably mean a single run will pass the tests.

To me that would prove absolutely nothing about the need for the fix, but n=1, YMMV etc, etc :)

Let the powers-that-be decide on this one.
In the mean time I will continue to create single test fix issues, and surely not for the easy core credits, which I really don't need in the first place.
Comment 7 months ago →
🇺🇸United States nicxvan
In general I would agree with @dww's comment to expand, but here I think!@spokje's reasoning is sound.

Further there is a separate issue to address the race condition and anecdotally I've never seen this other tests fail.
Comment 7 months ago →
🇺🇸United States dww
I was thinking that if release managers are saying “we shouldn’t use state in tests, only keyvalue, that we could do bulk.

But yes, strongly appreciate the long time it takes to do even single tests carefully. Wouldn’t dream of accusing you of lots of issues for credit. You’re a prolific contributor!

I was only wondering if we could add more scope without as much effort and rigor if we want to convert entirely. It’s a bit weird having a mix of both, so I’m hoping to minimize the time it takes to convert.

All that said, yes, following the issue where the underlying race condition in State API is hopefully going to be fixed. So maybe it’s moot to do this conversation?

Yeah, I’m torn, I guess we’ll leave it for the core committers to decide. Back to RTBC.

Thanks!
-Derek
Comment 7 months ago →
🇩🇪Germany donquixote
I'd say this issue needs an explanation why the problem occurs with state, but not with keyvalue.
From the comments here I understand it is a race condition, and from looking at the code, I assume it is caused by the cache layer in state, because otherwise state is just a wrapper around keyvalue.

as suggested in the Slack thread

So, this should be in the issue summary..

I do find this issue from April 2024, which might be related:
🐛 [random test failures] Race condition in state when individual keys are set with an empty cache Fixed
Comment 7 months ago →
🇳🇱Netherlands spokje
@donquixote There's a problem with multiple issues: When you're working on a lot of them, you forget that other people didn't. :)

Here's the issue where the root cause is attacked, hope that helps?
🐛 Race conditions/bad lock logic in CacheCollector/State Active
Comment 7 months ago →
🇬🇧United Kingdom catch
Generally on the scope:

I would essentially commit any quick fix to ImageStylesPathAndUrlTest to stop it random failing including skipping it, because it is so annoying when it fails all the time.

Fixing it instead of skipping it is much better.

I do think we should switch all test usage from state to key/value because it massively simplifies things, however it's currently useful that we haven't done that because it's the only way I know to reproduce 🐛 Race conditions/bad lock logic in CacheCollector/State Active . So would be quite happy to commit individual fixes for the frequently random tests to make the pipelines more reliable, then handle other issues which aren't randomly failing (as often) together in one issue later.
Comment 7 months ago →
System Message

catch → committed 7031fb42 on 11.x
Issue #3487371 by spokje, dww: [random test failure]...
Comment 7 months ago →
🇬🇧United Kingdom catch
Committed/pushed to 11.x, thanks!

This doesn't apply to any previous branches including 11.1.x. I think it would be worth backporting to 10.5, 11.1, and 10.4 (11.0 and 10.3 if it's an easy cherry pick) so that we get less random failures on branch runs.
Comment 7 months ago →
System Message
catch → closed merge request !10703
Comment 7 months ago →
🇳🇱Netherlands spokje
spokje → changed the visibility of the branch 3487371-2500x-ImageStylesPathAndUrlTest-as-is to hidden.
Comment 7 months ago →
System Message
spokje → closed merge request !10702
Comment 7 months ago →
System Message
spokje → closed merge request !10701
Merge request !10718Resolve #3487371 "11.1.x" → (Closed) created by spokje
Comment 7 months ago →
🇳🇱Netherlands spokje
spokje → changed the visibility of the branch 11.x to hidden.
Pipeline finished with Success
7 months ago
Total: 871s
#380525
Comment 7 months ago →
🇳🇱Netherlands spokje
Not sure what either I or this issue has done to deserve waiingt 2:42 minutes for a full spellcheck on all files, but here we are...
Comment 7 months ago →
System Message

catch → committed 69a509f1 on 10.3.x
Issue #3487371 by spokje, dww: [random test failure]...
Comment 7 months ago →
🇬🇧United Kingdom catch
Cherry-picked back through to 10.3.x, thanks!
Comment 7 months ago →
System Message

catch → committed f8d2454b on 10.4.x
Issue #3487371 by spokje, dww: [random test failure]...
Comment 7 months ago →
System Message

catch → committed 87ab7b3c on 10.5.x
Issue #3487371 by spokje, dww: [random test failure]...
Comment 7 months ago →
System Message

catch → committed d5f2b47e on 11.0.x
Issue #3487371 by spokje, dww: [random test failure]...
Comment 7 months ago →
System Message

catch → committed d5b6fbbd on 11.1.x
Issue #3487371 by spokje, dww: [random test failure]...
Comment 7 months ago →
System Message
catch → closed merge request !10718
Comment 7 months ago →
🇳🇿New Zealand quietone
Updated the proposed resolution with the issue where the root cause was discussed.
Comment 6 months ago →
System Message
Automatically closed - issue fixed for 2 weeks with no activity.

[random test failure] ImageStylesPathAndUrlTest

Problem/Motivation

testImageStylePrivateWithConversion()

testImageStyleUrlAndPathPrivate()

Steps to reproduce

Proposed resolution

Remaining tasks

User interface changes

Introduced terminology

API changes

Data model changes

Release notes snippet

Merge Requests

!10718[random test failure] ImageStylesPathAndUrlTest
Closed

!10701[random test failure] ImageStylesPathAndUrlTest
Closed

!10702[random test failure] ImageStylesPathAndUrlTest
Closed

!10703[random test failure] ImageStylesPathAndUrlTest
Closed

!10626[random test failure] ImageStylesPathAndUrlTest
Open

Comments & Activities

[random test failure] ImageStylesPathAndUrlTest

Problem/Motivation

testImageStylePrivateWithConversion()

testImageStyleUrlAndPathPrivate()

Steps to reproduce

Proposed resolution

Remaining tasks

User interface changes

Introduced terminology

API changes

Data model changes

Release notes snippet

Merge Requests

!10718[random test failure] ImageStylesPathAndUrlTestClosed

!10701[random test failure] ImageStylesPathAndUrlTestClosed

!10702[random test failure] ImageStylesPathAndUrlTestClosed

!10703[random test failure] ImageStylesPathAndUrlTestClosed

!10626[random test failure] ImageStylesPathAndUrlTestOpen

Comments & Activities

!10718[random test failure] ImageStylesPathAndUrlTest
Closed

!10701[random test failure] ImageStylesPathAndUrlTest
Closed

!10702[random test failure] ImageStylesPathAndUrlTest
Closed

!10703[random test failure] ImageStylesPathAndUrlTest
Closed

!10626[random test failure] ImageStylesPathAndUrlTest
Open