Fix flaky test: testInstallUnlockMessage

Created on 12 February 2025, about 2 months ago

Problem/Motivation

The testInstallUnlockMessage seems to be flaky. It is often failing, and I think we're just not getting the JS message back in time.

Steps to reproduce

Post apply exception
     ✘ Install unlock message
       ┐
       ├ Failed asserting that two strings are identical.
       ┊ ---·Expected
       ┊ +++·Actual
       ┊ @@ @@
       ┊ -'The·process·for·adding·the·project·was·locked·1·hours,·15·minutes·ago.·Use·[+·unlock·link]·to·unlock·the·process.'
       ┊ +'The·process·for·adding·the·project·was·locked·55·minutes·ago.·It·should·not·be·unlocked·while·changes·are·being·applied·to·the·site.'
       │
       │ /builds/project/project_browser/tests/src/Functional/InstallerControllerTest.php:418
       │ /builds/project/project_browser/tests/src/Functional/InstallerControllerTest.php:466
       ┴
     ✔ Can break lock
     ✔ Can break stage with missing project browser lock
     ✔ Core module activate
    
    FAILURES!
    Tests: 16, Assertions: 193, Failures: 1.

Proposed resolution

...?

🐛 Bug report
Status

Active

Version

2.0

Component

Code

Created by

🇺🇸United States chrisfromredfin Portland, Maine

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Merge Requests

Comments & Activities

  • Issue created by @chrisfromredfin
  • First commit to issue fork.
  • First commit to issue fork.
  • 🇺🇸United States pfrilling Minster, OH

    I was able to consistently reproduce this issue locally about every third or fourth test run with

    ddev phpunit --filter=testInstallUnlockMessage

    When I deleted the `__timestamp` key/value, then it would consistently pass. I believe it's because of the ::getFirstUpdatedTime() method call in the ::begin method. Of course, I could be completely wrong.

  • Pipeline finished with Success
    about 1 month ago
    Total: 551s
    #430972
  • 🇺🇸United States phenaproxima Massachusetts

    Can you explain a bit more about why the __timestamp thing makes it fail? It's hard for me to grok how that's related, or why it would make the test fail only now and then, rather than every time.

  • 🇺🇸United States pfrilling Minster, OH

    To be honest, I'm not 100% sure... I started stepping through the InstallerController::begin() method and saw this line:

    $updated_time = $this->installState->getFirstUpdatedTime();
    

    Which is returning the `__timestamp` value from the key/value store:

    return $this->keyValue->get('__timestamp');
    

    The only place I found the `__timestamp` value being set was in InstallState::setState(). In this method, it is being set if not exists.

    $this->keyValue->setIfNotExists('__timestamp', $this->time->getRequestTime());
    

    So, my theory was that the `__timestamp` value was not getting cleared/reset during the test's call to ::setFakeTimeByOffset(), and causing the flaky behavior.

    Could it be that $this->keyValue->setIfNotExists should actually be $this->keyValue->set()?

  • 🇺🇸United States phenaproxima Massachusetts

    Yeah, this is a tricky one. I think we should probably figure out exactly what's causing the flakiness, so that we don't introduce possible fragility (or worse, more flakiness) by implementing a fix even we don't understand :) That said, the tracing of it to __timestamp is a really helpful lead.

  • 🇺🇸United States pfrilling Minster, OH

    Maybe I was over thinking it. If we're reliant on time later in the tests, we should probably mock the initial request time. This seems to be giving me consistent passing tests locally.

  • Pipeline finished with Success
    about 1 month ago
    Total: 369s
    #432862
  • 🇺🇸United States pfrilling Minster, OH

    I am still getting the failure locally, but it happens less frequently now. One out of every ten or fifteen times now. I think that request time needs to be set to a constant instead of relying on the modifier, which puts us at the mercy of the container's time.

  • 🇺🇸United States phenaproxima Massachusetts

    There's something else going on here, but I'm not sure what it is.

    It's not that the time needs to be constant; TestTime is already able to override the request time, and it does it in a way that works. The problem is that it doesn't work consistently. I suspect this has something to do with the reliance on state, which is a destructable cache collector.

    So I also suspect container wonkiness, but until I completely understand why, I'd rather hold off and take the time to figure out the real cause, rather than just try to find any fix that seems to work.

  • Pipeline finished with Failed
    about 1 month ago
    Total: 1014s
    #432908
  • Pipeline finished with Failed
    about 1 month ago
    Total: 1045s
    #432917
  • Pipeline finished with Failed
    about 1 month ago
    Total: 996s
    #432925
  • Pipeline finished with Failed
    about 1 month ago
    Total: 890s
    #432943
  • Pipeline finished with Canceled
    about 1 month ago
    Total: 228s
    #432952
  • 🇺🇸United States phenaproxima Massachusetts

    We should increase the scope of this one.

    The unlock behavior in InstallerController is a hard-to-parse mess, and is the only part of our tests that necessitates a date-time decorator. We could remove that (and move the time override into InstallerController specifically), which should sidestep this issue. While we're at it, we should refactor the unlock logic to make more sense, have better phrasing, and remove crufty messages that are never presented to the user.

    This does introduce a behavior change, but I think that's worth the increased clarity. Users don't care about "1 hour, 15 minutes"; "an hour ago" is sufficient, and line with what you see elsewhere on the internet.

  • 🇺🇸United States phenaproxima Massachusetts

    Self-assigning to ensure the flakiness is squashed.

  • Pipeline finished with Failed
    about 1 month ago
    Total: 1149s
    #432956
  • Pipeline finished with Failed
    about 1 month ago
    Total: 1630s
    #432965
  • 🇺🇸United States phenaproxima Massachusetts

    I pushed a commit that ran the faulty test 100 times. It passed every run in four different jobs:

    400 straight passes? I think we can safely say the flakiness is eliminated. State caching turned out to be the problem.

    Just needs final review and an issue summary update.

  • Pipeline finished with Success
    about 1 month ago
    Total: 363s
    #432987
  • 🇺🇸United States pfrilling Minster, OH

    Looks good to me!

  • 🇺🇸United States phenaproxima Massachusetts
  • Pipeline finished with Skipped
    about 1 month ago
    #433816
  • 🇺🇸United States chrisfromredfin Portland, Maine

    More than anything on this one, the comments are the real MVP.

  • Pipeline finished with Success
    about 1 month ago
    Total: 1562s
    #438567
  • Pipeline finished with Skipped
    about 1 month ago
    #438604
  • Automatically closed - issue fixed for 2 weeks with no activity.

Production build 0.71.5 2024