drupal_phpunit_find_extension_directories() uses infinite recursion โ‡’ more directories = slower tests

Created on 15 November 2021, over 3 years ago
Updated 30 January 2023, about 2 years ago

Problem/Motivation

Fresh D9 git clone, with only composer install and a core/phpunit.xml added:

$ time vendor/bin/phpunit --configuration core/phpunit.xml core/modules/ckeditor5/tests/src/Kernel/ValidatorsTest.php 
PHPUnit 8.5.21 by Sebastian Bergmann and contributors.

Testing Drupal\Tests\ckeditor5\Kernel\ValidatorsTest
..............................                                    30 / 30 (100%)

Time: 20.33 seconds, Memory: 8.00 MB

OK (30 tests, 50 assertions)

real	0m20.421s
user	0m8.002s
sys	0m11.805s

versus in my dev environment, where I've got 57 contrib modules installed, with most of them git clones (for development), it takes 3.6 times longer:

$ time vendor/bin/phpunit --configuration core/phpunit.xml core/modules/ckeditor5/tests/src/Kernel/ValidatorsTest.php
PHPUnit 8.5.21 by Sebastian Bergmann and contributors.

Testing Drupal\Tests\ckeditor5\Kernel\ValidatorsTest
..............................                                    30 / 30 (100%)

Time: 1.2 minutes, Memory: 8.00 MB

OK (30 tests, 50 assertions)

real	1m12.087s
user	0m17.227s
sys	0m53.724s

Steps to reproduce

  1. Install many modules, preferably using git clone
  2. Do development on them, and have a few modules with a node_modules directory (find them using find . -name 'node_modules' -type d -prune from your Druapl root)

Proposed resolution

  1. drupal_phpunit_contrib_extension_directory_roots() lists every directory that could contain Drupal extensions. Each of those is passed to drupal_phpunit_find_extension_directories(), which scans them recursively โ€ฆ INFINITELY!.

    Ironically, @sun warned all of us about this 7 years ago: https://www.php.net/manual/en/class.recursivedirectoryiterator.php#114504

    Search results only turned up ๐Ÿ“Œ Improve performance of functional tests by caching Drupal installations Needs work โ€” but that's about speeding up the DB installation part of functional tests. This is about discovering the tests in the first place.

    Fixing this brings it down to:

    $ time vendor/bin/phpunit --configuration core/phpunit.xml core/modules/ckeditor5/tests/src/Kernel/ValidatorsTest.php
    PHPUnit 8.5.21 by Sebastian Bergmann and contributors.
    
    Testing Drupal\Tests\ckeditor5\Kernel\ValidatorsTest
    ..............................                                    30 / 30 (100%)
    
    Time: 56 seconds, Memory: 8.00 MB
    
    OK (30 tests, 50 assertions)
    
    real	0m56.051s
    user	0m15.277s
    sys	0m40.484s
    

    Now it's "only" 2.8 times slower.

  2. This still leaves ~36 seconds performance delta. Still unacceptable. There must be some directory scanning somewhere else.

    Found root cause; fixed in #5, cleaned up in #7. Now only ~2 times slower.

  3. The remaining ~20 second performance delta can only be optimized away in one of two ways: a new caching mechanism (do we really want that in tests?) or tweaking Drupal's PHPUnit bootstrap.php to not construct a fresh autoloader on every run.

    Why does this matter? Because there are two factors that make this very noticeable in day-to-day (core) development:

    • Worse: drupal_phpunit_populate_class_loader() is called every single time you run tests. So it does all this work every time you run tests ๐Ÿ™ƒ
    • Worst: as part of debugging this, I discovered that for each returned value by a @dataProvider, PHPUnit ends up calling drupal_phpunit_populate_class_loader() again. So one test class with a single test method will call it once, but if that single test has a data provider with 10 cases, then the class loader will be computed eleven times. โ†’ this part is not addressed in this issue, but it makes the impact of this improvement bigger stillโ€ฆ

    I tried two dead-simple caching approaches in #8. First using APCu: it gets stored correctly, but gets wiped immediately. Not sure by what yet. But it's probably going to be hard to get rid of. Second using mtime of the Drupal root. While not perfect, this would allow caching the autoloader (which BTW only needs to be changed when modules are added or removed from disk!), and you'd only have to do touch . in the Drupal root (we could make it listen in more places) to get it to rebuild it.

    Still, this seems to add too much risk and contentiousness to the proposed patch, which now hopefully remains RTBC'able.

๐Ÿค“ Bonus: I suspect this will also speed up DrupalCIโ€ฆ

โ†’ turned out to be possible but uncertain โ€” see #25 and #28.

Remaining tasks

None.

User interface changes

None.

API changes

API addition: \Drupal\Core\Extension\Discovery\RecursiveGenericExtensionFilterIterator extracted from \Drupal\Core\Extension\Discovery\RecursiveExtensionFilterIterator, to keep the complexity of the code for low-level test infrastructure to a minimum. The pre-existing RecursiveExtensionFilterIterator now extends the simpler/more generic RecursiveGenericExtensionFilterIterator.

Data model changes

None.

Release notes snippet

TBD

๐Ÿ› Bug report
Status

Needs work

Version

10.1 โœจ

Component
PHPUnitย  โ†’

Last updated about 9 hours ago

Created by

๐Ÿ‡ง๐Ÿ‡ชBelgium wim leers Ghent ๐Ÿ‡ง๐Ÿ‡ช๐Ÿ‡ช๐Ÿ‡บ

Live updates comments and jobs are added and updated live.
  • Performance

    It affects performance. It is often combined with the Needs profiling tag.

Sign in to follow issues

Comments & Activities

Not all content is available!

It's likely this issue predates Contrib.social: some issue and comment data are missing.

  • The Needs Review Queue Bot โ†’ tested this issue. It either no longer applies to Drupal core, or fails the Drupal core commit checks. Therefore, this issue status is now "Needs work".

    Apart from a re-roll or rebase, this issue may need more work to address feedback in the issue or MR comments. To progress an issue, incorporate this feedback as part of the process of updating the issue. This helps other contributors to know what is outstanding.

    Consult the Drupal Contributor Guide โ†’ to find step-by-step guides for working with issues.

  • ๐Ÿ‡ฆ๐Ÿ‡บAustralia alex.skrypnyk Melbourne

    Re-rolled #37 for 9.5.x with the following changes:
    1. Fixed $ignore_directories = Settings::get('file_scan_ignore_directories', []); not working as settings has not been initialised yet.
    2. Updated deprecation messages to point to 9.6.x

  • ๐Ÿ‡ฆ๐Ÿ‡บAustralia alex.skrypnyk Melbourne

    When testing custom modules or themes using Drupal Core's PHPUnit configuration file, core/phpunit.xml.dist, note that the core/tests/bootstrap.php file scans all extension roots (core, modules, themes, etc.), and there is currently no way to opt out of this behavior. While we may want to use Drupal Core's bootstrap file to reuse PHPUnit behavior when testing custom modules or themes within real sites, we may not want to use the same targets.

    In practice, the current implementation of core/tests/bootstrap.php can lead to longer test execution times that scale proportionally with the number of installed modules. For instance, if you have a site with 100 contributed modules and one custom module with 1 unit test, running that test will scan through core and 100s of contributed modules to then only run one test in your custom module.

    And if you run these tests in a Docker container, the performance degradation can be even worse due to PHP's constant checking for directory and file existence while performing such a lookup.

  • ๐Ÿ‡ฆ๐Ÿ‡บAustralia larowlan ๐Ÿ‡ฆ๐Ÿ‡บ๐Ÿ.au GMT+10

    We've been running the version from [#14292061] with our phpunit.xml referencing a custom bootstrap.php for ~4 years. You just need to update it from time to time as the major phpunit version changes.

  • ๐Ÿ‡ญ๐Ÿ‡บHungary mxr576 Hungary

    @alex.skrypnyk thanks for finding the other issue where I was also involved, or made an attempt to make a related change...

    + $ignore_directories = Settings::get('file_scan_ignore_directories', []);

    I did the some, but @Lendude might have a valid point on the other issue

    Also, are we really sure we want to re-use an existing setting? Does 'file_scan_ignore_directories' always match the pattern we want to use here? The current use case is to ignore dirs where we look for modules, do we really want to add 'and where we look for tests' to that? Sure, there might be a lot of overlap, but is this always 100%?

    (Source [#2943172#comment-13216764])

  • ๐Ÿ‡ง๐Ÿ‡ชBelgium wim leers Ghent ๐Ÿ‡ง๐Ÿ‡ช๐Ÿ‡ช๐Ÿ‡บ

    @larowlan just reminded me of this. Still would be very nice to land this.

    P.S.: I forgot I even created this. @larowlan talked about #3050881: Use RecursiveExtensionFilterIterator in drupal_phpunit_find_extension_directories for performance โ†’ and then told me it's a duplicate of "mine" โ€” this issue ๐Ÿคฃ

Production build 0.71.5 2024