Poor performance when downloading files from a non-standard filesystem

Created on 26 April 2024, 9 months ago
Updated 27 May 2024, 8 months ago

Problem/Motivation

Each image that loads on our site comes from the private:// scheme, stored on AWS S3 (we use s3fs).

Although all metadata come from the local database and/or cache (only the actual image content is downloaded from the remote S3 service), all images take several seconds to load (5s to 15s, sometimes more) and cause the DB and PHP processes to consume a lot of CPU.

Loading images should be quicker.

Steps to reproduce

Have a Drupal 10 site loading all its image assets from a non-standard filesystem, like S3.
Also, and this is mainly what makes the issue worse in our case: have all images in a single directory containing ~20000 pictures.

Proposed resolution

It turns out this is caused by the "hook_file_download" function implemented by IMCE, because every-time a file gets downloaded, it checks if the file is accessible, but the whole directory containing the file gets scanned in the process too.

In our particular case, that's causing roughly 20K queries to the database, each time a picture gets downloaded.

Here is the bit of code that causes that (\Drupal\imce\ImceFolder::checkItem):

  public function checkItem($name) {
    if (!$item = $this->getItem($name)) {
      if (!$this->scanned) {
        $this->scan();
        $item = $this->getItem($name);
      }
    }
    return $item;
  }

I'm not sure I understand why it's needed to scan the whole directory for checking a single item?

Could we not have something closer to what follows:

  public function checkItem($name) {
    if (!$item = $this->getItem($name)) {
      $this->loadItem($name);
      $item = $this->getItem($name);
    }
    return $item;
  }

But maybe I'm missing something. Besides, applying such a change is not as trivial as one may think because currently "$this->scan()" relies on "ImceFM::scanDir()" which itself relies on "$this->getConf('scanner', 'Drupal\imce\Imce::scanDir');" for populating entries.

So function "$this->loadItem()" would ideally come from a new method "ImceFM::loadItem()" I guess and a similar conf key like above, or something along those lines...

What are your thoughts?

User interface changes

None.

API changes

None.

Data model changes

None I think, but I'm not sure :/

🐛 Bug report
Status

Fixed

Version

3.0

Component

Code

Created by

🇫🇷France pacproduct

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Comments & Activities

Production build 0.71.5 2024