NotOnServer only shows files if parent directory of missing file exists

Created on 10 September 2024, 7 months ago
Updated 19 September 2024, 7 months ago

Problem/Motivation

The "Not On Server" report won't show a file if the parent directory of the missing file doesn't exist.

Steps to reproduce

Upload a document with a path of the following form, possibly using a media entity. I think the below is the default path generated for the "Document" media entity.

public://documents/2024-10/missing_filename.pdf

Delete only missing_filename.pdf then view the "Not On Server" report and the missing file will be shown.

Now delete either directory 2024-10 or documents and view the report again. The missing file won't appear in the report.

In our situation, we've deleted the documents directory as we didn't need any of them. Recreating that directory doesn't show what files were missing from the subdirectories.

Proposed resolution

Find out why this is happening and fix it.

Remaining tasks

See above.

User interface changes

API changes

Data model changes

🐛 Bug report
Status

Needs review

Version

4.2

Component

Not on server report

Created by

🇦🇺Australia imclean Tasmania

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Comments & Activities

  • Issue created by @imclean
  • 🇦🇺Australia imclean Tasmania
  • 🇦🇺Australia imclean Tasmania

    I suspect the culprit might be this line in Drupal\auditfiles\Auditor\AuditFilesNotOnServer:

    $target = $this->fileSystem->realpath($result->uri);
    

    This is converting a URI found in the file_managed table to a realpath, but it appears to return false if the file parent directory doesn't exist when it needs to return the full path to the missing file.

    For context, here's the entire method:

      public function getReferences(): \Generator {
        $maximumRecords = $this->auditFilesConfig->getReportOptionsMaximumRecords();
        $query = $this->connection->select('file_managed', 'fm');
        $query
          ->orderBy('changed', 'DESC')
          ->range(0, $maximumRecords)
          ->fields('fm', ['fid', 'uri']);
        /** @var array<object{uri: string, fid: string}> $results */
        $results = $query->execute()->fetchAll();
        foreach ($results as $result) {
          $target = $this->fileSystem->realpath($result->uri);
          if ($target !== FALSE && !file_exists($target)) {
            yield FileEntityReference::create((int) $result->fid);
          }
        }
      }
    
  • @imclean opened merge request.
  • Status changed to Needs review 7 months ago
  • 🇦🇺Australia imclean Tasmania
  • 🇦🇺Australia imclean Tasmania
  • 🇦🇺Australia imclean Tasmania

    Ref: https://www.php.net/manual/en/function.realpath.php

    Note:

    The running script must have executable permissions on all directories in the hierarchy, otherwise realpath() will return false.

    If a parent directory doesn't exist then realpath will return false, excluding it from this module's file_exists check and therefore not showing it in the "not on server report".

Production build 0.71.5 2024