Document how the link capture date on archive.org is found

Created on 16 August 2025, 7 days ago

Problem/Motivation

It looks like Wayback Filter automagically does not link to the last instance, which may be a 404, but somehow instead to a capture with actual content. Maybe it could be described how this is accomplished in the intro text? Because I think that's a great feature, if it's by design.

See for example this link, where the last few instances are 404's (2023 and 2024):
http://web.archive.org/web/*/https://www.fsb.dk/om-fsb/nybyggeri/tingbje...

The module links to this capture, which does not seem to exist:
https://web.archive.org/web/20181004134330/https://www.fsb.dk/om-fsb/nyb...

... and the final link it lands on is a capture from 7 dec 2016:
https://web.archive.org/web/20161207214929/https://fsb.dk/om-fsb/nybygge...

Steps to reproduce

Wonder how the link is found.

Proposed resolution

Describe in a sentence or two how it works.

Remaining tasks

User interface changes

API changes

Data model changes

📌 Task
Status

Active

Version

1.1

Component

Documentation

Created by

🇩🇰Denmark ressa Copenhagen

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Comments & Activities

  • Issue created by @ressa
  • 🇩🇰Denmark Steven Snedker

    It's a great feature, and it's by design. Wayback Filter asks for a specific archived version based on the node's created date (sensible). Archive.org serves up the best match (sensible). We are all overcome by gratitude.

    I have added a "Nice to know"-section to the settings page of the module and a "I don't get exactly the archived version I want"-paragraph to the module page .

    You're the first to ask this question in 12 years. Nice to finally detect some interest. I wonder how fast the Wordpress lot will react.

  • 🇩🇰Denmark ressa Copenhagen

    Thanks for explaining the magic, it's a killer feature, and great additions under "Nice to know" in the module, and on the project page. They really drive home how this feature works.

    I had originally planned to just create "dummy" links to archive.org, simply pointing to the overview page ("calendar view of all captures" for example web.archive.org/web/*/https://www.fsb.dk/om-fsb/nybyggeri/tingbjerg-kulturhus/) where the user would then laboriously have to sift through captures, to find a good one.

    I hoped to avoid this ... So I was very glad when I found this module, and saw actual direct links, and thought it deserved to be highlighted, since having a direct link to an actual content page is incredible. I sincerely hope archive.org will exist forever.

  • 🇩🇰Denmark Steven Snedker

    I was very happy when I made the module 12 years ago. It's ultra lightweight and solves a real problem elegantly for millions of people.

    Somehow, no-one else cared about it. Two installations (probably both my own) for a decade, and a steadfast, opaque decision of the almighty archive.org NOT to write a single word about it ever. Even if prompted lovingly annually. Since 2014. Archive.org must find thousands of incoming links somehow unpalatable.
    Archive.org have, however, written quite extensively about the Wayback bot running on the Wikipedias.

    I'm happy that you found the module and understood it. May you be a better patient zero than I have been.

    I wonder whether The Drupal version or The Wordpress Version will be the first to reach 1.000 active installations.

  • 🇩🇰Denmark ressa Copenhagen

    Strange how Archive.org ignores this great tool, you would think that having the module installed on potentially thousands of Drupal sites would be appealing ... I have tried to do my little bit with the https://www.drupal.org/forum/general/general-discussion/2025-01-23/whats... Forum post.

    Great that a WordPress version has been created as well -- remember to slap on a $40/month WP-fee, when it hits 500 installs :)

    I thought about something else -- I fear a day in the future where archive.org is gone ... Is a belt-and-suspenders kind of feature worth considering? I created an issue Download and serve archive.org captures locally Active .

Production build 0.71.5 2024