- last update
about 1 year ago 25,906 pass, 1,808 fail - ๐ต๐ญPhilippines abhaypai
Landed here from Bug smash initiative.
+1 Patch is applied successfully for version 11.x-dev too and starting test for 11.x-dev version
- ๐ฎ๐ณIndia prashant.c Dharamshala
Prashant.c โ made their first commit to this issueโs fork.
- Status changed to Needs review
about 1 year ago 3:13pm 19 December 2023 - ๐ฎ๐ณIndia prashant.c Dharamshala
@Anybody
Created MR against 11.x by taking the changes from #8 ๐ Removing trailing slashes from robots.txt Needs review .Thank you.
- Status changed to RTBC
about 1 year ago 4:18pm 19 December 2023 - ๐ฉ๐ชGermany Anybody Porta Westfalica
Thanks @Prashant.c! All tests are passing green, so I think we should get this out of the way! Marking this RTBC.
- last update
about 1 year ago Patch Failed to Apply - Status changed to Needs review
about 1 year ago 8:43pm 19 December 2023 - ๐ธ๐ฐSlovakia poker10
According to the Google robots.txt description here: https://developers.google.com/search/docs/crawling-indexing/robots/robot... , the
/search
should match any path that starts with/search
. So I do not think we need both/search
and/search/
(and similar for the second change). The rules from robots.txt should be the "starts with" rules.The second question is, we have
/admin/
in robots.txt./admin
path is valid as well, so why we are not changing this too? - Status changed to Needs work
about 1 year ago 9:26pm 19 December 2023 - ๐บ๐ธUnited States smustgrave
Can it be documented in the issue summary what paths were chosen and why.
- First commit to issue fork.
- ๐ฉ๐ฐDenmark ressa Copenhagen
I agree @poker10, we should remove all trailing slashes, and I have updated the MR to reflect this.
I have added the table below in the Issue Summary, is that sufficient documentation @smustgrave, or do we need some more?
From How Google interprets the robots.txt specification > URL matching based on path values:
PS. Personally, I would add
Disallow: /node
, since:- The vast majority of sites use Pathauto, installs January 2025:
Drupal core: 723,408 Pathauto: 514,780
- Getting paths such as
/node/100
indexed instead of the human readable URL alias/my-alias
is bad for SEO ...
... but that's for another issue :)
- The vast majority of sites use Pathauto, installs January 2025:
- ๐ฉ๐ฐDenmark ressa Copenhagen
I created an issue about disallowing
/node
. - ๐ฉ๐ฐDenmark ressa Copenhagen
The table outlining the rules I wanted to add in the Issue Summary got lost, now it's actually added.