Multiple strings can match one path alias

Created on 31 March 2023, over 1 year ago

Problem/Motivation

The Drupal installation guide recommends utf8mb4_unicode_ci or utf8mb4_general_ci as the database COLLATION (which is good).

https://www.drupal.org/docs/installing-drupal/step-3-create-a-database#:... .

But that has implications for queries against path_alias, because both collations are case-insensitive and accent-insensitive. For example, this URL:

https://www.drupal.org/node/add/project-issue/drupal

returns the same content as this URL:

https://www.drupal.org/node/add/PrOjEcT-íššúé/drůpál

This is probably an issue on all Drupal sites, not only D9 or D10. I can tell you, our SEO guy is not happy about that :-)
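
To illustrate why these URLs collide, here is a minimal Python sketch that approximates a case- and accent-insensitive comparison. It is not MySQL's actual collation algorithm and not Drupal code; the helper name `fold` is made up for this example, and it only mimics the general effect of a `_ci` collation by stripping combining marks and case.

```python
import unicodedata


def fold(path: str) -> str:
    """Build a rough case- and accent-insensitive comparison key.

    Assumption: this only approximates what a *_ci collation does;
    the real MySQL rules differ in detail.
    """
    # Split accented letters into base letter + combining mark.
    decomposed = unicodedata.normalize("NFKD", path)
    # Drop the combining marks, keeping only the base letters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Ignore letter case.
    return stripped.casefold()


canonical = "/node/add/project-issue/drupal"
variants = [
    "/node/add/PrOjEcT-íššúé/drůpál",
    "/NODE/aDď/PrOjEcT-íssue/drupal",
]

for variant in variants:
    same_when_folded = fold(variant) == fold(canonical)
    same_when_exact = variant == canonical
    print(variant, same_when_folded, same_when_exact)
```

Under this approximation both variants compare equal to the canonical path once folded, while an exact string comparison tells them apart.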

Steps to reproduce

Log in to drupal.org and try opening this URL:

https://www.drupal.org/node/add/project-issue/drupal

These URLs return the same content (you can try changing any character to its accented or uppercase version):

https://www.drupal.org/node/add/PrOjEcT-íššúé/drůpál
https://www.drupal.org/NODE/aDď/PrOjEcT-íssue/drupal
...

Proposed resolution

This behavior should be configurable. The simplest way to avoid it is to change the collation of the alias column in MySQL to utf8mb4_bin. But that is probably not good for people who want, for example, case-insensitive paths.
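
The effect of a per-column collation can be sketched with Python's built-in sqlite3 module. This is only a stand-in, under some loud assumptions: SQLite is used instead of MySQL, the tables `alias_ci` and `alias_bin` and the sample alias `/about-us` are invented for the example, and SQLite's `NOCASE` folds ASCII case only, so it shows the case half of the problem but not the accent half.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Two hypothetical tables that differ only in the collation of the alias column.
conn.execute("CREATE TABLE alias_ci (path TEXT, alias TEXT COLLATE NOCASE)")
conn.execute("CREATE TABLE alias_bin (path TEXT, alias TEXT COLLATE BINARY)")

# Made-up sample row, stored identically in both tables.
sample = ("/node/1", "/about-us")
conn.execute("INSERT INTO alias_ci (path, alias) VALUES (?, ?)", sample)
conn.execute("INSERT INTO alias_bin (path, alias) VALUES (?, ?)", sample)


def lookup(table: str, alias: str):
    """Return the system path stored for an alias, or None if nothing matches."""
    # The table name comes from the fixed values above, never from user input.
    row = conn.execute(
        f"SELECT path FROM {table} WHERE alias = ?", (alias,)
    ).fetchone()
    return row[0] if row else None


print(lookup("alias_ci", "/About-US"))   # case-insensitive column
print(lookup("alias_bin", "/About-US"))  # binary column
print(lookup("alias_bin", "/about-us"))  # binary column, exact spelling
```

With the case-insensitive column the differently cased request resolves to the stored path, while the binary column matches only the exact spelling. That is the trade-off described above: strict matching removes duplicate URLs but also removes case-insensitive paths for sites that rely on them.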

🐛 Bug report
Status

Active

Version

10.0

Component
Path 

Last updated about 5 hours ago

  • Maintained by
  • 🇬🇧United Kingdom @catch
Created by

🇸🇰Slovakia havran
