Add support for Unicode input texts.

Created on 16 December 2020, over 4 years ago
Updated 6 June 2023, almost 2 years ago

Problem/Motivation

The regex word boundary \b doesn't work as expected for accented characters.

Steps to reproduce

For example, take the input word kúpeľniach and a glossified taxonomy term named peľ.

The term peľ is highlighted correctly when it is separated by spaces. But when it appears inside a longer word such as kúpeľniach, the filter also picks up that word. Ideally, the glossify filter should not match it.
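The false match can be reproduced outside the module. The following is a minimal sketch in Python, not the filter's actual code: it assumes the pattern has the shape \bTERM\b, and it uses Python's re.ASCII flag as a stand-in for a regex engine that is not running in Unicode mode. Without Unicode awareness, ú and ľ do not count as word characters, so the engine sees a word boundary on either side of "peľ" inside "kúpeľniach".

```python
import re

term = "peľ"
text = "kúpeľniach"

# re.ASCII restricts \w (and therefore \b) to [a-zA-Z0-9_]. This approximates
# a regex engine without Unicode mode; it is an illustration, not the module's code.
pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.ASCII)

match = pattern.search(text)

# The term is found inside the longer word, which is the unwanted behaviour.
print(match is not None)
```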

Proposed resolution

Add a Unicode modifier to the regular expression.

Since this is a case-specific requirement, the suggestion is to add a filter configuration option that is disabled by default. That way existing users are not affected, and users who need Unicode-aware matching can enable the setting.
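As a rough illustration of an opt-in setting, the sketch below builds the term pattern with or without Unicode-aware boundaries. It is written in Python rather than in the module's own language. The function name build_term_pattern and the parameter unicode_boundaries are hypothetical, and the \bTERM\b pattern shape is an assumption. The default keeps the non-Unicode behaviour, so nothing changes unless the option is turned on.

```python
import re


def build_term_pattern(term, unicode_boundaries=False):
    """Compile a whole-word pattern for a glossary term.

    Hypothetical helper for illustration only. With unicode_boundaries=False
    the pattern uses ASCII-only word boundaries (the pre-existing behaviour
    in this sketch); with True, accented letters count as word characters.
    """
    # For str patterns, Python's re module is Unicode-aware unless re.ASCII is set.
    flags = 0 if unicode_boundaries else re.ASCII
    return re.compile(r"\b" + re.escape(term) + r"\b", flags)


# Default (option off): the term is still found inside the longer word.
legacy = build_term_pattern("peľ")
print(legacy.search("kúpeľniach") is not None)

# Option on: no match inside the longer word, but a standalone term still matches.
unicode_aware = build_term_pattern("peľ", unicode_boundaries=True)
print(unicode_aware.search("kúpeľniach") is None)
print(unicode_aware.search("kvetový peľ lieta") is not None)
```

Keeping the flag off by default mirrors the proposal: sites that rely on the current matching see no change, while sites with accented content can opt in.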

Remaining tasks

User interface changes

API changes

Data model changes

Feature request
Status

Fixed

Version

2.0

Component

Code

Created by

🇮🇳India JayKandari
