Database fulltext search words with minus "-" character

Created on 4 July 2024, 2 months ago
Updated 20 July 2024, about 2 months ago

Hello everyone, and sorry for the bad English.

I have problems indexing/searching (backend database) some words (fulltext search) that contain the "-" character, such as product codes like:
P-0003
D-0012

I went crazy with the various configurations and processors, but all I could get was:

- nothing is found for me (for example searching for P-0003)
or "playing" with tokenizer/stop words/etc:
- I get results that have nothing to do with it (for example, searching for P-0003 also returns M-0002, P-0005, D-0043, ...)

Thanks for any help you can give me.

💬 Support request
Status

Postponed: needs info

Version

1.35

Component

Database backend

Created by

🇮🇹Italy gatiba

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Comments & Activities

  • Issue created by @gatiba
  • Status changed to Postponed: needs info about 2 months ago
  • 🇦🇹Austria drunken monkey Vienna, Austria

    It would be helpful if you could elaborate what configuration you have already tried. In any case, in theory you should have little more to do than configure the Tokenizer processor to either strip minusses (i.e., include them as an ignored character – alternatively you could also use the “Ignore characters” processor for that) or to treat them as normal text (i.e., neither ignore them nor treat them as whitespace).
    However, there is tons of other code that might interfere, I admit, so it can be tricky to get tokenizing and searching working exactly as you want it.

Production build 0.71.5 2024