Problem with SolrExtractor

Created on 9 January 2018, almost 7 years ago
Updated 17 October 2024, 28 days ago

I have a site hosted on Acquia and configured per https://docs.acquia.com/acquia-search/search-api/attachments (so it uses the Solr extractor). It also has the patch from this issue: https://www.drupal.org/project/search_api_attachments/issues/2896418

Everything works fine for the most part, but sometimes the text in a file is not indexed properly.
For example, if a file contains the sentence "hey I'm an awesome file please index me mkay plz thx", the text is indexed with extra spaces inside words and with some words cut off, like "hey I'm an awe some file pl ease inde e mkay plz thx". I don't know whether this is an issue with this module or with Solarium. Right now I'm at a bit of a loss: has anyone else experienced this, and can anyone give some pointers on where to look?
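
To narrow down what is happening to the text, it may help to separate the two symptoms in the example above (spaces inserted inside words versus characters that go missing). The following is only a rough diagnostic sketch in Python, not part of search_api_attachments, Solarium, or Solr; the function name `compare_extraction` is made up for illustration, and it assumes you can copy both the original file text and the indexed text into strings.

```python
import difflib


def compare_extraction(original: str, indexed: str) -> dict:
    """Compare source text with indexed text while ignoring whitespace.

    Hypothetical helper for debugging; not an API of any Drupal or Solr library.
    """
    # Strip all whitespace so that inserted spaces do not count as differences.
    orig_compact = "".join(original.split())
    idx_compact = "".join(indexed.split())

    # Collect runs of characters from the original that are absent or altered
    # in the indexed text.
    matcher = difflib.SequenceMatcher(None, orig_compact, idx_compact, autojunk=False)
    dropped = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag in ("delete", "replace"):
            dropped.append(orig_compact[i1:i2])

    return {
        "only_whitespace_differs": orig_compact == idx_compact,
        "dropped": dropped,
    }


# The two strings quoted in this issue.
original = "hey I'm an awesome file please index me mkay plz thx"
indexed = "hey I'm an awe some file pl ease inde e mkay plz thx"

result = compare_extraction(original, indexed)
print(result)
```

For the strings quoted here, the comparison flags more than a whitespace difference: the "x" and "m" of "index me" are missing from the indexed text. That suggests characters are being lost somewhere, not only split, although this check alone cannot tell whether the loss happens during extraction or at indexing time.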

💬 Support request
Status

Closed: cannot reproduce

Version

1.0

Component

Code
