In order to have more metadata on the chunk of content that is being converted to embeddings and later stored in vector database, the title of the entity is added to each chunk as a headline 1. This happens of course only when title (entity label field) is present in the index fields and is not "Ignored". But this also leads to the duplication of the title, as it is later added to the end of the chunk as contextual content with label of the field and its value.
Install ai_search module
Do not include the title in the end of the chunk, as the title is already added in the beginning. This will save some tokens and will allow to use them more efficiently.
Alternatively, do not add title as h1 but leave the title in the chunk metadata.
Decide on the approach, create MR
Needs review
1.2
AI Search
Not all content is available!
It's likely this issue predates Contrib.social: some issue and comment data are missing.
No activities found.