Have different approaches to chunking, perhaps configurable, perhaps different chunking engines.

Created on 2 July 2024, 3 months ago
Updated 9 September 2024, 12 days ago

Problem/Motivation

In Search API we did a very basic approach to chunking where we looked at the advice at the time and just went with that.

Things to think about:

- Can we find Chunking libraries elsewhere so we don't do it in Drupal.
- Do you have whole "Chunking methods" which includes everything all in one plugin
- Or do we make things configurable when you set up the index (Such as 5000 characters vs 500, try and find the end of the sentence, overlap, Do you strip HTML)
- If its configurable should we have sensible defaults so someone doesn't have to understand how this all works.
- What about adding meta data into the chunk (Such as Node Title, Author, Genre, etc)

Proposed resolution

📌 Task
Status

Fixed

Version

1.0

Component

AI Search

Created by

🇬🇧United Kingdom yautja_cetanu

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Comments & Activities

Production build 0.71.5 2024