Allow analyzers to be specified for elasticsearch pipelines

Created on 21 November 2023, over 1 year ago

Problem/Motivation

Use case: we need to store HTML data in a field, and perform search filters and sorts on the same field, ignoring the HTML characters.

This can be achieved in elasticsearch by adding an analyzer, for example:

{
  "settings": {
    "analysis": {
      "analyzer": {
        "my_analyzer": {
          "tokenizer": "keyword",
          "char_filter": ["html_strip"]
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "name": {
        "type": "text",
        "fields": {
          "keyword": {
            "type": "keyword"
          },
          "plain_text": {
            "type": "text",
            "analyzer": "my_analyzer"
          }
        }
      }
    }
  }
}

Steps to reproduce

N/A

Proposed resolution

Allow the analysis configuration to be provided in a pipeline yaml file:

my_pipeline:
  label: 'My Pipeline'
  class: '\Drupal\data_pipelines_elasticsearch\ElasticSearchDatasetPipeline'
  analysis:
    analyzer:
      my_analyzer:
        tokenizer: keyword
        char_filter:
          - html_strip
  mappings:
    properties:
      name:
        type: text
        fields:
          keyword:
            type: keyword
          plain_text:
            type: text
            analyzer: my_analyzer

Remaining tasks

Test coverage, reviews, etc

User interface changes

API changes

Data model changes

✨ Feature request

Status

Active

Version

1.0

Component

Code

Created by

🇦🇺Australia mstrelan

Live updates comments and jobs are added and updated live.

Merge Requests

!12Allow analyzers to be specified for elasticsearch pipelines
Closed
🇦🇺Australia mstrelan
updated 3 months ago

Comments & Activities

Issue created by @mstrelan
Merge request !12Allow analyzers → (Closed) created by mstrelan
Open in Jenkins → Open on Drupal.org →
Core: 9.5.x + Environment: PHP 8.1 & MySQL 8
last update over 1 year ago
69 pass
Comment over 1 year ago →
🇨🇦Canada jibran Toronto, Canada
The MR looks good let's add some tests.
Open in Jenkins → Open on Drupal.org →
Core: 9.5.x + Environment: PHP 8.1 & MySQL 8
last update over 1 year ago
69 pass
Status changed to Needs work over 1 year ago6:05am 21 November 2023
Comment over 1 year ago →
🇦🇺Australia mstrelan
Comment over 1 year ago →
🇨🇦Canada jibran Toronto, Canada
Status changed to Closed: outdated 4 months ago4:13am 14 March 2025
Comment 4 months ago →
🇦🇺Australia nterbogt
Issue has migrated to ✨ Allow analyzers to be specified for elasticsearch pipelines Active .
Comment 3 months ago →
System Message
mortim07 → closed merge request !12

contrib.social Blog FAQ Discussions

Production build 0.71.5 2024