Talking with content - is it the content management of the future? (plus Python side question)

Created on 20 September 2023, 9 months ago
Updated 13 June 2024, 15 days ago

Problem/Motivation

Python has an interesting library "LlamaIndex" https://github.com/jerryjliu/llama_index

  1. It offers data connectors to ingest content from all data sources and data formats:
    • Simple Directory
    • Psychic
    • DeepLake
    • Qdrant
    • Discord
    • MongoDB
    • Chroma
    • MyScale
    • Faiss
    • Obsidian
    • Slack
    • Web Page
    • Pinecone
    • Mbox
    • Milvus
    • Notion
    • Github Repo
    • Google Docs
    • Database (SQL etc.)
    • Twitter
    • Weaviate
    • Make
    • Deplot
  2. Provides ways to structure data (indices, graphs) so that this data can be easily used with LLMs.
  3. Provides an advanced retrieval/query interface over data: Feed in any LLM input prompt, get back retrieved context and knowledge-augmented output.
  4. Allows easy integrations with your outer application framework (e.g. with LangChain, Flask, Docker, ChatGPT, anything else).

To sum up: it allows to gather content from anywhere and talk with AI about it. In other words, manage content smart automatically.

Example how content management of the future might work:

Let's say, I want to find something on drupal.org, I could ask an AI:

"Dear AI, please find most innovative ideas that people have posted on drupal.org in the last 3 months"

And AI having index of all drupal.org content, would give the answer (ideally)

Proposed resolution

  1. Drupal has early version of chat interface https://drupal.org/project/aichat
  2. Drupal has many attempts to index content using Search API and various vector databases. Full list of modules here: https://www.drupal.org/project/ideas/issues/3346258#modules-list-indexing πŸ“Œ [META] Drupal could be great for building AI tools (like ChatGPT) Active
  3. Drupal has very nice new innovation to my knowledge: https://drupal.org/project/aiprompt
  4. Drupal has some traditional benefits for AI content management, as explained in this video "Beyond Vector Search: Knowledge Management with Generative AI"
  5. [Drupal has many other good AI things which are not related to this issue]

But LlamaIndex is way ahead of Drupal.

I tried to look what libraries PHP has to offer, but it is void of AI activity (only few unrelated libraries).

Maybe Drupal has to go all in to integrate with python? So it would be easier to adopt all the richness of Python ecosystem across Drupal websites? (and so, adopt llama index)

Remaining tasks

User interface changes

API changes

Data model changes

πŸ’¬ Support request
Status

Active

Component

Discussion

Created by

πŸ‡±πŸ‡ΉLithuania mindaugasd

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Comments & Activities

  • Issue created by @mindaugasd
  • πŸ‡±πŸ‡ΉLithuania mindaugasd
  • πŸ‡ΊπŸ‡ΈUnited States afinnarn

    Maybe Drupal has to go all in to integrate with Python? So it would be easier to adopt all the richness of Python ecosystem across Drupal websites? (and so, adopt llama index)

    I wholeheartedly agree with trying to leverage the Python libraries vs. trying to port them to PHP and keep up with the original Python library. IMHO, that approach would be far easier to maintain as you mention PHP not having as much activity in the AI/ML space as Python does...and PHP never will have a lot of activity since "connector tools" allow you access to the Python libraries without having to port the code to another language.

    Another benefit is if there is a list of "data connectors" for ChatGPT/Llama Index that includes Drupal, people can say "hmm, what's this Drupal thing?" and try Drupal out vs. not trying to integrate and "stay on the island" so only Drupal and PHP devs are aware of how AI tools can be integrated and used in a structured CMS system.

    My two cents...

  • Status changed to Postponed: needs info 15 days ago
  • πŸ‡¦πŸ‡ΊAustralia pameeela

    This feels more like a general discussion than a specific proposal for Drupal core?

  • Status changed to Active 15 days ago
  • πŸ‡±πŸ‡ΉLithuania mindaugasd

    Yes, I am doing the same as for other issues and moving it to AI initiative project.

Production build 0.69.0 2024