Problem/Motivation
Hi,
With Drupal 11.2 and the AI module 1.1, the AI Settings configuration form lists a large number of operations (for example: Chat with System Role, Embeddings, Chat with Tools, etc.) when setting up providers.
However, no help text is currently provided to explain what each operation actually does. This makes the form hard to understand, even for technical users, and makes the module difficult to configure.
Currently, users have to refer to the source code or external documentation to understand what operations such as "Chat with Structured Response" or "Chat with Tools / Function Calling" mean, which hinders adoption and proper understanding of the module.
Proposed resolution
I propose adding clear and concise help text to each operation field in the AI Settings form. These descriptions would explain concretely what each operation does and make configuration easier.
Here is an initial list of proposed descriptions (feel free to add to it or correct it):
- Audio to Audio: Transform an input audio file into a modified audio output, for example by changing the voice or improving sound quality.
- Image and Audio to Video: Combine an image and an audio track to create a video, for example by animating a static image with associated sound.
- Speech to Speech: Convert spoken input into modified speech, including translation, voice conversion, or tone modification.
- Text to Speech: Generate spoken audio from written text using AI-generated synthetic voices.
- Embeddings: Represent text or images as numerical vectors capturing semantic meaning for search or comparison.
- Moderation: Detect and filter harmful or inappropriate content in text or images.
- Text to Image: Create an image from a text description using generative AI.
- Image to Video: Animate a still image to produce a short video sequence with movements or effects.
- Speech to Text: Transcribe spoken audio into written text.
- Image Classification: Automatically identify the content and category of an image.
- Discussion: Enable free-form conversational interaction in natural language.
- Translate Text: Translate text from one language to another.
- Chat with Image Vision: Analyze and interpret images provided within a conversation to enrich responses.
- Chat with Audio: Process audio files provided within a conversation to understand them and respond.
- Chat with Video: Analyze videos received in a chat to provide relevant responses.
- Chat with System Role: Guide the style and behavior of the conversation via predefined system instructions.
- Chat with Complex JSON: Produce structured and valid JSON outputs suitable for programmatic use.
- Chat with Structured Response: Format responses into predictable structures such as lists or tables to facilitate readability and integration.
- Chat with Tools / Function Calling: Allow the model to call external functions or APIs during the conversation.
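To make the proposal more concrete, here is a minimal sketch of how such descriptions could be attached to form elements through the Form API `#description` property. It is only an illustration: the operation keys (`embeddings`, `speech_to_text`, `text_to_speech`), the function names, and the shape of the form array are assumptions on my part, not the actual structure of the AI Settings form.

```php
<?php

/**
 * Returns a map of operation keys to help texts.
 *
 * The keys are hypothetical placeholders; the real operation type IDs
 * used by the AI module may differ.
 */
function example_operation_descriptions(): array {
  return [
    'text_to_speech' => 'Generate spoken audio from written text using AI-generated synthetic voices.',
    'embeddings' => 'Represent text or images as numerical vectors capturing semantic meaning for search or comparison.',
    'speech_to_text' => 'Transcribe spoken audio into written text.',
  ];
}

/**
 * Adds a '#description' to every form element that matches a known operation.
 */
function example_add_operation_descriptions(array $form, array $descriptions): array {
  foreach ($descriptions as $operation_key => $description) {
    // Only touch elements that actually exist in the form.
    if (isset($form[$operation_key])) {
      // In a real Drupal form, the literal strings would normally be
      // passed through t() so that they can be translated.
      $form[$operation_key]['#description'] = $description;
    }
  }
  return $form;
}

// Mock form array for illustration; the real form structure may differ.
$form = [
  'embeddings' => ['#type' => 'select', '#title' => 'Embeddings'],
  'speech_to_text' => ['#type' => 'select', '#title' => 'Speech to Text'],
];

$form = example_add_operation_descriptions($form, example_operation_descriptions());
echo $form['embeddings']['#description'], PHP_EOL;
```

Keeping the texts in a single map like this would make them easy to review and correct in one place, but the maintainers may prefer to store the description alongside each operation type definition instead.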
Benefits
- Improved understanding of the form
- Faster and more intuitive configuration
- Reduced cognitive load for new users
- Smoother Drupal user experience