Create the concept of Guardrail agents

Issue created by @marcus_johansson
Comment about 2 months ago →
🇩🇪Germany marcus_johansson
Comment 27 days ago →
🇩🇪Germany marcus_johansson
Comment 20 days ago →
🇩🇪Germany breidert
Additional information from weekly meeting:

The UI/UX has to make it simple to create and manage guardrails. However, since there are many things to configure, this could become complex.

Guardrails can be general and apply to general functionality such as AI Translation or Content Suggestions, where you might just need to block things like PII data or flag things like <script> tags.

Guardrails can also be very specific and be tied to a single agent or a tool.

A specific example of a guardrail for a single agent might be something like:
“Check that the text in the image only includes cooking instructions—nothing else.”
You’d only want that guardrail running for the agent that generates food recipes—not every AI process.

A UI/UX should work for all use cases.
Comment 18 days ago →
🇩🇪Germany breidert
Comment 13 days ago →
🇬🇧United Kingdom yautja_cetanu
"it can immediately raise an error, which stops the expensive model from running and saves you time/money."

From the OpenAI Agents SDK, I can't see if Guardrails HAVE to stop the execution or CAN cut the operation. I think it should be the "default" approach for guardrails but not the only. Instead it should be possible (even if not possible with version 1.1 to)

Have guardrails stop the execution.

Have guardrails go back to the agent to give them another go at it.

Have guardrails trigger some kind of end-user action that would allow the execution to continue where its left off. Terminate it, or start down a new path.

I don't think we should build all the above options above. But I think we should think about it as a possibility.
First commit to issue fork.
Comment 10 days ago →
🇮🇹Italy lussoluca Italy
lussoluca → changed the visibility of the branch 3518963-meta-create-the to hidden.
Comment 10 days ago →
🇮🇹Italy lussoluca Italy
lussoluca → changed the visibility of the branch 3518963-meta-create-the to active.
Comment 10 days ago →
🇮🇹Italy lussoluca Italy
lussoluca → changed the visibility of the branch 3518963-meta-create-the to hidden.
Comment 10 days ago →
🇮🇹Italy lussoluca Italy
Sorry, I wanted to push an initial stub, but then I realized that this issue was opened into ai_agents module (I worked on the ai module...)

I think that guardrails are a generic concept that can be applied to every interaction with an LLM, not only when using agents. Maybe we should move this issue to the ai project?
Comment 10 days ago →
🇩🇪Germany marcus_johansson
You are right Luca - we will need an issue here as well, for actual UI implementation, but that is dependent on the other issue. Will move.
Comment 9 days ago →
🇮🇹Italy lussoluca Italy
lussoluca → changed the visibility of the branch 3518963-meta-create-the to active.
Comment 9 days ago →
🇮🇹Italy lussoluca Italy
lussoluca → changed the visibility of the branch 3518963-meta-create-the to hidden.

Create the concept of Guardrail agents

Problem/Motivation

Comments & Activities