Introduce AiQuotaException

Created on 8 July 2024, 9 months ago

Problem/Motivation

With many (all?) AI providers, free tier for API tokens either doesn't exist or is very limited.
That means, that after obtaining an API key, site user cannot send queries to provider, getting an error instead.

Currently, generic exception is thrown, and there is helpful enough message, which unfortunately is buried in the message log. It would be easier to debug/fix, if in such case provider throws designated exception. Handling such an exception could result in error messages in status report (or other places, like AI settings page)

Steps to reproduce

  • Create a free account with Open AI, and create a secret key in default project.
  • Configure AI to use Open AI for chat, using the newly created key.
  • Try to perform any AI operation

Proposed resolution

  • Create Drupal\ai\Exception\AiQuotaException, that is thrown when quota is exceeded.
  • When this exception is thrown, set a flag (in State?) marking the whole provider unusable.
  • Provide a way to clear the flag (on cron? button in AI settings?)
  • Implement provider-specific checks for the quota. This does not have to be done for all providers at once, since not having AiQuotaException for specific provider will only leave it with the current DX (no regression)

Remaining tasks

  • Decide where to display information about quota limit
  • Decide how to clear the flag about over-quota

User interface changes

None or manual clearance of over-quota flag

API changes

No API changes

Data model changes

No data model changes

Feature request
Status

Active

Version

1.0

Component

AI Core module

Created by

🇧🇬Bulgaria valthebald Sofia

Live updates comments and jobs are added and updated live.
Sign in to follow issues

Comments & Activities

  • Issue created by @valthebald
  • 🇬🇧United Kingdom yautja_cetanu

    We probably want a separation of errors shown to end-users verses admin logs. We want the end-user errors to be something generic such as

    "An error has occurred, contact the site admin for more information"

    Then in the Drupal logs, I think say very specifically that it is due to the quota being used up.

  • 🇱🇹Lithuania mindaugasd

    Related with issue on openai module 🐛 Misleading error: You exceeded your current quota, please check your plan and billing details Active
    In that case I think openai service was returning the error wrong

  • Assigned to marcus_johansson
  • 🇩🇪Germany marcus_johansson

    @valthebald - So I think there will be some separation here - the more specific exception the better.

    I have implemented this in Anthropic provider first, since I didn't have any money on my private account:

    The underlying clients have no context if this is the error, so in this I have done error message pattern recognition. This is of course not a stable solution, but as written above it would fail back gracefully to the generic ai response error.

    I will try to add these for the providers where you can discern it from the client/error message.

    But for the disabling of providers if they are over their quota, I would suggest that we create a follow up ticket that would create a submodule for this. Not everyone would want that logic and the event system makes it very easy for another module to handle such logic.

    @yautja_cetanu - This is controlled via $config['system.logging']['error_level'] configuration, if you set it to very low level (on a production) system, you logic entails. On a local development machine its great to see full stack traces, exception types and messages.

  • 🇩🇪Germany marcus_johansson

    @valthebald - If its ok to move out the logic of any provider that is seeing a lot of exceptions to some external module, this would be done and testable in the branch: ratelimit-quota-exceptions (https://git.drupalcode.org/project/ai/-/tree/ratelimit-quota-exceptions?...)

    This also fixes another exception: https://www.drupal.org/project/ai/issues/3460274 📌 Introduce AIRateLimitException Active

    The quota one works for Anthropic and OpenAI, while the rate limit currently works for Groq, OpenAi, HuggingFace.

    For Mistral I added this ticket that needs to be done first: https://www.drupal.org/project/ai/issues/3460283 Build custom client for Mistral or switch to HelgeSverre lib Active

    For Ollama and LM Studio these exceptions shouldn't occur. Both have queuing systems on multiple requests.

  • Status changed to Needs review 9 months ago
  • 🇧🇬Bulgaria valthebald Sofia

    @Marcus_Johansson - what do you mean external module? As in having it's own composer package, or submodule of drupal/ai?

  • 🇩🇪Germany marcus_johansson

    @valthebald - I think that would be up for discussion, having that available to enable when you download the AI module probably makes sense I would think? Assuming someone has the time to work on it.

    I just want to keep the core abstraction module as lean as possible so it basically does input to output normalization and any logic that needs to happen before, during normalization or after a request is up to the providers or external modules to handle.

  • 🇧🇬Bulgaria valthebald Sofia

    I'd put the logic of when to throw the new exceptions (quota/rate) into providers, the core handling them in the unified way

  • 🇩🇪Germany marcus_johansson

    Ah, missed the response here. Right - the core would handle those in a unified with the change I made and according to how you write in the last comment.

    The thing that is "missing" from your initial description is that it doesn't take care of flagging/disabling an provider. This would have to go into some 3rd party contrib module or a submodule to the AI module.

    Pull request here: https://git.drupalcode.org/project/ai/-/merge_requests/27?resolved_confl...

  • 🇧🇬Bulgaria valthebald Sofia

    @Marcus_Johansson: I've added a few comments, none of them are show stoppers

  • Status changed to Fixed 9 months ago
  • 🇩🇪Germany marcus_johansson

    Thanks @valthebald - they are fixed together with some other phpcs complaints. I will merge it into 1.0.x-dev now.

  • Automatically closed - issue fixed for 2 weeks with no activity.

  • 🇳🇬Nigeria chike Nigeria

    Using 1.0.x-dev, I couldn't get past this error with free tier. I tried with OpenAI and Anthropic.

Production build 0.71.5 2024