Exceeded quota after having one conversation with Azure OpenAI deployments

baysonfox 0 Reputation points
2024-09-13T04:41:02.7633333+00:00

Using Azure for Students, I was able to create a resource group and creating an Azure OpenAI resource in the resource group, both the resource group and the Azure OpenAI resource are in East US.
However, when I created a model deployment in the created Azure OpenAI resource and tried to test the model in the playground, it would return an error, saying that I've exceeded the quota, telling me to create another conversation after 1 minute.

Before asking, I've searched the web, and found an answer that creating the resource in the same region of the resource group could solve this problem, but in my case, both the resource group or the resource itself are in the same region.CleanShot 2024-09-13 at 12.33.48@2x

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,940 questions
{count} votes

2 answers

Sort by: Most helpful
  1. Deleted

    This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.


    Comments have been turned off. Learn more

  2. AshokPeddakotla-MSFT 33,026 Reputation points
    2024-09-16T11:33:27.0233333+00:00

    baysonfox Greetings!

    Thanks for providing additional details and confirmation.

    The error message is related to rate limits, which is a common practice in APIs to prevent abuse and ensure fair usage.

    Using Azure for Students, I was able to create a resource group and creating an Azure OpenAI resource in the resource group, both the resource group and the Azure OpenAI resource are in East US. However, when I created a model deployment in the created Azure OpenAI resource and tried to test the model in the playground, it would return an error, saying that I've exceeded the quota, telling me to create another conversation after 1 minute.

    In your case, the error message indicates that you’ve exceeded the token rate limit of your current AI Services S0 pricing tier.

    As you mentioned, you are using Azure for Students subscription. Quota Limit in tokens per minute (TPM) for all models is 1K.

    Please see Azure OpenAI Service quotas and limits for more details.

    TPM rate limits are based on the maximum number of tokens that are estimated to be processed by a request at the time the request is received. You can check this documentation for more details.

    To view your quota allocations across deployments in a given region, select Shared Resources -> Quota in Azure OpenAI studio and click on the link to increase the quota.

    User's image

    Since you are on limited quota, To minimize issues related to rate limits, it's a good idea to use the following techniques:

    • Implement retry logic in your application.
    • Avoid sharp changes in the workload. Increase the workload gradually.
    • Test different load increase patterns.
    • Increase the quota assigned to your deployment. Move quota from another deployment, if necessary.

    If you are not able to increase the quota on a student subscription, please contact customer service at any time so that we can adjust your limits appropriately.

    You must upgrade your Azure for Students Starter subscription to a Pay-As-You-Go subscription to increase your quotas or limits. For more information, see Upgrade your Azure Free Trial subscription to a Pay-As-You-Go subscription

    Do let me know if that helps or have any other queries.

    If the response helped, please do click Accept Answer and Yes for was this answer helpful.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.