Which model version should I be using for my openai deployment

Justin Baron 0 Reputation points
2024-04-19T10:49:09.8033333+00:00

Hi

I live in the UK and have written a webapp for a client in Holland. It uses GPT-4 and GPT-3 and have set up the appropriate deployments for both. I need to send the largest number of tokens as possible as the app is summarising data. Something seems to have changed over the last few days and I am struggling with GPT-4.

  1. Which model should I be using for GPT-4?
  2. I want to use my clients account and models so they can pay. Is there a GPT-4 model available to use in Holland?

At the moment my webapp (which was working fine) seems not to work for GPT4. It works locally for GPT-4. Am I missing something?

Thanks!
Justin

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,176 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. AshokPeddakotla-MSFT 27,401 Reputation points
    2024-04-19T11:25:02.07+00:00

    Justin Baron Greetings & Welcome to Microsoft Q&A forum!

    What is the exact issue you are facing GPT-4 model?

    Please see below answers to your queries.

    Which model should I be using for GPT-4?

    It depends on your requirement. The default quota for models varies by model and region. Default quota limits are subject to change.

    Quota for standard deployments is described in of terms of Tokens-Per-Minute (TPM).

    User's image

    Choose from the above list and use it accordingly based on token limits.

    I want to use my clients account and models so they can pay. Is there a GPT-4 model available to use in Holland?

    Please check the model availability. If is available you can deploy and use.

    For using your client's account and models, you can use Azure OpenAI's multi-tenancy feature to allow your client to pay for the usage of the models. This allows you to use your client's subscription and models while still being able to manage the usage and billing.

    At the moment my webapp (which was working fine) seems not to work for GPT4. It works locally for GPT-4. Am I missing something?

    Can you share more details on what is not working? it's difficult to say without more information. It's possible that there may be an issue with the deployment or configuration of the model.

    I recommend checking the logs and error messages to see if there are any clues as to what may be causing the issue.

    Do let me know if you have any further queries.