We’re excited to announce the General Availability of the Azure OpenAI Token Limit policy in Azure API Management! This policy empowers customers to enforce limits on API consumers based on their Azure OpenAI token usage.
Limits are expressed in tokens per minute (TPM), giving customers precise control over token consumption and ensuring fair and efficient utilization of OpenAI resources.
Customers have the flexibility to assign token-based limits on any counter key, such as subscription key or IP address, tailoring enforcement to their specific use cases.
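For example, a minimal policy sketch (the limit value and counter key below are illustrative; consult the policy reference for the full schema) that caps each API Management subscription at 500 TPM:

```xml
<policies>
    <inbound>
        <base />
        <!-- Cap each APIM subscription at 500 tokens per minute.
             counter-key accepts any policy expression; for per-caller
             limits by IP, use @(context.Request.IpAddress) instead. -->
        <azure-openai-token-limit
            counter-key="@(context.Subscription.Id)"
            tokens-per-minute="500"
            estimate-prompt-tokens="true" />
    </inbound>
</policies>
```

With estimate-prompt-tokens enabled, prompt tokens are estimated on the gateway before the request is forwarded, so calls that would exceed the limit can be rejected without reaching the backend.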
Because the policy relies on token usage metrics returned from the OpenAI endpoint, customers can accurately monitor and enforce limits in real time. The policy can also pre-calculate prompt tokens on the API Management side, avoiding unnecessary requests to the OpenAI backend when the limit has already been exceeded.
Furthermore, customers can surface token counts through response headers and policy variables, such as tokens-consumed and remaining-tokens, for enhanced control and customization within their policies.
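As a sketch of that customization, the policy exposes optional attributes for naming the headers that report token counts back to callers; the header names below are example values a customer might choose, not fixed defaults:

```xml
<!-- Return token usage to the caller in custom response headers.
     "consumed-tokens" and "remaining-tokens" are example header
     names chosen here for illustration. -->
<azure-openai-token-limit
    counter-key="@(context.Subscription.Id)"
    tokens-per-minute="500"
    estimate-prompt-tokens="false"
    tokens-consumed-header-name="consumed-tokens"
    remaining-tokens-header-name="remaining-tokens" />
```

API consumers can then read these headers to throttle themselves client-side before hitting the limit.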
With the introduction of the Azure OpenAI Token Limit policy, customers can now centrally manage limits across multiple API consumers and OpenAI endpoints, in both streaming and non-streaming scenarios, streamlining management and improving resource utilization.
Click here to learn more.