Public Preview: Azure OpenAI Semantic Caching policy in Azure API Management


We’re excited to announce the Public Preview of the Azure OpenAI Semantic Caching policy in Azure API Management! This feature lets customers reduce token usage through semantic caching, which stores completions for prompts with similar meanings and reuses them for later requests.

With this policy, customers can easily configure semantic caching for their Azure OpenAI endpoints. The cache itself is Azure Cache for Redis Enterprise or any other external cache that has been onboarded to API Management, providing flexibility in caching solutions.

The policy uses an Azure OpenAI Embeddings model to compute a vector for each incoming prompt. When a new prompt is semantically similar to one seen before, the completion stored for that earlier prompt is returned from the cache instead of calling the model, reducing token consumption and improving response latency.
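
As a rough illustration, the policy pair below shows how this might look in a policy definition. It is a minimal sketch based on the policy names documented for the preview, `azure-openai-semantic-cache-lookup` and `azure-openai-semantic-cache-store`; the backend ID `embeddings-backend`, the similarity threshold, and the cache duration are placeholder values to adapt to your own deployment.

```xml
<policies>
    <inbound>
        <base />
        <!-- Compute an embedding for the incoming prompt via the referenced
             embeddings backend and look for a semantically similar cached entry. -->
        <azure-openai-semantic-cache-lookup
            score-threshold="0.05"
            embeddings-backend-id="embeddings-backend"
            embeddings-backend-auth="system-assigned" />
    </inbound>
    <outbound>
        <!-- On a cache miss, store the returned completion for 60 seconds. -->
        <azure-openai-semantic-cache-store duration="60" />
        <base />
    </outbound>
</policies>
```

The score threshold governs how close two prompts must be to count as semantically similar; tuning it trades cache hit rate against the risk of serving a completion for a subtly different question.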

Customers can configure semantic caching centrally for multiple API consumers, streamlining management and ensuring consistent caching behavior across their API ecosystem. A shared cache also increases the chance that any given prompt has already been answered, further reducing token usage and improving the scalability and efficiency of Azure OpenAI integrations.
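
When one policy serves many consumers, cache entries can also be partitioned so that, for example, each subscription only sees completions cached for its own traffic. A sketch, again assuming the preview policy syntax, using the `vary-by` element with an illustrative policy expression:

```xml
<azure-openai-semantic-cache-lookup
    score-threshold="0.05"
    embeddings-backend-id="embeddings-backend"
    embeddings-backend-auth="system-assigned">
    <!-- Partition the cache per subscription so one consumer never
         receives a completion cached for another consumer. -->
    <vary-by>@(context.Subscription.Id)</vary-by>
</azure-openai-semantic-cache-lookup>
```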


Source is Azure Business News
