Kubernetes AI toolchain operator

16 November 2023

You can now run specialized machine learning workloads like large language models (LLMs) on Azure Kubernetes Service (AKS) more cost-effectively and with less manual configuration.

The initial release of the Kubernetes AI toolchain operator, an open source project, automates LLM deployment on AKS across available CPU and GPU resources by selecting optimally sized infrastructure for the model. It lets you split inferencing across multiple VMs with lower GPU counts, which increases the number of Azure regions where workloads can run, eliminates wait times for VMs with higher GPU counts, and lowers overall cost. You can also choose from preset models with images hosted by AKS, which significantly reduces the setup time for an inference service.
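The sizing idea above can be illustrated with a rough sketch. This is not the operator's actual selection logic, which the announcement does not describe. The `VmSize` class, the `nodes_needed` and `cheapest_plan` helpers, and every VM name, memory figure, and cost are hypothetical. The sketch also assumes a model only needs to fit in combined GPU memory, ignoring runtime overhead and inter-node communication.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class VmSize:
    """Hypothetical VM size; names and numbers are illustrative, not real Azure SKUs."""
    name: str
    gpu_count: int
    gpu_memory_gb: int    # memory per GPU
    hourly_cost: float    # made-up relative cost per VM


def nodes_needed(model_memory_gb: float, vm: VmSize) -> int:
    """VMs of this size required to fit the model across their combined GPU memory."""
    return math.ceil(model_memory_gb / (vm.gpu_count * vm.gpu_memory_gb))


def cheapest_plan(model_memory_gb: float, available: list[VmSize]) -> tuple[VmSize, int]:
    """Return the (VM size, node count) pair with the lowest total hourly cost."""
    if not available:
        raise ValueError("no VM sizes available in this region")
    plans = [(vm, nodes_needed(model_memory_gb, vm)) for vm in available]
    return min(plans, key=lambda plan: plan[1] * plan[0].hourly_cost)


# Assumed example inventory: one small single-GPU size, one large eight-GPU size.
small = VmSize("small-1gpu", gpu_count=1, gpu_memory_gb=16, hourly_cost=1.0)
large = VmSize("large-8gpu", gpu_count=8, gpu_memory_gb=16, hourly_cost=10.0)

vm, count = cheapest_plan(40, [small, large])
print(f"{count} x {vm.name}")
```

With these made-up numbers, three single-GPU VMs come out cheaper than one eight-GPU VM. If a region offers only the smaller size, the same function still returns a workable plan, which mirrors the claim that splitting inference widens the set of usable regions.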

https://aka.ms/aks/ai-toolchain-operator

Source: Azure Business News
