Kubernetes AI toolchain operator


Source is Azure Business News

You can now run specialized machine learning workloads like large language models (LLMs) on Azure Kubernetes Service (AKS) more cost-effectively and with less manual configuration.

The initial release of the Kubernetes AI toolchain operator, an open source project, automates LLM deployment on AKS across available CPU and GPU resources by selecting optimally sized infrastructure for the model. It makes it easy to split inference across multiple lower-GPU-count VMs, which increases the number of Azure regions where workloads can run, eliminates wait times for higher-GPU-count VMs, and lowers overall cost. You can also choose from preset models with images hosted by AKS, significantly reducing overall inference service setup time.
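To give a sense of how the operator is driven, the open-source project (KAITO) uses a `Workspace` custom resource that pairs a GPU instance type with a preset model image. The sketch below is a minimal, illustrative example: the workspace name, instance type, and label values are assumptions, not prescribed values, and the CRD shape may evolve across releases.

```yaml
apiVersion: kaito.sh/v1alpha1
kind: Workspace
metadata:
  name: workspace-falcon-7b          # hypothetical workspace name
resource:
  instanceType: "Standard_NC12s_v3"  # example GPU VM SKU for the operator to provision
  labelSelector:
    matchLabels:
      apps: falcon-7b                # illustrative label for the inference workload
inference:
  preset:
    name: "falcon-7b"                # a preset model with an image hosted by AKS
```

Applying a resource like this with `kubectl apply` is what lets the operator pick appropriately sized infrastructure and stand up the inference service without manual GPU configuration.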

https://aka.ms/aks/ai-toolchain-operator

