
Interview: Nvidia on AI workload demands and storage performance


19 July 2024


Artificial intelligence (AI) workloads are new and different to those we’ve seen previously in the enterprise. They range from highly compute-intensive training to day-to-day inferencing and retrieval-augmented generation (RAG) referencing that barely tickles CPU and storage input/output (I/O).

So, across the various types of AI workload, the I/O profile and the impact on storage can vary dramatically.

In this second of a two-part series, we talk to Nvidia vice-president and general manager of DGX Systems Charlie Boyle about the demands of checkpointing in AI, the roles of storage performance markers such as throughput and access speed in AI work, and the storage attributes required for different types of AI workload.

We pick up the discussion following the chat in the first article about the key challenges in data for AI projects, practical tips for customers setting out on AI, and differences across AI workload types such as training, fine-tuning, inference, RAG and checkpointing.

Antony Adshead: Is there a kind of standard ratio of checkpoint writes to the volume of the training model?

Adshead: We’ve talked about training and you’ve talked about needing fast storage. What’s the role of throughput alongside speed?

Adshead: What’s the difference in terms of storage I/O between training and inference?
