Cloud

Podcast: Storage and AI training, inference, and agentic AI

Door

6 december 2024

248

In this podcast, we look at storage and artificial intelligence (AI) with Jason Hardy, chief technology officer for AI with Hitachi Vantara.

He talks about the performance demands on storage that AI processing brings, but also highlights the extreme context switching it can result in as enterprises are forced to pivot between training and inferencing workloads in AI.

Hardy also talks about a future that potentially includes agentic AI – AI that designs its own workflow and takes decisions for itself – that will likely result in an even greater increase in workload context switching.

Antony Adshead: What demands do AI workloads place on data storage?

Jason Hardy: It’s a two-dimensional problem. Obviously, there is that AI needs speed, speed, speed, speed and more speed. Having that level of processing, especially when talking about building LLMs and doing foundational model training, it [AI] needs extremely high performance capabilities.

That is still the case and will always be the case, especially as we start doing a lot of this stuff in volume, as we start to trend into inferencing, and RAG, and all of these other paradigms that are starting to be introduced to it. But, the other demand that I think is – I don’t want to say overlooked, but is under-emphasised – the data management side of it.

For example, how do I know what data I need to bring and introduce into my AI outcome without understanding what data I actually have? And one could say, that’s what the data lake is for, and really, the data lake’s just a big dumping ground in a lot of cases.

So, yes, we need extremely high performance, but also we need to know what data we have. I need to know what data is applicable for the use case I’m starting to target, and then how I can appropriately use it, even from a compliance requirement, or a regulatory requirement, or anything like that from those themes.

It’s really this two-headed dragon, almost, of needing to be extremely performant, but also to know exactly what data I have out there, and then having proper data management practices and tools and the like all wrapped around that.

And a lot of that burden, especially as we look at the unstructured data side, is very critical and embedded into some of these technologies like object storage, where you have these metadata functions and things like that, where it gives you a little bit more of that descriptive layer.

But when it comes to traditional NAS, that’s a lot more of a challenge, but also a lot more of where the data’s coming from. So, it’s, again, this double-sided thing of, “I need to be extremely fast, but I also need to have proper data management tools wrapped around it.”

Podcast: Storage and AI training, inference, and agentic AI

Recente posts

Klantcase Codebee

Application Modernization Tools Market 2020 Key Factors, Growth Scenario and Emerging Opportunities with Current...

FireEye sold to private equity, Mandiant regains independence

Tech firms launch digital identity scheme for real estate

Guidelines for Getting to Reproducible Builds

Meest bekeken posts

8 ways to enhance data center physical security

Statistieken in de BIT Portal 2.0

Proliferation of on-premise GenAI platforms is widening security risks

How the UK's cloud strategy was hijacked by a hyperscaler duopoly

Tijdelijke omleiding i.v.m. reparatie toegangspoort

POPULAIRE BERICHTEN

BIT-blogs – Security monitoring bij BIT

Booming Segments of AI Conversational Platform Market 2020-2028 with AWS, Google,...

Okta picks up Auth0 for $6.5bn

POPULAIRE CATEGORIE

Features for AI use cases

Training and inference needs

Recente posts

Meest bekeken posts

POPULAIRE BERICHTEN

POPULAIRE CATEGORIE