Why building AI applications still means building infrastructure-first
AI success depends on secure, scalable data infrastructure
In February 2026, Moltbook, a social network built for AI agents, learned a familiar lesson in a new way: you can’t scale AI without building the right infrastructure underneath it.
Despite rapid traction and heavy funding, the company rushed into production with parts of the stack generated by AI agents, without fully validating the infrastructure.
The cracks showed quickly. Researchers found fundamental gaps in how data was stored, accessed, and protected, allowing unauthorized posting and exposing sensitive user information.
CTO at MinIO.
Vibe coding, no doubt, has applications in today's software world. However, Moltbook had received millions in venture capital money and has since been acquired by Meta. This wasn’t an experiment; it was a production system handling real user data.
And that’s the point: once AI moves from prototype to production, shortcuts in architecture across compliance, data privacy, and auditability become real business risks.
The lesson here is about more than security, it’s about how AI is raising the cost of getting data infrastructure wrong.
The growing challenge in AI infrastructure
AI agents don’t just use data, they’re constantly creating it, moving it, and acting on it at a scale most systems weren’t built for. And now that foundation models are widely available, the model itself isn’t what sets companies apart anymore.
Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!
The real advantage comes from the infrastructure behind it. For IT leaders, that shifts the risk from theoretical to operational: broken pipelines, gaps in compliance, and security vulnerabilities as data moves through the system.
The challenge is architectural. AI workloads don’t stay in one place, they move across on-prem environments, the cloud, and the edge, with machines authenticating and interacting with each other nonstop in the background.
And with AI projects going from experiment to production so quickly, teams are being forced to rethink how data actually flows: how fast it moves, how reliable it is, how it’s governed, and where it should live.
That’s why many are starting to move back toward on-prem and hybrid models, designed around where data naturally sits, rather than what’s most convenient in the cloud.
It’s at scale where things start to break down. Caches go stale, data formats don’t line up, and pipelines that worked in testing begin to strain, dragging down performance and creating inconsistent user experiences. Behind the scenes, access controls built for human behavior don’t hold up in a machine-driven world.
Governance rarely fails all at once; it slowly slips as teams focus on improving models instead of maintaining the data layer. And over time, that leads to something much riskier: a data foundation that’s increasingly messy, exposed, and out of compliance.
Why storage and governance must be foundational
AI initiatives at enterprise scale don't fail because of models, they fail because the data supporting them isn't ready. Training pipelines stall when data storage can't sustain the throughput GPU clusters demand.
Inference degrades when feature data is scattered across siloed environments with no consistent access layer. Governance breaks down when there is no single authoritative record of what data exists, who accessed it, and under what policy.
The consequences of treating storage as an afterthought are structural, and this is a pattern seen repeatedly across large financial institutions from my personal experience.
When the data under AI workloads lives in fragmented silos spanning on-premises systems and cloud environments, every AI practitioner becomes an integration engineer first. Teams spend cycles moving data instead of working with it. Compliance teams chase lineage across systems that were never designed to provide it.
Risk surfaces multiply precisely where visibility is lowest, in the data path between raw objects and the models consuming them, creating operational and regulatory exposure.
For regulated industries such as financial services and healthcare, making storage the governance layer, not a separate concern bolted on afterward, is the only solution.
With policy enforcement, encryption, and IAM controls embedded in the data path across both object storage and industry standard table formats, AI practitioners gain self-service access without sacrificing auditability or control.
Structured and unstructured data are governed under a unified platform, which means compliance teams have consistent lineage and access records whether the workload is model training against raw objects or analytics running against Apache Iceberg tables.
The data under every AI initiative becomes a controlled, observable, high-performance foundation.
What leaders must prioritize now
As organizations move from experimentation to production, the data layer becomes the factor that determines whether AI actually scales. It’s a subtle shift, but an important one: success is no longer defined by the sophistication of the model, but by whether the infrastructure around it can support real-world demands.
That means:
- Treat storage as a strategic decision, not a backend concern. Data integrity, governance, and performance are architectural requirements, not supplementary support. High-performance, S3-native object storage that can keep pace with GPU clusters and AI pipelines is now table stakes.
- Design for AI agents as primary data consumers. Autonomous systems depend on fine-grained access control and full auditability for machine-to-machine interactions.
- Stay cloud-flexible without becoming cloud-dependent. The ability to run consistently across on-premises, hybrid, and multi-cloud environments without egress penalties or lock-in is what gives organizations real control.
- Eliminate the silos AI exposes. Bringing governed, SQL-accessible structure to the same platform as raw object data closes the gaps where visibility and control tend to break down.
Autonomous, production-scale AI is within reach, but it’s only as strong as the foundation beneath it. Deploying AI tools without enterprise-grade data infrastructure isn’t bold, it’s a liability. The organizations that treat the data layer as foundational will be the ones that scale AI safely, reliably, and for the long term.
We've featured the best AI website builder.
This article was produced as part of TechRadar Pro Perspectives, our channel to feature the best and brightest minds in the technology industry today.
The views expressed here are those of the author and are not necessarily those of TechRadarPro or Future plc. If you are interested in contributing find out more here: https://www.techradar.com/pro/perspectives-how-to-submit
CTO at MinIO.
You must confirm your public display name before commenting
Please logout and then login again, you will then be prompted to enter your display name.