Artificial Intelligence is often positioned as the next phase of hyperscale computing—but most Enterprises operate under a very different set of constraints. Unlike cloud native companies, Enterprises face a complex intersection of legacy systems, economic pressures, and real-world operational constraints—factors often invisible to engineers building the underlying technologies. This session focuses on the realities of deploying AI at scale across the hybrid cloud with environments that do not resemble hyperscale architecture.
In this session we will examine how three forces are reshaping Enterprise architecture. First, the economic model: inference, retrieval, and orchestration create ongoing non-linear costs making efficiency—not scale—the primary design principle. Second, the technical reality: retrieval-augmented generation (RAG) requires new compute with latency and security constraints, making hybrid cloud more essential to balance the tradeoffs of performance, cost, and control. Third, the operational considerations: legacy systems, regulatory requirements, and the inability to trust AI with sensitive data fundamentally shape what can be deployed into production.
This session will focus on lessons learned from managing AI workloads constrained by database bottlenecks, cloud storage capacity shortages, storage networking congestion, inconsistent supply of hardware components, and the operational challenges of model drift, quantization choices, and cloud-specific performance variability. It reveals why Enterprise buyers prioritize reliability, predictable supply, and cost efficiency over cutting edge performance—and how misaligned industry innovation can unintentionally disadvantage non-hyperscale customers.
Ultimately, this keynote aims to bridge the gap between AI system and component builders and AI operators—illustrating the practical consequences of design decisions, the constraints faced by real customers, and the opportunities for the industry to build more durable, accessible, and economically sustainable AI infrastructure.