Sorry, you need to enable JavaScript to visit this website.

CXL Memory in Windows

Submitted by Anonymous (not verified) on
In this presentation, we will present the architecture of CXL memory in Windows and describe the support that will be available. We will describe the possible usages of CXL memory, the RAS workflows and the developer interfaces available to use CXL memory.

Storage Blending: The Evolving Role of HDD and SSD in Data Systems for an AI and Analytics Era

Submitted by Anonymous (not verified) on

As the rapid expansion of AI and analytics continues, storage system architecture and total cost of ownership (TCO) are undergoing significant transformation. Emerging technologies such as HAMR in rotating storage and high-capacity, data center-grade QLC in flash promise to redefine the landscape for both hyperscale and OEM data storage solutions. But what will that evolution look like?

Accelerating Object Storage for AI/ML with S3 RDMA

Submitted by Anonymous (not verified) on
Amazon S3 is the de facto standard for object storage—simple, scalable, and accessible via HTTP. However, traditional S3 access via TCP/IP is CPU-intensive and not designed for the low-latency, high-throughput needs of modern GPU workloads. S3 RDMA aims to bridge that gap. S3 RDMA implements S3 object PUT/GET data transfers over RDMA, essentially bypassing the HTTP stack entirely.

Beyond Throughput: Benchmarking Storage for the Complex I/O Patterns of AI with MLPerf Storage and DLIO

Submitted by Anonymous (not verified) on
Training state-of-the-art AI models, including LLMs, creates unprecedented demands on storage systems that go far beyond simple throughput. The I/O patterns in these workloads—characterized by heavy metadata operations, multi-threaded asynchronous I/O, random access, and complex data formats—present a significant bottleneck that traditional benchmarks fail to capture.

Global Distributed Client-side Caching for HPC/AI Storage Systems

Submitted by Anonymous (not verified) on

HPC and AI workloads require processing massive datasets and executing complex computations at exascale speeds to deliver time-critical insights. In distributed environments where storage systems coordinate and share results, communication overhead can become a critical bottleneck. This challenge underscores the need for storage solutions that deliver scalable, parallel access with microsecond latencies from compute clusters. Caching can help reduce communication costs when implemented on either servers or clients.

Towards Unified Knowledge Platforms: Evolving Storage Systems for Generative and Agentic AI

Submitted by Anonymous (not verified) on

The rise of Generative and Agentic AI has driven a fundamental shift in storage —from storing data to functioning as comprehensive knowledge management systems. Traditional model of storing data and system metadata and providing analytical capabilities on top of it is now inadequate. Agentic AI workflows require access to semantically enriched representations of data, including embeddings and derived metadata (e.g., classification, categorization). As data is ingested, storage systems must support real-time or near-real-time generation and association of such metadata.

KV-Cache Storage Offloading for Efficient Inference in LLMs

Submitted by Anonymous (not verified) on
As llm serve more users and generate longer outputs, the growing memory demands of the Key-Value (KV) cache quickly exceed GPU capacity, creating a major bottleneck for large scale inference systems. In this talk, we discuss KV-cache storage offloading, a novel technique that enables inference acceleration by relocating attention cache data to high speed, low latency storage tiers.

HDD Innovation for Hyperscale: CDLs, SMR, Depop and SCSI Advancements in Linux

Submitted by Anonymous (not verified) on
Hyperscale storage demands are pushing HDD technologies to new levels of sophistication and industry collaboration. This technical session brings together Damien Le Moal (Western Digital) and Rick Kutcipal (Broadcom, STA Board) to deliver a joint update on emerging and maturing HDD features designed to meet hyperscaler requirements. Topics will include the latest developments in Command Duration Limits (CDLs), drive depopulation (depop), and Shingled Magnetic Recording (SMR), as well as the state of SCSI protocol support in Linux.
Subscribe to