SNIA Developer Conference September 15-17, 2025 | Santa Clara, CA
Stevens Creek
Tue Sep 17 | 3:05pm
Discover the latest advancements in Linux file access, and explore recent enhancements to the Linux SMB3.1.1 client. It has been another great year of progress in improving access to remote storage from Linux whether in the cloud and or the wide variety of SMB3 file servers (Azure, Windows, Samba, ksmbd, Netapp, Macs and many others). Many improvements have been made to securely, reliably and efficiently access remote data. The Linux SMB3.1.1 client, cifs.ko, continues to be one of the most active filesystems in Linux. This presentation will cover new security, performance and reliability features recently added to the Linux client, and also new features you can expect to see over the coming year. Whether accessing data from the smallest devices or the largest (and even the cloud) these improvements are very exciting. Over the past year, new security features have been added, performance has improved, as has support for folios and the new Linux memory management model. Directory caching is now faster and more flexible, more POSIX/Linux compatibility features have been added, support for swapfiles over SMB3.1.1 mounts has improved, as has support for special Linux file types. As we add support for new authentication options for Linux (not just Kerberos and NTLMSSP), and also as we add support for SMB3.1.1 over QUIC (using the exciting new Linux kernel QUIC driver), and stronger and faster packet signing, SMB3.1.1 will be able to better address evolving security requirements, not just on-prem but also in the cloud. The protocol already supports extremely strong encryption options, and also man in the middle attack prevention. This presentation will describe many of these SMB3.1.1 features and improvements, and how to best use them to improve your workloads.
Describe new security and performance and reliability features that have been added to the Linux client. What new features for accessing file storage remotely from Linux are now available? How has the SMB3.1.1 client improved?
Describe new features (e.g. new syscalls or file system API enhancements) that have been added to the Linux File System layer that we need to be aware of? And add support for ...?
Describe what new enhancements are expected to be added to the Linux SMB3.1.1 client over the next year and why.
Describe ways that are being considered to enhance the protocol to make SMB3.1.1 even better from Linux, and what could be added to the protocol to make Linux user's experience even better.
Describe some helpful mount options, tools and configuration parameters that make it easier to access remote files better from Linux.
Mounting S3-compatible storage via S3FS seems like an easy way to enable POSIX-like access in Kubernetes. But in real AI/ML workloads—e.g., training with PyTorch or TensorFlow—we hit major issues: crashes from incomplete writes, vanished checkpoints, inconsistent metadata, and unpredictable I/O latency.
In this session, we’ll share how we overcame these challenges by designing a scalable, POSIX-compliant distributed file system that still leverages the cost-effectiveness of object storage. Instead of abandoning object storage, we rebuilt the access layer for better consistency, performance, and observability in large-scale environments.
Attendees will gain insight into architectural trade-offs, POSIX compliance in user space, Kubernetes integration via CSI and Operators, and observability benchmarks collected from real production AI training clusters.
Ideal for platform engineers, MLOps, and K8s architects seeking reliable, scalable storage for data-heavy workloads.
This is an intermediate session; attendees should be comfortable with object storage, file storage, and the basic concepts of the Kubernetes CSI driver.
Rubrik is a cybersecurity company protecting mission critical data for thousands of customers across the globe including banks, hospitals, and government agencies. SDFS is the filesystem that powers the data path and makes this possible. In this talk, we will discuss challenges in building a masterless distributed filesystem with support for data resilience, strong data integrity, and high performance which can run across a wide spectrum of hardware configurations including cloud platforms. We will discuss the high level architecture of our FUSE based filesystem, how we leverage erasure coding for maintaining data resilience and checksum schemes for maintaining strong data integrity with high performance. We will also cover the challenges in continuously monitoring and maintaining the health of the filesystem in terms of data resilience, data integrity and load balance. Further we will go over how we expand and shrink resources online from the filesystem. We will also discuss the need and challenge of providing priority natively in our filesystem to support a variety of workloads and background operations with varying SLA requirements. Finally, we will also touch on the benefits and challenges of supporting encryption, compression, and de-duplication natively in the filesystem.
GoogleFS introduced the architectural separation of metadata and data, but its reliance on a single active master imposed fundamental limitations on scalability, redundancy, and availability. This talk presents a modern metadata architecture, exemplified by SaunaFS, that eliminates the single-leader model by distributing metadata across multiple concurrent, multi-threaded servers. Metadata is stored in a sharded, ACID-compliant transactional database (e.g., FoundationDB), enabling horizontal scalability, fault tolerance through redundant metadata replicas, reduced memory footprint, and consistent performance under load. The result is a distributed file system architecture capable of exabyte-scale operation in a single namespace while preserving POSIX semantics and supporting workloads with billions of small files.
Enterprise IT infrastructures face soaring AI and analytics demands, driving the need for storage that leverages existing networks, cuts power-hungry server counts, and frees CAPEX for AI. Yet current solutions create isolated silos: proprietary, server-based systems that waste power, lack cloud connectivity, and force large teams to manage multiple silo technologies—locking data behind vendor walls and hampering AI goals. Modeled on the Open Compute Project, the Open Flash Platform (OFP) liberates high-capacity flash through an open architecture built on standard pNFS which is included in every Linux distribution. Each OFP unit contains a DPU-based Linux instance and network port, so it connects directly as a peer—no additional servers. By removing surplus hardware and proprietary software, OFP lets enterprises use dense flash efficiently, halving TCO and increasing storage density 10×. Early configurations deliver up to 48 PB in 2U and scale to 1 EB per rack, yielding a 10× reduction in rack space, power, and OPEX and a 33 % longer service life. This session explains the vision and engineering that make OFP possible, showing how open, standards-based architecture can simplify, scale, and free enterprise data.
The performance of network file protocols is a critical factor in the efficiency of the AI and Machine Learning pipeline. This presentation provides a detailed comparative analysis of the two leading protocols, Server Message Block (SMB) and Network File System (NFS), specifically for demanding AI workloads. We evaluate the advanced capabilities of both protocols, comparing SMB3 with SMB Direct and Multichannel against NFS with RDMA and multistream TCP configurations. The industry-standard MLPerf Storage benchmark is used to simulate realistic AI data access patterns, providing a robust foundation for our comparison. The core of this research focuses on quantifying the performance differences and identifying the operational and configuration overhead associated with each technology.
The Samba file server is evolving beyond traditional TCP-based transport. This talk introduces the latest advancements in Samba's networking stack, including full support for SMB over QUIC, offering secure, firewall-friendly file sharing using modern internet protocols. We’ll also explore the ongoing development of SMB over SMB-Direct (RDMA), aimed at delivering low-latency, high-throughput file access for data center and high-performance environments. Join us for a deep dive into these transport innovations, their architecture, current status, and what's next for Samba’s high-performance networking roadmap.