Massively Scalable File Storage (Fall 2013) | SNIA

Abstract

The Internet changed the world and continues to revolutionize how people connect, exchange data, and do business. This radical change is one of the causes of the rapid explosion in data volume, which has required a new approach to data storage design. One common element is that unstructured data rules the IT world. How can the famous Internet services we all use every day scale with thousands of new users added daily and continue to deliver an enterprise-class SLA? What technologies lie behind a cloud storage service that supports hundreds of millions of users? This tutorial covers technologies introduced by famous papers on the Google File System and BigTable, Amazon Dynamo, and Apache Hadoop. In addition, parallel, scale-out, distributed, and P2P approaches are presented, including Lustre, PVFS, and pNFS along with several proprietary ones. The tutorial also covers some key features that are essential at large scale, to help attendees understand and differentiate industry vendors' offerings.

Learning Objectives

Understand how enterprise databases can store data on file-based storage, using file-access protocols (NFS and SMB2/3).
Understand the advantages of using file-based storage, including easier and faster storage provisioning that can be performed by IT pros or DBAs rather than SAN specialists.
Understand the new features of SMB 3.0 that are specifically targeted at database reliability, availability, and performance.