• Public Area
  • Members Area
  • Workspace
SNIA
Data Management Forum
Members Login
  • Forums
  • SNIA Home
  • Forums & Initiatives
  • Data Management Forum
  • Programs
    • Data Protection Initiative
      • Continuous Data Protection
      • Data Deduplication and Space Reduction SIG
      • Virtual Tape Libraries
    • Information Lifecycle Management Initiative
      • ILM Maturity Model
      • ILM Vision Documents
    • Long-Term Archive and Compliance Storage Initiative
      • Database Information Management SIG
      • 100 Year Archive Task Force
      • Sub-committees
      • LTACSI Mission
      • Problem Statement and Resources
  • Knowledge Center
    • Terminology Bridge
    • Tutorials
    • "Snap In" Education
    • Webcasts
      • Advanced Deduplication Practices
      • Database Archiving Fundamentals
      • Deduplication in Backup World
      • ILM Maturity Model
      • Identifying Ideal Apps for DB Archiving
    • White Papers and Reports
    • Request a Speaker
      • Data Protection Topics
      • ILM Topics
      • Long-term Digital Information Retention & Preservation T...
      • Request a DMF Speaker - Form
  • News and Events
    • DMF eBulletins
    • Press Releases
    • Events
    • Media Coverage
  • About
    • Become a DMF Member
      • Participation Options
      • DMF BOD Responsibilities
      • Program Leadership Responsibilities
    • DMF Leadership
    • DMF Alliances
    • Contact the DMF
    • Current DMF Members
  • Ask the Experts

Become a Member!

  • Join Now!
  • Member Directory
Forums & Initiatives » Data Management Forum » Long-Term Archive and Compliance Storage Initiative » Problem Statement and Resources

Problem Statement and Resources

Defining Storage Practices and Standards for Long-Term Digital Information Retention

Long Term Retention Drivers

Although corporate and legal issues have recently brought data archiving to the light of day, the problems associated with preserving digital information are not new. Archiving for a few years is hard enough, but when requirements dictate that data be retained for longer, problems with media deterioration and technology obsolescence can seem insurmountable.
- Source: Galen Schreck - Forrester Aug. 2005

Long term preservation of digital content is a big challenge in the Information Society era, digital information in any form is at risk to be lost forever. Technology on which digital content relies becomes obsolete and application versions and files formats change, making data soon inaccessible. Even if content is coded in the simplest format, such as ASCII code, storage media degradation and obsolescence could make it disappear. Even on-line information such as web pages and databases, are vulnerable as much as their web structure become complex thanks to (aging) hyperlinks and cross references.
- Source: Alfredo M. Ronchi - Medici Framework

The Digital Crisis

  • Risk of losing digital information over time
  • Growing cost and complexity of physical and logical migration
  • Overwhelming volume of digital information to preserve long-term
  • Increased legal, business, and security risk

What is the state of best practices today?

Standards for Electronic Archives: Long-term electronic archive designs should consider incorporation of national or international specifications such MoReq or Open Archival Information System (OAIS). Standards such as ISO 154895 establish guidelines for records management policies and systems but fall short of specifying functional details of automated systems.

How do you cost-effectively manage and automate the preservation of petabytes of information forever, growing at 50% to 100% per year in the data center? How do you migrate multiple PBs per year logically and physically? The answer is you can’t. The IT archive process and system is technologically broken. It doesn’t scale.  - Michael Peterson, Chief Strategy Advocate, SNIA-DMF

Migration is Required: NARA says "If information is on disk drives, migrate it every 3 years and if on tape, every 5 years."  But, what about the ability to read and interpret the information long term?

Physically, long-term storage is not about media life (because migration is required) but about periodic migration to newer media. But, in a large repository, how do you migrate multiple pedabytes every year and at what cost?  At some point migration becomes overwhelming and the digital retention process is broken - it does not scale.

Logically, long-term retention is about the ability to read the information and to be able to use it. Applications have a relatively short life and rarely have the ability to read information older than a few revisions. Even standard formats evolve, change, and become obsolete. At some point, information has to be migrated periodically to a new standard logical format.

Then what about issues such as compliance, integrity, authenticity, privacy, and discovery? How are you guaranteeing these over the retention period?

The 100 Year Archive Task Force believes that it has identified solutions to these two "grand challenges" of long-term retention. Join us and help progress the work.

Standards and Resources:  

Long-term electronic archive designs should consider incorporation of national or international specifications such MoReq or Open Archival Information System (OAIS). Standards such as ISO 154895 establish guidelines for records management policies and systems but generally fall short of specifying functional details of automated systems. However, DoD 5015.2-STD and MoReq contain useful information defining functional requirements for electronic record archives. Both of these also define selected metadata elements required for an electronic records archive. Either document would be appropriate as a starting point for acquisition or construction of an electronic archive system. Finally, both ARMA International and the National Archives Records Administration (NARA) provide planning and guideline documents at their respective web sites

  • DOCUMENTS: 
    Model Requirements for the Management of Electronic Records (MoReq) http://europa.eu.int/idabc/en/document/2631  
    International Research on Permanent Authentic Records in Electronic Systems (InterPARES) http://www.interpares.org/
    ERM Guidance on Methodology for Determining Agency-unique Requirements  http://www.archives.gov/records-mgmt/policy/requirements-guidance.html
    OAIS Reference Model http://ssdoo.gsfc.nasa.gov/nost/isoas/overview.html
     
  • ASSOCIATIONS and ORGANIZATIONS; 
    ARMA International  http://www.arma.org
    The US National Archives’ Electronic Records Archives (ERA) http://www.archives.gov/era
    Digital Preservation Coalition  http://www.dpconline.org/graphics/index.html
    DLM Forum: http://dlmforum.typepad.com/
    ISO Archiving Standards (0AIS) http://ssdoo.gsfc.nasa.gov/nost/isoas/
    US Library of Congress Digital Preservation Program http://www.digitalpreservation.gov/
    The Dublin Core Metadata Initiative http://www.dublincore.org

  • EVENTS:
     IEEE Mass Storage Systems and Technology Conference MSST2007
     Enterprise Information World 2007
    Storage Networking World 
  • DMF Community

Search

  • View DMF eBulletins
  • Data DeDuplication & Space Reduction SIG
  • Get Involved with
    the DMF
  • Hot Topics

What's New

  • Step by Step Guide to using the ILM Maturity Model
    Download whitepaper here...

eBulletin

Sign-up to receive newsletter. Enter email address:

KAVI® where .orgs work
Home | Contact Us | Website Feedback | Site Map | Privacy Policy
Copyright © 2010 SNIA