27
Gabe Nault LDS Church [email protected] Future Perfect Conference 2012 Ensuring Data Integrity in a Digital Preservation Archive image courtesy of IBM

Gabe Nault Data Integrity

Embed Size (px)

DESCRIPTION

Ensuring Data Integrity in a Digital Preservation Archive Gabe Nault

Citation preview

Page 1: Gabe Nault Data Integrity

Gabe NaultLDS Church

[email protected]

Future Perfect Conference 2012

Ensuring Data Integrity in a Digital Preservation Archive

image courtesy of IBM

Page 2: Gabe Nault Data Integrity

Introducing the LDS Church• The Church of Jesus Christ of Latter-day

Saints

• Global Christian church with 14 million members

• 3 universities, 1 college

• State-of-the-art audio- visual capabilities

• Scriptural mandate to keep and preserve records since since 1830

photo by Henok Montoya

Page 3: Gabe Nault Data Integrity

The Church History Department

Church History Library on Temple Square

• Preserves records of enduring value from Church leaders, departments, universities, and affiliations (more than 35 organizations)

• Typically, less than 10% of records are

candidates for preservation

Page 4: Gabe Nault Data Integrity

Granite Mountain Records Vault• Bored into a solid

granite mountain

• Stores large microfilm collections and valuable Church artifacts

• Plans recently developed to renovate the facility for digital preservation

Page 5: Gabe Nault Data Integrity

• Audiovisual records will consume majority of our archive capacity

• 100+ PB in a decade for a single copy!

The Media Services Department

Conference Center on Temple Square

Mormon Tabernacle Choir and Orchestra

Free Bible videos from biblevideos.lds.org

Page 6: Gabe Nault Data Integrity

DRPS System Architecture

DigitalRecords Preservation System

DRPS Ingest ToolsFixity

Creation

Storage ExtensionsFixity

Bridge

StorageGRID Information

LifecycleManagement

PreservationFunctions

Tape Interface Tivoli Storage Manager

IBM

Page 7: Gabe Nault Data Integrity

DRPS Infrastructure

Page 8: Gabe Nault Data Integrity

DRPS Highlights• Multiple copies in multiple

geographic locations (eventually)• Approximately 1 PB spinning

media• Automatic replication to remote

site(s)• End to end data integrity• Tape base permanent storage

Page 9: Gabe Nault Data Integrity

Why Tape for Preservation?

• TCO - Over ten years, ownership and operating costs of tape are three to fifteen times less than associated costs for disk arrays

• Cost advantages of tape are expected to increase over time

• Conclusion—for now, tape is required to sustain a multi-PB digital archive

• But . . . tape presents some challenges

IBM TS3500Tape Libraries

image courtesy of IBM

Total cost of storage ownership study

Page 10: Gabe Nault Data Integrity

Why Tape for Preservation?

• Latency

• Limited to sequential access

• Limited number of read/writes

• Leads to greater system and operational complexity

IBM TS3500Tape Libraries

image courtesy of IBM

Limitations

Page 11: Gabe Nault Data Integrity

Data Integrity• Data integrity validation is provided by fixity

checks when data is written, transferred, moved, or copied

• Fixity checking should be performed from file creation to permanent storage to delivery

• Periodic validation of the entire archive should also be performed to detect data corruption(bit rot, drive errors, tape degradation, etc)

• DRPS uses a variety of integrity values for fixity

Page 12: Gabe Nault Data Integrity

DRPS Fixity Data Flow

Page 13: Gabe Nault Data Integrity

DRPS Data Integrity Validation

StorageGRID SHA-1 created for ingested files

Storage ExtensionsWeb service retrieves StorageGRID SHA-1, then Rosetta plug-in compares with Rosetta SHA-1

DRPS Ingest Tools SHA-1 created for producer files

SHA-1control

SHA-1control

SHA-1 checked upon ingest and write to permanent storage

SHA-1control

Page 14: Gabe Nault Data Integrity

StorageGRID Fixity Checking

• StorageGRID is constructed around the concept of object storage

• Provides a layered/overlapping set of protection domains to guard against object data corruption

1. SHA-1 object hash—checked on store and access

2. Content hash—checked on access

3. CRC checksum—checked with every operation

4. Key-based hash value—checked on access

StorageGRID

Page 15: Gabe Nault Data Integrity

DRPS Data Integrity Validation

SHA-1 and other fixity checked during write to storage nodesCRCs,

ECCs TSM end-to-end logical block protection

Tivoli Storage ManagerIBM

DRPS Ingest Tools SHA-1 created for producer files

StorageGRID SHA-1 created for ingested files

Storage ExtensionsWeb service retrieves StorageGRID SHA-1, then Rosetta plug-in compares with Rosetta SHA-1

SHA-1control

SHA-1control

SHA-1 checked upon ingest and write to permanent storage

SHA-1control

Page 16: Gabe Nault Data Integrity

• Supersedes SHA-1 fixity information with cyclic redundancy check values (CRCs) and error-correcting codes (ECCs)

• Enabled with new, state-of-the-art functionality of IBM LTO-5 and TS1140 tape drives

• Seamlessly extends validation of data integrity as data is written to tape

TSM End-to-End Logical Block Protection

Page 17: Gabe Nault Data Integrity

TSM End-to-End Logical Block Protection1. TSM server calculates and

appends “original data CRC” to logical data block

2. Tape drive computes its own CRC and compares to original data CRC

Page 18: Gabe Nault Data Integrity

TSM End-to-End Logical Block Protection3. As logical block is loaded

into drive data buffer, on-the-fly verifier checks original data CRC

4. In parallel, a “C1 code” (ECC) is computed and appended

Page 19: Gabe Nault Data Integrity

TSM End-to-End Logical Block Protection5. An additional ECC,

referred to as “C2 code,” is added to the logical block

6. More powerful than the original data CRC, the C1 code is checked every

time data is read from the buffer

Page 20: Gabe Nault Data Integrity

TSM End-to-End Logical Block Protection7. Data written to tape at

full line speed with read-while-write process

8. Just written data loaded to buffer and C1 code checked

Successful read-while-write operation assures no data corruption from TSM server to tape

Page 21: Gabe Nault Data Integrity

TSM End-to-End Logical Block Protection

11. TSM server verifies original data CRC, completing TSM end-to-end logical block protection cycle

9. When tape is read, all codes (C1, C2, original data CRC) are checked by drive

10. Original data CRC appended to logical block

Page 22: Gabe Nault Data Integrity

Ongoing Archive Data Integrity

• We must assume that data may become corrupted after being written correctly to tape

• Therefore, tapes must be read periodically to identify and correct data errors

image courtesy of IBM

Page 23: Gabe Nault Data Integrity

Ongoing Archive Data Integrity• Staging IEs to disk to verify

integrity is resource intensive!

• IBM LTO-5 and TS1140 tape drives provide a more efficient solution

• During “SCSI Verify” operation, a tape is mounted, drive checks all codes (C1, C2, original data CRC) as data is being read (at full line speed)

• Only status is reported as these internal checks are completed

image courtesy of IBM

Page 24: Gabe Nault Data Integrity

Summary

DRPS Ingest ToolsSHA-1control

StorageGRID

Storage Extensions

SHA-1control

SHA-1control

• Fixity information is the key to data integrity

• SHA-1 values ensure data integrity to StorageGRID

• TSM end-to-end logical block protection ensures data integrity to tape

• In-drive validation enables ongoing integrity checks for the entire archive

CRCs,ECCs Tivoli Storage Manager

IBM

Page 25: Gabe Nault Data Integrity

Thank you!

Questions?

Page 26: Gabe Nault Data Integrity

Trademarks

The Ex Libris logo and Rosetta are trademarks of Ex Libris Group.

The IBM logo and Tivoli Storage Manager are trademarks of International Business Machines Corporation.

The NetApp logo and StorageGRID are trademarks of NetApp, Inc.

Page 27: Gabe Nault Data Integrity

Rate of Bit Errors• Preliminary validation of DRPS archive

resulted in a 3.3x10-14 bit error rate

• USC Shoah Foundation Institute visit

• 8 PB tape archive of videotaped interviews of Holocaust survivors and other witnesses

• Experienced 1500 bit flips in 8 PB (2.3x10-14 bit error rate)` 100110111100001011

0