41
Copyright by Sara Natalie Fuchs 2005

Copyright by Sara Natalie Fuchs 2005sfuchs/sfuchs_mastersreport.pdf · Sara Natalie Fuchs Master’s Report Presented to the Faculty of the Graduate School of The University of Texas

  • Upload
    ngodieu

  • View
    214

  • Download
    1

Embed Size (px)

Citation preview

Copyright

by

Sara Natalie Fuchs

2005

The Preservation of Digital

Video: A Case Study

by

Sara Natalie Fuchs

Master’s Report

Presented to the Faculty of the Graduate School of

The University of Texas at Austin

in Partial Fulfillment

of the Requirements

for the Degree of

Master of Science in Information Studies

The University of Texas at Austin

May 2005

The Preservation of Digital

Video: A Case Study

Approved by Supervising Committee:

iv

Abstract

The Preservation of Digital Video:

A Case Study

Sara Natalie Fuchs, MSIS

The University of Texas at Austin, 2005

Supervisor: Patricia Galloway

The preservation of content is not a new problem. Archivists have been struggling

with this problem for many years, struggling to maintain access to formats that are

volatile and facing obsolescence. However, with the sheer amount of content that is

created digitally, new problems are arising, and at a much faster rate. One example can be

seen in the world of moving images, where more and more content is created digitally.

By examining a case study of a documentary film currently in production, the problems

inherent in preserving digital moving images are examined, and solutions are proposed.

v

Table of Contents

ANALOG VS. DIGITAL 1

The History of Traditional Film Preservation..........................................................1

The Rise of Digital Video ........................................................................................3

Digital File Preservation ..........................................................................................4 The Open Archival Information System.........................................................5

Initiatives in Preserving Digital Video ....................................................................6 The Universal Preservation Format ................................................................6 Preserving Digital Public Television Project..................................................8

Metadata Required for Digital Film Preservation..................................................10 Overview.......................................................................................................10 Dublin Core...................................................................................................11 Metadata Encoding and Transmission Standard...........................................12 Synchronized Multimedia Integration Language..........................................12 MPEG-7 & MPEG-21...................................................................................13

Identification of File Formats ................................................................................13 Digital Formats Web Site..............................................................................14 Digital Video File Formats ...........................................................................15

vi

CASE STUDY: A TEXAS CORRECTION 17

About the Project ...................................................................................................17

Formats Involved in Production of Film................................................................19

Why Were These Formats Chosen?.......................................................................22

Metadata Generation..............................................................................................22

Preservation of Formats .........................................................................................25

Future Online Access.............................................................................................27 The Moving Image Archive..........................................................................27 The Open Video Project................................................................................29

CONCLUSION 30

References..............................................................................................................32

Vita .......................................................................................................................35

1

ANALOG VS. DIGITAL

The preservation of content is not a new problem. Archivists have been struggling

with this problem for many years, struggling to maintain access to formats that are

volatile and facing obsolescence. However, with the sheer amount of content that is

created digitally, new problems are arising, and at a much faster rate. One example can be

seen in the world of moving images. For years, films have been created with traditional

film stock, a volatile format, certainly, but one that is still being used widely, and has a

great deal of support. Recently, though, more and more moving image content is created

digitally, which means that archivists are no longer dealing with an artifact, but are now

in charge of “stewardship of disembodied digital information” (Besser, 2001). With

digital video files, there are many unknowns. How long will this format last? What is the

best way to preserve it? How do you preserve something that you can’t even see? While

traditional film preservation is about conservation (preserving the film stock), digital

video preservation is much more akin to asset management—preserving not only the

content, but also the information related to the content (the metadata). There are

standards and best practices associated with film preservation, but digital video is still a

relatively new format. Due to its ease of use and low cost, digital video production has

grown in popularity, but with this ease of use come new challenges and obstacles,

namely, how to access this content in the future.

The History of Traditional Film Preservation

The preservation of moving images is a national topic, and one that has gained

momentum over the past ten years. According to the National Film Preservation

2

Foundation (NFPF), a nonprofit corporation created by the United States Congress and

committed to preserving our nation’s film heritage, less than 20% of feature films in this

country survive in their complete form in archives, and only half of American features

produced before 1950 still survive (National Film Preservation Foundation web site). In

August of 1994, the Library of Congress completed a report entitled “Redefining Film

Preservation: A National Plan,” with the goal of improving the state of American film

preservation, as “motion pictures of all types are deteriorating faster than archives can

preserve them” (Redefining Film Preservation, 1994) through nitrate deterioration, color

fading, and vinegar syndrome. The plan recommends storage procedures (low

temperature/low humidity), how to provide access to these films, how to develop

partnerships with private organizations, and how to secure funds for film preservation.

Film preservation as it has been traditionally defined emphasizes conservation

(protecting the original artifact and slowing decay), duplication (making surrogate

copies), restoration (enhancing the film), and access (sharing the film with the public).

Motion picture film is a known format, with a long history; it is over a century old and is

still being used. Because of this, there are industry standards for motion picture film,

namely, 35mm, 16mm, and 8mm. Originally, film was made of nitrocellulose (nitrate

film), a highly flammable material. A more stable format, acetate-based film, was

introduced to the professional community in 1954 (for 35mm) (The Film Preservation

Guide, 2004). Film made in the last thirty years can be expected to last at least one

hundred years (Wright, 2004). When a film print is in danger of decaying, the solution is

simple: make another film print. Although this is a costly process — it costs about

$32,000 to make a new master and viewing print of a seven-reel, black and white silent

feature, and even more to make a preservation copy of a sound feature (National Film

Preservation Foundation web site) — it is the only accepted method of preservation.

3

There is a great deal of training and support for those entering the field of film

archiving. Programs are in place to train aspiring moving image archivists, such as the

George Eastman House School of Film Preservation (established in 1996), the Moving

Image Archive Studies Program at UCLA (established in 2001), and the Moving Image

Archiving and Preservation program at NYU’s Tisch School of the Arts (established in

2002) (de Stefano, 2003). With more trained moving image archivists entering the

workforce, it is possible that more standard practices will be incorporated into the field of

moving image archives, both traditionally and digitally.

The Rise of Digital Video

Due to its ease of use and low cost, digital video (DV) is increasingly being used

by low-budget moviemakers. However, the trend is beginning to catch on in Hollywood,

as well. Eighty-five percent of the shots in The Matrix were digital, and almost all of Star

Wars: Episode 1—The Phantom Menace was shot digitally; both films were released in

1999. The third segment of the second Star Wars trilogy—The Revenge of the Sith—was

shot entirely on a digital format (Chute, 2000); this film will be released on May 19,

2005. Robert Rodriquez, an Austin-based filmmaker, has often extolled the virtues of

shooting digitally, and bypassing celluloid altogether. His films Once Upon a Time in

Mexico and Spy Kids 2: The Island of Lost Dreams were both shot using High Definition

24p digital video (Savlov, 2002). To keep pace with this trend, more and more theaters

are being outfitted to project films digitally. As technology improves and venues develop

to project films created digitally, traditional motion picture film may be a thing of the

past (Besser, 2001).

4

The Digital Cinema Initiatives, LLC is a joint venture between Disney, Fox,

MGM, Paramount, Sony Pictures Entertainment, Universal, and Warner Bros. Studios to

develop an open architecture for digital cinema systems in movie theaters. In March,

2005, the DCI released Version 5.1 of the voluntary DCI Technical Specification,

available to members of the DCI. The goal of these specifications is to ensure a “uniform

and high level of technical performance, reliability and quality control” in projecting

digital films (DCI website). This initiative utilizes the Digital Cinema Initiative

Distribution Master (DCDM) format, that bundles video, audio, timed text, subpictures,

metadata, and other elements, for the specific playback of motion pictures in movie

theaters (Digital Formats website).

It’s interesting to note that creating standards in digital projection quality is being

undertaken now, coinciding with the rise of big-budget “Hollywood” pictures that are

made in digital form. It can only be anticipated that preserving digitally-made films will

be the next logical step. For now, though, those charged with maintaining our cultural

heritage—the libraries and archives—are forced to confront this issue.

Digital File Preservation

The problems inherent in preserving films shot digitally are no different than the

problems faced with preserving any sort of digital content. While there is a growing

concern over this issue, there is an appalling lack of standards—the wide range of

formats, most of which are proprietary, means that a different standard would have to

exist for each format. And there is no guarantee that proprietary formats will be

supported into the future. The often proposed solutions of migration (moving a file to a

newer format) and emulation (recreating the original environment of the file) “may prove

5

effective for some types of media, but they are inadequate for handling the intricacies,

interdependencies, and sheer volume” of digital video productions. The ideal solution is

bundling, which is “the process of bonding metadata with content within the same file

format” (Ide, McCarn, Shepard, & Weiss, 2002).

THE OPEN ARCHIVAL INFORMATION SYSTEM

The Open Archival Information System (OAIS), developed by the Consultative

Committee of Space Data Systems (CCSDS), was developed in order to provide a

common terminology and framework for preserving digital objects. Although it does not

specify an implementation, it is designed to be applicable to any archive charged with

preserving digital objects for the long term. The OAIS environment is defined as

consisting of Producers, Management, and Consumers. Producers provide the

information to be preserved, Management is responsible for managing the OAIS archive,

and Consumers are the users who will use the OAIS archive in order to find information.

In order for material to be preserved, it must first be defined. OAIS defines each

submission of information to the archive in terms of an Information Package, consisting

of Content Information (the essence) and the Preservation Description Information

(metadata about the submission). Different Information Packages include the Submission

Information Package (SIP, that sent to the OAIS by the Producer), the Archival

Information Package (AIP, that preserved by the Management), and the Dissemination

Information Package (DIP, that sent to the Consumer upon request). Defining the OAIS

in such specific terms ensures that issues that need to be dealt with in long-term

preservation, such as copyright, access, description, and preservation issues, etc. are

considered at every phase of the ingest, storage, and dissemination process.

The OAIS framework is useful for the preservation of digital moving images,

because it provides not only a common terminology, but permits a valid way of

6

combining, or bundling, all the material that is necessary to preserve the digital object

(namely, the content, and the associated metadata). In 1993, the Committee for Film

Preservation and Public Access presented a statement entitled “Preservation Without

Access Is Pointless,” in which it was argued that, while a national policy to preserve our

motion picture heritage was important, it is equally important to provide access to that

which is preserved. The OAIS framework, by specifying the Dissemination Information

Package, ensures that not only will digital moving images be preserved, but that a way to

share these preserved packages is presented.

Initiatives in Preserving Digital Video

While the OAIS framework provides guidance on creating an information

repository for digital preservation of any digital files, projects specifically related to

digital video preservation have been few and far between. Two such projects are the

Universal Preservation Format project, which, similar to OAIS, provides a framework for

how to package digital video files to ensure future access. The Preserving Digital

Television Project takes this one step further, by developing a digital repository

implementation for digital content produced by public television stations.

THE UNIVERSAL PRESERVATION FORMAT

The idea of a universal preservation format to preserve a wide variety of digital

formats was introduced at the Society of Motion Picture and Television Engineers

Conference in October of 1996. The initiative was later sponsored by the WGBH

Educational Foundation (Shepard & MacCarn, 1998). As a public broadcasting station

and producer of programming for public television, WGBH had firsthand knowledge of

the need for a way to preserve the myriad formats of material being created, including

7

“audio, text, database files, captioning, descriptive video information and the whole

gamut of original digital content produced for the World Wide Web” (Shepard &

MacCarn, 1999). Although most of their older formats had been digitized, addressing the

problems of technological obsolescence with their analog material, WGBH was still

faced with the problem of preserving these newer digital forms, often in various digital

tape formats.

In 1997, WGBH was awarded a grant from the National Historical Publications

and Records Commission of the National Archives to develop a Recommended Practice

for a Universal Preservation Format (UPF). The mission of this project was to thoroughly

analyze the issue of preserving digital video records, to raise awareness, to build a

coalition for a solution to the problem, to develop a Recommended Practice, and to

present the solution to parties in both the profit and non-profit sectors, through

conferences and publishing journal articles. The idea was to develop an understanding

between those who develop the technology (the engineers) and those who are entrusted to

preserve it (the archivists) (Shepard & MacCarn, 1998).

In 1999, WGBH released a publication entitled “The Universal Preservation

Format: A Recommended Practice for Archiving Media and Electronic Records,”

consisting of “Part 1: User Requirements” and “Part 2: Technical Requirements.” Part 1

provides a context for the Universal Preservation Format, which utilizes a wrapper

structure that combines essence (or the data) with metadata in the same container, what

OAIS would call the Archival Information Package (Shepard & MacCarn, 1999). Called

“insightful and ahead of its time” (Fleischhauer, 2003), the UPF is designed to be

independent of both operating systems and physical media, as its metadata contains “the

technical specifications required to build and rebuild appropriate media browsers to

access contained materials throughout time” (Shepard & MacCarn, 1999). The UPF

8

Recommended Practices document also stressed the importance for a unique identifier,

the idea of a “digital Rosetta stone” (instructions for the hardware specifications, saved in

an environment not dependent on platforms), the use of open standard formats for data

types, and establishing digital fixity (or provenance). Part 1 concludes with a call for

“open standards or shared systems designed specifically for long-term digital storage”

(Shepard & MacCarn, 1999).

Part 2 details the technical requirements for the Universal Preservation Format,

which includes both an analog requirement and a digital component. The analog

requirement consists of “a human readable system that describes the content stored on the

physical media,” as well as instructions for a system that can read the content, and how to

access the structure that contains the content. The digital component consists of a

wrapper, the essence (or content), and the metadata. A media compiler would be used to

assemble the essence and metadata components for storage in the UPF, with an emphasis

on automation. It is stressed that the UPF would be an open standard (Shepard &

MacCarn, 1999).

The Universal Preservation Format never progressed beyond recommendations. It

would be five years before a similar project, emphasizing implementation, would be

initiated.

PRESERVING DIGITAL PUBLIC TELEVISION PROJECT

The National Digital Information and Preservation Program (NDIIPP), part of the

Library of Congress, was authorized by Congress in December 2000 to establish a

strategy for the long-term preservation of digital materials. On September 30, 2004,

awards totaling $14.9 million were given to digital preservation projects (Ackerman,

2004). A project dealing with the challenge of preserving digital moving image files was

awarded this grant. “Preserving Digital Public Television: Adopting Standards and

9

Selection Criteria and Defining Structure” is a project partnership between Channel

13/WNET (Educational Broadcasting Corporation), WGBH Educational Foundation, and

New York University, and aims to continue with the recommendations established by the

Universal Preservation Format by developing a solution for preserving digital television

content. Because of the enormous amount of digital video content produced by public

television stations in varying formats, entailing not only the finished product but also the

original raw materials, digital preservation is of primary concern (Interviews with Library

Staff: Carl Fleischhauer, 2005). The project has the following goals:

• To conduct a series of discussions focusing on standards, metadata, and best

practices.

• To inventory at-risk materials housed at WNET and WGBH, with particular

emphasis on signature series, such as NATURE, American Masters, Religion &

Ethics Weekly, American Experience, and NOVA.

• To complete and test the design for the Universal Preservation Format.

• To outline an administrative structure for creating a long-term digital public

television repository.

• To design and test a model repository (Preserving Digital Public Television:

Adopting Standards and Selection Criteria and Defining Structure Project

Proposal).

This project raises a number of interesting questions regarding the field of digital

video preservation, and, hopefully, can begin to provide some answers. Among these are

the following:

• The lack of standards and best practices for digital video preservation;

• Identifying the best file formats and encodings for digital video preservation;

• The best management system for these types of digital assets;

10

• How to manage the final product, other versions, and raw materials; and

• Identifying and preserving corollary material (such as web sites and DVDs).

It will be interesting to see what develops regarding this project over the next few

years, and how it will affect video projects that do not have access to the same resources

and finances as the Library of Congress.

Metadata Required for Digital Film Preservation

OVERVIEW

Capturing appropriate and relevant metadata is a crucial element in ensuring not

only the longevity of digital objects, but also the retrieval of content. For video files,

metadata is even more important, as “unlike textual resources, audio and video cannot be

adequately searched by themselves, as a raw resource.” Metadata for video can also

“enable the exchange of resources between institutions, and facilitates the internal

management and preservation of such resources” (NINCH Guide to Good Practice,

2002).

Gilliland-Swetland (2000) describes five types of metadata for preservation:

administrative, descriptive, preservation, technical, and use. Metadata can be stored

within the object, or separately, as a wrapper, with a way to link the metadata to the

specific digital object, as part of the Archival Information Package (AIP), as specified by

OAIS. Although “there is no shared good practice for what constitutes minimum

metadata for digital audio and video in the cultural heritage sector” (NINCH Guide to

Good Practice, 2002), it is possible to adapt a metadata standard for use with digital video

files, and capture as much information as possible.

11

DUBLIN CORE

The Dublin Core Metadata Set contains fifteen elements: title, creator, subject,

description, publisher, contributor, date, type, format, identifier, source, language,

relation, coverage, and rights. The beauty of Dublin Core lies in its simplicity—it is able

to describe a wide variety of digital objects, and is fairly easy to create. However, it may

be too simple to describe the complexities of complex multimedia objects at a granular

level, such as the many video files that compose a finished digital video production.

Dublin Core does not address many types of information specific to broadcasting, such as

technical, preservation transfer, documentary, and transactional metadata (Wright, 2003).

One solution is to combine Dublin Core with something like the Resource

Description Framework (RDF), which can combine both Dublin Core Elements and

MPEG-7 descriptors within one container (Hunter & Armstrong, 1999). The Video

Development Initiative (ViDe) has created a Dublin Core Application Profile for Digital

Video, which defines the fifteen Dublin Core elements in terms of describing video

elements by adding qualifiers and extensions. Recommendations for standardizing

information in those fields by using controlled vocabulary are also included (Agnew &

Kniesner, 2001).

Another solution is utilizing the SMPTE Metadata Dictionary, developed by the

Society of Motion Picture and Television Engineers, and considered “a comprehensive

complement to the minimalist Dublin Core element set” (Wactlar & Christel, 2002). The

SMPTE Metadata Dictionary was developed for the television and video industry, and

contains “audiovisual descriptors for the production environment, which covers the entire

production chain: pre-production, post-production, acquisition, distribution, broadcasting,

storage and archiving of digital audiovisual material” (Yang, Marchionini, Wildemuth,

Geisler, & Mu, 2002).

12

METADATA ENCODING AND TRANSMISSION STANDARD

Inspired by the Dublin Core/Warwick Framework and the Resource Description

Framework, the Metadata Encoding and Transmission Standard (METS) schema is a

standard for encoding objects in a digital library, including descriptive, administrative,

and structural metadata, using the XML schema language. It was developed in response

to the Library of Congress’s Making of America (MOA) project, when a need was found

to express the relationships between digital objects (METS website). Because METS is a

metadata standard and can be adapted with various schemas, it is a good solution for

encoding appropriate metadata for digital video files. The NINCH Guide to Good

Practice recommends METS, with the appropriate schema, “to describe the content and

structure of time-based digital files such as audio and video.”

The Digital Audio-Visual Preservation Prototyping Project of the Library of

Congress, which explored “aspects of digital preservation for audio and video during

1999-2004” (Digital Audio-Visual Preservation Prototyping Project web site), created an

extension schema for METS specific to video files, the VMD schema. In addition, the

other schemas developed for this project, dealing with rights restrictions and migrating

and reformatting processes, may be useful.

SYNCHRONIZED MULTIMEDIA INTEGRATION LANGUAGE

Synchronized Multimedia Integration Language (SMIL) is a World Wide Web

Consortium (W3C—an international consortium that develops World Wide Web

Standards) recommendation for encoding multimedia presentations, combining streaming

audio, video, text, images, and other media types, using XML (W3C Synchronized

Multimedia web site). The NINCH Guide to Good Practice also recommends the use of

SMIL, which “can be used to describe structural metadata about a particular frame of

13

video (frame 30, timecode 01:20:36.01) as well as link the appropriate series of frames to

alternate representations such as a transcription of the dialogue in that scene.”

MPEG-7 & MPEG-21

The Multimedia Content Description Interface (MPEG-7) “aims at providing

standardized core technologies allowing description of audiovisual data content in

multimedia environments” (Wactlar & Christel, 2002). Some of the detailed descriptors

specific to digital video include the creation and production processes of the content,

usage of the content, storage features, genre, and content. These descriptors can be used

for an entire production, or on a more granular level, describing each particular scene.

MPEG-21 is being designed to accommodate a wider range of description,

including “content creation, content production, content distribution, content

consumption and usage, content packaging, intellectual property management and

protection, content identification and description, financial management, user privacy,

terminals and network resource abstraction, content representation, and event reporting”

(Wactlar & Christel, 2002).

Identification of File Formats

In order to best preserve digital formats, the formats must be correctly identified.

With so many digital formats available, and the rapid increase in digital formats,

correctly identifying the proper one to ensure long-term preservation is difficult, at best.

Not only must the format be identified, but the correct documentation about the format

must also be found. In addition, “formats are platform-dependent to particular playback

machines” and “physical media require constant migration to new formats” (Ide &

McCarn, et al, 2002).

14

DIGITAL FORMATS WEB SITE

The National Digital Information Infrastructure and Preservation Program

(NDIIPP) has created a digital format web site dedicated to the technical analysis of

digital formats. The Digital Formats Web Site defines file formats as “packages of

information that can be stored as data files or sent via network as data streams (aka

bitstreams).” These file formats are not tied to a particular physical medium.

The Digital Formats Web site identifies seven sustainability factors that affect the

long-term preservation of digital formats:

• Disclosure (is it open source or proprietary?),

• Adoption (is it widely used?),

• Transparency (how easy is it to analyze the format with basic tools?),

• Self-documentation (does it contain a certain amount of basic metadata?),

• External dependencies (does it depend on particular hardware, software, or

operating systems?),

• Impact of patents (is the format inhibited by patents?), and

• Technical protection mechanisms.

For moving image formats, the recommended quality and functionality factors

include clarity (high image resolution), fidelity (high audio resolution), and support for

multiple sound channels (Digital Formats web site).

The Digital Formats Web site is working in conjunction with the Global Digital

Format Registry in order to provide comprehensive identification of file formats. The

Global Digital Format Registry (GDFR) is an initiative created by Harvard University

and MIT DSpace in 2002 to “maintain persistent unambiguous bindings between public

identifiers for digital formats and representation information for those formats.” The

15

GDFR aims to be an informative resource to not only identify digital formats, but also to

validate attributes. Potential uses for this registry include the following:

• Identification (what format is this object?),

• Validation (is this really the format that it purports to be?),

• Transformation (how is this object produced?),

• Characterization (what are the properties of this object?),

• Risk assessment (what is this object’s risk of obsolescence?), and

• Delivery (how do I render this object?).

The next steps for the GDFR are to find funding to design, implement, and

populate a prototype system and to communicate the specifics of the project to relevant

parties, with an emphasis on cooperation between commercial and non-profit endeavors

(Abrams, 2004).

DIGITAL VIDEO FILE FORMATS

Digital Video (DV) was introduced in 1996, and was the first digital recording

format to reach the consumer market (VidiPax website). The variants of DV include

Mini-DV, DVCAM, and DVCPRO, among others. Mini-DV is the smaller format

version of DV, and, because of its high quality and low price, has become the standard

for semiprofessional and consumer video production. Mini-DV tapes (the “S” cassette)

are small in size (6.5 × 4.8 × 1.2 cm) and can hold between an hour or an hour and a half

of video. Although mini-DV does encode video using a compressed format (DCT

intraframe compression at a 5 to 1 rate), the compression is minimal, and there is no

noticeable effect on picture quality (Wikipedia website). Whether the mini-DV format

can be relied upon as an archival format at this point in time is a difficult question to

answer; as it is relatively new, no one knows how fast it deteriorates, even when kept in

proper storage conditions (TCA website). Ten years from now, will it be possible to find

16

machines to play back these tapes? For now, however, the format is still widely

supported, and finding playback equipment is not a concern.

Table 1: Digital Video Formats (from NINCH Guide to Good Practice)

Format Tape Size Compressor Compression Ratio

YUV Sampling

Running Time (minutes)

DVCAM/ mini-DV

6mm DV25 5:1 4:1:1 NTSC 4:2:0 PAL

184

DVCPRO 6mm DV25 5:1 4:1:1 183 DVCPRO 50

6mm DV50 3.1:1 4:2:2 90

Digital S 12.5mm DV50 3.1:1 4:2:2 124 Digital Betacam

12.5mm Sony 3:1 4:2:2 94

17

CASE STUDY: A TEXAS CORRECTION

While the current initiatives in preserving digital video hold promise for

establishing standards and building solutions for both the commercial entities creating the

content and the non-profit organizations committed to preserving it, the truth of the

matter is that the bulk of video production is still being done by the low-budget

filmmaker—those without the money to shoot on traditional film stock and, quite likely,

without the technological expertise to know how to preserve it. One such production is A

Texas Correction, a documentary directed by Susanne Mason, which has been in the

making for the last seven years. Because this is a low-budget, small-staffed production,

funded almost entirely with grant money, it exemplifies the difficulties of preserving

digital content from the filmmaker’s perspective, some of which might be facing

obsolescence and deterioration before the production is even completed.

About the Project

A Texas Correction, an hour-long documentary for public television, tells the

story of Fred Cruz, a Latino American prisoner from San Antonio who challenges the

Texas prison system by initiating a lawsuit that produced the most comprehensive state

prison reform in this country’s history. The film reconstructs this story with a

combination of contemporary film images, historical footage, interviews, and archival

still photos.

The film is directed by Susanne Mason, an Austin-based filmmaker. Mason is an

award-winning filmmaker whose credits include the documentary Stories from the

Riverside (1993), a riveting portrait of three women incarcerated for murdering their

alleged abusers. The film led Mason to pursue other prison stories, culminating in the

18

production of A Texas Correction. She produced Town In Transition, a short historical

documentary about the city of Austin, Texas during the period from 1950 and 1975. She

was associate producer for Mobilus Media on the hour-long PBS documentary Are the

Kids Alright? broadcast by public television stations across Texas on June 24, 2004. She

has served as associate producer and writer on several PBS documentaries, including

Words of Our Ancients (1997), The Struggle in the Fields (1996), Songs of the Homeland

(1994), and Go Back to Mexico! (1994). She produces video components and audio

elements for educational multimedia, and recently authored ten historical segments for

the upcoming Austin Past & Present Documentary DVD. A Texas Correction is Mason’s

feature debut.

Mason initiated development of the program in 1996, conducting extensive

original research into the lives of Fred Cruz and Frances Jalet and a series of

approximately 40 interviews with former and current Texas prisoners and prison officials.

This research formed the basis for the original outline and treatment of the film.

Fundraising and archival research began in 1996. Film production on 16mm followed in

1997 and 1998, after sufficient funds were raised. Alternating production and fundraising

activities made it possible to secure funds for different phases of production, including

interviews with friends of Fred Cruz, former prison wardens who knew him, and

prisoners who were housed on the Ellis unit during Cruz’s incarceration. In 2004, a rough

cut of the film was completed. Mason will direct completion of the project.

Funders of this film include the Austin Film Society, City of Austin Cultural Arts

Contract, Diocese of Austin, individual contributors, Latino Public Broadcasting,

National Endowment for the Arts, Paul Robeson Fund for Independent Media, Southern

Humanities Media Fund, Summerlee Foundation, Texas Commission on the Arts, Texas

19

Council for the Humanities, and the Texas Filmmakers’ Production Fund. The total

anticipated cost of the finished film is $280,000.

Formats Involved in Production of Film

Because this is a documentary film developed for the public television market

(although it may indeed have a theatrical release), it shares many traits with other shows

produced for television (the preservation of which the NDIIPP project is currently

addressing). A Texas Correction consists of a number of different components, on a

variety of formats. Media sources include original footage, archival footage, photos,

documents, and audio sources (such as music and voiceover).

The original footage consists of digital video plus B-roll (secondary footage) shot

on 16mm film stock. The video component is mostly interviews, shot on a variety of

mini-DV cameras (Canon XL1, Sony VX1000, Sony DSR 300, and the Sony DSR 500)

onto either Sony or Panasonic mini-DV tapes. A few interviews were shot on a BetaCam

onto Beta SP tape (transferred to mini-DV), and one interview was shot on 3/4" SP tape

(transferred to Beta and then to mini-DV). The B-roll was shot on 16mm film and

transferred to digital Beta, which was then transferred to mini-DV (with identical time

codes). There are approximately 120 mini-DV tapes of this footage.

The archival footage was ordered from a variety of archives, such as CBS, ABC,

Producers Library, and the National Archives. The archival footage is drawn from

documentary films about Texas prisons (dating as far back as 1947), educational and

training films, news film, and surveillance video (from 1978). Initially, a VHS copy is

sent to view the footage; when the footage is deemed appropriate for use in the film, a

Beta version is sent, which is converted to mini-DV. There are approximately 40 mini-

DV tapes with archival footage.

20

There are approximately 1,461 still images, consisting of photos, documents, and

newspaper headlines. The hard copy of these images is scanned into Photoshop, and

sections of the documents are converted into jpgs, which are used for editing.

In order to be edited, all of the footage must be in a digital format. Initially, the

project was edited using Avid, which was loaned to the project temporarily. When access

to the Avid program was lost, the project was moved to Final Cut Pro 2.0, as it was the

cheapest system that could handle a project of this magnitude, with Mac OS 9 as the

operating system. As footage is imported into Final Cut Pro (using the GV-D9000 Sony

NTSC mini-DV deck), it is converted to QuickTime files. Because of the size of the

project, the files are stored on six different hard drives (three internal and three external).

The files associated with the project take up over 500 gigs of hard drive space. Once the

edit is complete, the project is output to an Edit Decision List (EDL), which provides a

blueprint for assembling the film by referencing the unique ID and time code for each

piece of media. The EDL is then sent to an online editing house with the original media

to construct the film. Online editing involves editing with the highest-quality original

media available; Mason’s edit, using compressed media, is called on offline edit. An

editing house is used to properly correct the video for broadcast standards, which is

essential for this project, as the film was shot on so many different formats. The editing

house creates a master copy of the film on digital Beta, which is then used to create

access copies, such as VHS tapes or DVD.

Table 2: Life Cycle of the Production Process for A Texas Correction

21

22

Why Were These Formats Chosen?

Because of the limited budget for this film, the formats used were chosen because

of necessity, and not necessarily because of quality, or longevity. Larger-budget films

have more luxury in determining their formats. Mini-DV was chosen because of cost;

Beta is preferred because it is a higher quality (it records video at a 3:1 compression ratio,

while mini-DV records at a 5:1 compression ratio), but one Beta tape costs over twice as

much as one mini-DV tape. An exception was made for the B-roll, which was shot on

16mm film to evoke the time period in which the film takes place.

It’s important to note that shooting for this project began in 1997; if this project

were to begin shooting today, it might be shot using a high definition format, or on digital

Beta. Also, the operating system and editing system that Mason uses is outdated—Final

Cut Pro 5.0 is now in use, and the Macintosh operating system she uses (Mac OS 9.2)

should also be upgraded. However, changing platforms and software programs could

cause more problems than it’s worth at this point in the project.

Metadata Generation

Because most of this content was created digitally, a certain amount of metadata

is automatically generated, such as the time codes from the mini-DV tapes. In addition, a

huge amount of content metadata was generated by the filmmaker and her staff, which is

being stored in Filemaker Pro databases. Three databases were designed specifically for

this project, based on a model provided by WGBH Educational Broadcasting Company.

The specificity of the information collected for this project highlights one of the problems

with choosing a metadata set—how do you find and/or adapt an existing metadata set that

meets all of your specific project needs? The three databases that track the material

contained in this project are: the original footage database (divided up by tape), the stills

23

database (divided into different categories, depending on the topic—each photo has its

own record), and the stock footage database (which contains information on the archival

footage). The fields contained in each database are listed below.

Table 3: Databases and Database Fields

Database Database Fields Original Footage Database

date entered tape # story # time in: time out: length subject event date: event location source: source ref. copyright ordered format original format sync color quality/comments description

The Stills Database

date entered series program # program: Let It Roll picture # subject picture date category: agriculture, building tenders, characters, cityscapes, civil rights demonstration, construction, disciplinary reports, diagnostics/welcoming committee, diaries/correspondence, field labor, industry: machine, industry: office, legal and criminal records, newspaper articles, official portraits, prison life and portraits, prison units/farms, recreation/rodeos, solitary/isolation, warden/guard life and portraits event location personalities

24

description artist source source ref image type copyright owner color ordered format location restrictions/release notes frame # rights obtained date portrayed

The Stock Footage Database

date entered series program # program: A Texas Correction picture # subject picture date category: agriculture, building tenders, characters, cityscapes, civil rights demonstration, construction, disciplinary reports, diagnostics/welcoming committee, diaries/correspondence, field labor, industry: machine, industry: office, legal and criminal records, newspaper articles, official portraits, prison life and portraits, prison units/farms, recreation/rodeos, solitary/isolation, warden/guard life and portraits event location personalities description artist source source ref image type copyright owner color ordered format location restrictions/release notes frame # rights obtained date portrayed

25

Preservation of Formats

Mason has collected quite an array of both digital and non-digital formats

associated with her project to preserve. Her photos, documents, and traditional film stock

are not in danger, as long as they are stored in the proper conditions. However, some of

her digital content is quite old—her oldest mini-DV tape dates back to 1997, although in

2001, all of her mini-DV tapes were duplicated onto new mini-DV tapes, so currently,

her oldest mini-DV formats are at least four years old.

What can Mason feasibly do to preserve her digital content? If Mason had an

unlimited budget, she could purchase a Digital Asset Management system to manage all

of her digital files. For a project with this budget, however, that’s out of the question. One

option is to store all of the footage on mini-DV tapes onto external hard drives—prices

for storage of this nature have dropped considerably, and removing the footage from the

platform-dependent nature of the mini-DV cassette tape, which has the same deterioration

issues as analog tape (TCA website), would mean that she would no longer be dependent

on playback machines, nor would she be reliant on constant migration, what Ide and

MacCarn call “a nightmare for collecting institutions.”

Her other digital materials would need to be migrated to non-proprietary formats.

Her image files should be converted to tiff formats, the information in her Filemaker Pro

databases should be exported to tab-delimited ASCII text (or an open-source relational

database), her Edit Decision List should be stored as an RTF file or ASCII text, and her

QuickTime files (used during the editing process) should also be stored on external hard

drives, and migrated when necessary.

26

Table 4: Formats to Preserve

Formats to Preserve Ideal Storage Mini-DV tapes Raw video files on external hard

drives or migrated to mini-DV tapes 16 mm film Standard film storage conditions Beta stock Migration Photos/Documents Standard photo/paper storage

conditions Filemaker Pro databases Tab-delimited ASCII text or open-

source relational database Image files (jpgs) Tifs QuickTime files External hard drives Edit Decision List RTF file

An ideal solution, although one possibly limited by Mason’s expertise, is to create

her own Archival Information Package, as stipulated by OAIS, of both her content and

her metadata. Using METS, Mason could create a comprehensive schema, utilizing

standard Dublin Core elements and specialized elements relevant to video, that

encompass not only the metadata automatically generated by the raw video files, but also

the information she created for her databases. Each piece of content is identified by a

unique number (for the video files, a unique number, identifying the specific mini-DV

tape, as well as a time code, identifying where on the tape the scene is located), which

can be used to link the metadata to the content. Additionally, the QuickTime compressed

file could be linked to the identical, uncompressed raw video file. Using a METS

document to formalize the generated metadata (as a wrapper for the digital content)

would allow Mason to manage the entire project as one task.

As this AIP would be solely for Mason’s use, there’s no need for metadata for

resource discovery across platforms. But the other metadata she has already generated,

especially the content descriptions and the copyright restrictions, will prove invaluable

for future identification and use of her material. The external hard drives would act as

27

Mason’s own digital repository, storing both the content and the metadata associated with

the content. The material on these drives would be backed up on a regular basis.

Future Online Access

Because A Texas Correction is designed for broadcast on public television, online

access to this footage is not really an issue. There is no website for the project, and no

footage is available online. However, due to the tenuous financial footing of this project,

the future is uncertain. The film might never be finished. It might be finished, but may

never be broadcast on public television. It might never be distributed, either locally or

nationally. The film may not receive future funding, and may never amount to anything

other than several boxes of mini-DV tapes stored in a closet in Austin, Texas. It has been

said that preservation without access is pointless; why should Mason bother preserving

her digital content if no one is able to see them? Even if Mason never is able to release

her film in the traditional ways (through television broadcast or distribution in movie

theaters), several options do exist for future use of the massive amounts of original

footage that have been created during the eight years of shooting this production. If she

chooses, Mason can provide online access to her original footage, to be used for research

or in the use of other productions, provided she is able to preserve her original content so

that it can not only be identified, but also accessed. Mason already has compressed

QuickTime versions of her digital content, which is an appropriate digital format for

online access. Two such options are the Moving Image Archive and the Open Video

Project, both online access repositories for moving image content.

THE MOVING IMAGE ARCHIVE

The Internet Archive (http://www.archive.org/) was founded in 1996 as an online

digital library, preserving digital collections and making them available to the public. The

28

Internet Archive provides access to 30 billion archived web pages through the Wayback

Machine, digitized texts through the Text Archive, audio collections through the Live

Music Archive, and moving image collections through the Moving Image Archive

(http://www.archive.org/movies/movies.php). A huge component of the Moving Image

Archive is the Prelinger Archives. The Prelinger Archives is a collection of over 48,000

ephemeral (advertising, educational, industrial, and amateur) films, recently acquired by

the Library of Congress. Currently, 2,002 titles from the Prelinger Archives are available

in digital format through the Moving Image Archive.

All of the films on the Moving Image Archive are free to view, download, and

repurpose, making this an invaluable resource not only for scholars conducting research,

but for artists and filmmakers as well. Users are encouraged to “download the movies,

sample them, use them for classroom study, run them at theaters, or even use the footage

to make a for-profit film” (Slaton, 2001). This certainly differs from the footage available

for viewing in a traditional film archive. The Internet Archive maintains a collection of

Open Source Movies (http://www.archive.org/movies/opensource_movies.php), which

are short movies contributed by the community, oftentimes utilizing footage found in the

Moving Image Archive. There are currently 3,904 movies in the Open Source Movies

collection.

The moving image files from the Prelinger Archive are in the MPEG-2 format, “a

widely accepted standard for video playback” (Moving Image Archive website). MPEG-

2 provides a high-quality format for viewing and reusing these movies, but as a result, the

files are larger, which may be prohibitive for users on slower computers or utilizing dial-

up connections. Some movies are also available in the DivX format and editable MPEG-4

files, which can be used in an editing program, such as Final Cut Pro.

29

In order to digitize these films, the original films were transferred to Betacam SP

videotapes, which were then digitized at bitrates ranging from 2.75 Mbps to 3.5 Mbps,

and encoded at 480 × 480 pixels or 368 × 480 pixels. According to the Moving Image

Archive website, the “picture quality is equal to or better than most direct broadcast

satellite television” (Moving Image Archive website).

Submissions can be uploaded to the Internet Moving Image Archive, using an ftp

client. Once the uploads are complete, the movies are imported, which includes supplying

metadata, such as title, producer, and year of production. A curator views the movie, at

which point it is made available on the site.

THE OPEN VIDEO PROJECT

The Open Video Project (http://www.open-video.org) was developed by the

University of North Carolina-Chapel Hill’s Interaction Design Laboratory. Although the

main goals of the Open Video Project are to provide open source digital videos for use

for research and development and to provide a training ground for students in digital

libraries, a nice side benefit is that it also serves the needs of the public by providing “an

open source repository of video” (Open Video website). The collection is searchable and

browsable, and consists primarily of videos acquired from government and academic

institutions which are in the public domain (Mattison, 2004).

Currently, the collection contains 2,543 videos. The majority of the films are

available in MPEG-1 format, although MPEG-2, MPEG-4, and Quicktime formats are

represented as well. In addition to providing the files for download, the Open Video

Project provides a graphical user interface that allows users to view slideshows of the

video, storyboards, and fast forwards. This means that users need not sit through an entire

video segment, or wait for it to be downloaded, if they are only looking for a particular

item. Collections include video from the following projects: the University of Maryland

30

HCIL Open House Video Reports, the Informedia Project at Carnegie Mellon University,

the 2001 TREC Video Retrieval Test Collection, the CHI Video Retrospective, the

Digital Himalaya Project, and the NASA K-16 Science Education Programs (the Open

Video project web site).

Due to limited staffing, the Open Video Project is not accepting new video

submissions at this time. However, in the past, collections containing multiple videos and

videos that might be of interest to the education community were actively solicited.

Mason’s original footage for A Texas Correction certainly seems to meet those

qualifications. Additionally, she has QuickTime versions of all her footage, which is one

of the formats accepted for inclusion into the Open Video Project.

CONCLUSION

What can filmmakers do to ensure preservation of their digital moving image

files? First, greater education is needed for the filmmaker. Digital preservation courses

should become part of the curriculum of every film school. Second, filmmakers need to

be cognizant of the preservation implications of the formats that they choose—how long

will these formats last? What are the best storage conditions? How compressed are these

files? What are the benefits to shooting and storing footage in as non-lossy a format as

possible? Third, filmmakers need to understand that digital preservation is a moving

target, and constant vigilance will be required on their part to ensure access not only to

their final product, but also to the individual parts that compose the final whole.

In the near future, films created digitally will be considered as great a part of our

cultural heritage as films created with film stock. There may never be a perfect solution

to preserving digital items; however, we are facing a shorter time span with digital

31

formats than with analog formats. The digital bits won’t wait. While it is encouraging to

see the strides that are being made in developing not only standards, but solutions, it is

essential that steps be taken now, not only by those entrusted with preserving the content,

but by those creating it.

32

References

Abrams, S. (2004). Global Digital Format Registry (GDFR). Presented at the Dspace

User Group Meeting, MIT, March 10-11, 2004. Retrieved March 12, 2005 from http://hul.harvard.edu/gdfr/Dspace_MIT_2004.ppt

Ackerman, D. (2004). NYU Partners on $5.2 Million in Awards from the Library of Congress. Connect—Information Technology at NYU. Retrieved February 15, 2005 from http://www.nyu.edu/its/pubs/connect/fall04/ackerman_grants.html

Agnew, G. & Kniesner, D., editors (2001). ViDe User’s Guide: Dublin Core Application Profile for Digital Video. Retrieved March 2, 2005 from http://www.vide.net/ workgroups/videoaccess/resources/vide_dc_userguide_20010909.pdf

Besser, H. (2001). Digital Preservation of Moving Image Material? [Electronic version]. The Moving Image, 1.

Chute, D. (2000). Film Preservation at the (Digital) Crossroads. Retrieved February 7, 2005 from http://www.geocities.com/Tokyo/Island/3102/f-prez.htm

Committee for Film Preservation and Public Access (1993). Preservation Without Access is Pointless. Retrieved March 15, 2004 from http://www.cinemaweb.com/access/ pre_stmt.htm

De Stefano, P. (2003). Moving Image Preservation in Libraries [Electronic version]. Library Trends, 52 (1), 118-132.

Digital Audio-Visual Preservation Prototyping Project Web Site. Retrieved April 25, 2005 from http://www.loc.gov/rr/mopic/avprot/avprhome.html

Digital Cinema Initiatives web site. Retrieved April 2, 2005 from http://www.dcimovies.com/

Digital Formats for Library of Congress Collections. Retrieved February 17, 2005 http://www.digitalpreservation.gov/formats/

DV. Wikipedia, the free encyclopedia. Retrieved April 19, 2005 from http://en.wikipedia.org/wiki/MiniDV

The Film Preservation Guide (2004). National Film Preservation Foundation. Retrieved March 30, 2004 from http://www.filmpreservation.org

Fleischhauer, C. (2003). Looking at Preservation from the Digital Library Perspective [Electronic version]. The Moving Image, 3.2, 96-100.

33

Gilliland-Swetland A. (2000). Setting the stage. Introduction to Metadata. Retrieved from http://www.getty.edu/research/conducting_research/standards/intrometadata/ pdf/swetland.pdf.

Hunter, J. & Armstrong, L. (1999). A Comparison of Schemas for Video Metadata Representation. Retrieved February 28, 2005 from http://www8.org/w8-papers/3c-hypermedia-video/comparison/comparison.html

Ide, M., MacCarn, D., Shepard, T., & Weisse, L. (2002). Understanding the Preservation Challenge of Digital Television. Retrieved March 10, 2004 from http://www.clir.org/pubs/reports/pub106/television.html

Interviews with Library Staff: Carl Fleischhauer. (n.d.). Retrieved electronically on April 14, 2005 from http://www.digitalpreservation.gov/index.php?nav=4&subnav= 5&pg=1

Mason, Susanne. Personal interview, March 20, 2005 and April 17, 2005.

Mattison, D. (2004). Lights! Camera! Action! Motion Picture and Video Databases [Electronic version]. Searcher, 12 no. 2, 18-28.

Metadata Encoding & Transmission Standard Official Web Site. Retrieved on March 12, 2005 from http://www.loc.gov/standards/mets/

The Moving Image Archive Web Site. Retrieved April 12, 2005 from http://www.archive.org/movies/movies.php

National Film Preservation Foundation Web Site. Retrieved March 20, 2004 from http://www.filmpreservation.org

The NINCH Guide to Good Practice in the Digital Representation and Management of Cultural Heritage Materials (2002). The Humanities Advanced Technology and Information Institute (HATII), University of Glasgow, and the National Initiative for a Networked Cultural Heritage. Retrieved April 20, 2005 from http://www.nyu.edu/its/humanities/ninchguide/

The Open Video Project Web Site. Retrieved March 15, 2005 from http://www.openvideo.org

Preserving Digital Public Television: Adopting Standards and Selection Criteria and Defining Structure. (n.d.). Project Proposal submitted to the National Digital Information Infrastructure and Preservation Program (NDIIPP). Received via email from Nan Rubin, Project Manager, on April 21, 2005.

Redefining Film Preservation: A National Plan (1994). Retrieved March 18, 2004 from http://lcweb.loc.gov/film/plan.html

34

Savlov, M. (2002). Will Video Kill the Celluloid Star? [Electronic version]. The Austin Chronicle, Vol. 21, No. 42. Retrieved April 20, 2005 from http://www.austinchronicle.com/issues/dispatch/2002-06-21/screens_feature.html

Shepard, T. & MacCarn, D. (1998) The Universal Preservation Format Background and Fundamentals. Paper presented at the Sixth Delos Workshop. Retrieved February 25, 2005 from http://www.ercim.org/publication/ws-proceedings/DELOS6/ upf.pdf

Shepard, T. & MacCarn, D. (1999). The Universal Preservation Format: A Recommended Practice for Archiving Media and Electronic Records. Retrieved February 25, 2005 from http://info.wgbh.org/upf/pdfs/20010818-UPF_RP.pdf

Slaton, Joyce. (2001). Open the archives! Retrieved March 19, 2004 from http://www.sfbg.com/SFLife/35/30/lead.html

Texas Commission on the Arts Videotape Identification and Assessment Guide. Retrieved April 12, 2005 from http://www.arts.state.tx.us/video/id/minidv.asp

VidiPax: The Magnetic Media Restoration Company Web Site. Retrieved February 27, 2005 from http://www.vidipax.com/

W3C Synchronized Multimedia Home Page. Retrieved April 25, 2005 from http://www.w3.org/AudioVideo/

Wactlar, H. & Christel, M. (2002). Digital Video Archives: Managing Through Metadata. Retrieved March 10, 2004 from http://www.clir.org/pubs/reports/pub106/ video.html

Wright, R. (2004). Digital Preservation of Audio, Video and Film. VINE: The Journal of Information and Knowledge Management Systems, 3.34, 71-76.

Yang, M., Marchionini, G., Wildemuth, B., Geisler, G. and Mu, X. (2002). “A Multi-level, Multi-layer and Multi-channel Metadata Schema for the Open Video Digital Library.” Presented at the ICADL 2002 conference. Retrieved May 4, 2005 from http://idl.ils.unc.edu/newidl/publications/ICADL2002.pdf

35

Vita

Sara Natalie Fuchs was born in Beaufort, South Carolina on May 9, 1974, the

daughter of Virginia Sayers-King and Richard Fuchs. After graduating from Alan C.

Pope High School in Marietta, Georgia, she attended the University of Georgia, Athens,

Georgia. She graduated cum laude and Phi Beta Kappa from the University of Georgia in

1996, with a Bachelor of Arts in English and a minor in Mass Communication. Between

her undergraduate and graduate schooling, she has been an Americorps volunteer in

Atlanta, Georgia; a bookshop employee in London, England; a waitress in Dublin,

Ireland; a project administrator, proofreader, and editor in Austin, Texas; and a traveler in

Ecuador, Peru, Bolivia, Chile, Argentina, and Mexico. In June of 2003, she entered the

School of Information at the University of Texas at Austin, to pursue a Master of Science

in Information Studies.

Permanent Address: 96 Robert T. Martinez Jr. St.

Austin, Texas 78702

This Master’s Report was typed by the author.