Upload
francis-richardson
View
219
Download
0
Tags:
Embed Size (px)
Citation preview
HATHITRUST A Shared Digital Repository
Why Digitize? or
The Limits of Preservation
2014 TEI/DHCS Plenary SessionEvanston, IL
Mike FurloughExecutive Director, HathiTrust
Books from Different <angles>
Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.
2
Caveat auditor
23 October 2014
3
HATHITRUST.ORG
23 October 2014
Bethany Nowviskie, “Digital Humanities in the Anthropocene” http://nowviskie.org/2014/anthropocene/
From The Art of Google Books: http://theartofgooglebooks.tumblr.com/post/74936156541/married-employees-hand-over-bookplate-and
From Lorcan Dempsey’s Weblog: http://orweblog.oclc.org/archives/001284.html
≠
12
HathiTrust Mission
To contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge.
Efforts include, but are not limited to…building comprehensive collections co-owned and managed by partners.…enabling access by users with print disabilities.…supporting computational research with the collections.…stimulating shared collection storage strategies among libraries.
23 October 2014
13
HathiTrust MembersAllegheny CollegeArizona State UniversityBaylor UniversityBoston CollegeBoston UniversityBrandeis UniversityBrown UniversityCalifornia Digital LibraryCarnegie Mellon UniversityColby CollegeColumbia UniversityCornell UniversityDartmouth CollegeDuke UniversityEmory UniversityFlorida State UniversityGetty Research InstituteHarvard University LibraryIndiana UniversityIowa State UniversityJohns Hopkins UniversityKansas State UniversityLafayette CollegeLibrary of CongressMassachusetts Institute of
TechnologyMcGill University`Michigan State UniversityMontana State UniversityMount Holyoke CollegeNew York Public LibraryNew York UniversityNorth Carolina Central
UniversityNorth Carolina State
UniversityNorthwestern University
The Ohio State UniversityThe Pennsylvania State
UniversityPrinceton UniversityPurdue UniversityRutgers UniversityStanford UniversitySyracuse UniversityTemple UniversityTexas A&M UniversityTexas TechTufts UniversityUniversidad Complutense
de MadridUniversity of AlabamaUniversity of AlbertaUniversity of ArizonaUniversity of British ColumbiaUniversity of CalgaryUniversity of California
BerkeleyDavisIrvineLos AngelesMercedRiversideSan DiegoSan FranciscoSanta BarbaraSanta Cruz
The University of ChicagoUniversity of ConnecticutUniversity of DelawareUniversity of FloridaUniversity of Houston
University of IllinoisUniversity of Illinois at ChicagoThe University of IowaUniversity of KansasUniversity of MaineUniversity of MarylandUniversity of Massachusetts,
AmherstUniversity of MiamiUniversity of MichiganUniversity of MinnesotaUniversity of MissouriUniversity of Nebraska-LincolnUniversity of New MexicoThe University of North
Carolina at Chapel HillUniversity of Notre DameUniversity of OklahomaUniversity of PennsylvaniaUniversity of PittsburghUniversity of QueenslandUniversity of Tennessee, KnoxvilleUniversity of TexasUniversity of UtahUniversity of VermontUniversity of VirginiaUniversity of WashingtonUniversity of Wisconsin-MadisonUtah State UniversityVanderbilt UniversityVirginia TechWake Forest UniversityWashington UniversityYale University Library
23 October 2014
14
Shared Responsibilities
• Leverage expertise across institutions– Collective work
• Distributed Infrastructure– Preservation repository and access services
• University of Michigan• Mirror site: Indiana University
– Metadata management services (Zephir)• California Digital Library
– HathiTrust Research Center• Indiana University and University of Illinois
23 October 2014
15
Growth of Collection
2008 2009 2010 2011 2012 2013 20140
2,000,000
4,000,000
6,000,000
8,000,000
10,000,000
12,000,000
14,000,000
2,477,871
5,221,092
7,836,698
9,966,57210,599,355 10,878,121
12,104,793
23 October 2014
16
Language Distribution (1)
The top 10 languages make up ~87% of all content
English; 49%
German; 9%
French; 7%
Spanish; 5%
Chinese; 4%
Russian; 4%Japanese; 3%
Italian; 3%Arabic; 2%
Latin; 1%
Remaining Languages;
13%
* As of February 17, 2014
23 October 2014
17
Language Distribution (2)
Portuguese; 7%Polish; 7%
Dutch; 5%
Hebrew; 5%
Hindi; 5%
Indonesian; 4%
Korean; 4%Swedish; 4%
Thai; 3%Urdu; 3%Turkish; 3%Danish; 3%
Czech; 3%Croatian; 3%
Persian; 2%Tamil; 2%
Hungarian; 2%
Bengali; 2%Norwegian; 2%
Sanskrit; 2%
Greek,-Modern-(1453--); 2%
Vietnamese; 1%
Ukrainian; 1%
Serbian; 1%
Bulgarian; 1%
Greek,-Ancient-(to-1453); 1%
Armenian; 1%
Romanian; 1%
Marathi; 1%Panjabi; 1%
Telugu; 1%
Catalan; 1%
Malay; 1%
Multiple-languages; 1%
Malayalam; 1% Finnish; 1% Slovak; 1% Slovenian; 1%Turkish,-Ottoman; 1%Yiddish; 1% Nepali; 0%
The next 40 languages make up ~12% of total
* As of February 17, 2014
23 October 2014
18
Dates
* As of February 17, 2014
2000-200910%
1990-199914%
1980-198914%
1970-197913%
1960-196911%
1950-19596%
1940-19494%
1930-19394%
1920-19294%
1910-19194%
1900-19094%
1850-189910% 1800-1849
3%
23 October 2014
19
Preservation with Access
• Preservation– TRAC-certified– Long-term commitments on digital content facilitate planning,
decision-making• Discovery
– Bibliographic and full-text search of all materials– Mechanisms for local loading of records
• Access and Use – Full text search (all users)– Public domain and open access works (all users)– Collections and APIs (all users)– Lawful uses of in-copyright works (members)
23 October 2014
Title page of edition of JF Cooper’s Satanstoe presented in the Making of America database. (Accessed October 18, 2014)
Spine of edition of JF Cooper’s Satanstoe presented in the Early American Fiction database. (Accessed October 18, 2014)
6 of 15 records for different copies of JF Cooper’s Satanstoe presented HathiTrust. (Accessed October 18, 2014)
Some Issues
• Collection strategies– What else?– Associated access and preservation questions
• The “Evolving Scholarly Record”– The book and the network– Fragmentation and loss
26
How to find out more
• About: http://www.hathitrust.org/about• Resources: http://www.hathitrust.org/resources• Twitter: http://twitter.com/hathitrust• Facebook: http://www.facebook.com/hathitrust• Monthly newsletter:
– http:www.hathitrust.org/updates– RSS http://www.hathitrust.org/updates_rss
• Contact us: [email protected]• Blogs: http://www.hathitrust.org/blogs
– Large-scale Search– Perspectives from HathiTrust
21 October 2014