Upload
intranda-gmbh
View
318
Download
0
Embed Size (px)
Citation preview
Rioghnach Ahern, Digital Ingest Coordinator, 18th of November, 2016.
UK Goobi User Meeting
Current Work and New Developments
2
The vision for the Wellcome Library’s digital engagement programme is to create the world’s largest free and unrestricted
digital library focused on the cultural contexts of health.
3
Digitisation at the Wellcome Library• The Wellcome Library is developing a world-class online resource for the
history of medicine by digitising a substantial proportion of its holdings & making the content freely available on the web.
• We will also strive to include important content from other institutions, which complements our own holdings, & to explore commercial partnerships for cost-effective digitisation of other parts of our collections.
• The Wellcome Library began its digitisation programme in 2010; its ambition is to make freely available over 50 million pages of historic medical books, archives, manuscripts & journals by 2020.
4
27.8 million images ingested so far!
5
Digitised Material Types• Monographs – 93,719.• Archives – 40,010.• Reports – 5,818.• Artworks – 3,709.• Audio-visual – 1,096.• Manuscripts – 805.• Journals – 260.
6
On-going Projects• UK Medical Heritage Library• UK Medical Officer of Health Reports• Mental Health Care Archives• Medieval Manuscripts• Recipe Books• Royal Army Medical Corps Archives• Ancestry• Visual Arts Discovery
7
UK Medical Heritage Library
• Wellcome Library• Royal College of Physicians of London• Royal College of Physicians of Edinburgh• Royal College of Surgeons of England• UCL (University College London)• University of Leeds• University of Glasgow• London School of Hygiene & Tropical Medicine• King's College London• University of Bristol
8
9
11
FTP’ing of Third Party Content• FTP process linked to Goobi for processing of the content to be
automated.• Dedicated workflow created for Goobi which monitors the FTP
server for the arrival of new data.• Automatically virus checks & quarantines the content over a 24
hour period before automatically uploaded into Goobi.• If something fails the virus check or the image folders don’t
match the metadata in Goobi, they will go into a “suspicious” folder.
13
14
Sources of digitised content
Goobi(METS/OCR)
Preservica/DLCS
In-house
Institutions
Contractors
Harvesting
TIFF or JP2
TIFF or JP2HD & ftp
TIFF or JP2
Normalises TIFF to JP2
Manual
Automatic
Jpylyzer validates JP2Auto harvesting of JP2 & DMD
Grey literature
Pro
ject
Man
ager
s / I
nges
t Offi
cer
Pro
ject
Man
ager
s
Ingest Officer / Digital Curator
Snagging
Snagging
18
Benefits of Automation• Reduced quarantine time for FTP’d content speeds up projects
considerably.• Automation has upped our throughput.• Fewer human clicks in general.• Less ingest resource required.
19
Future Developments• Improve the matching process for automated image upload.• This would decrease the number of manual image upload tasks required.• Automate the METS edition task for non-sensitive and open content.• Create Rulesets for automating the METS edition of content according to
publication year.
20
Medical Officer of Health Reports
• Parsing the harvest of Medical Officer of Health reports and the UK Medical Heritage Library project.
• Downloading rights metadata directly from the Internet Archive.
Click icon to add image. Then send to back to see titles
Thank you
@RioghnachAhern Linkedin.com/rioghnachahern