Upload
lora-stevenson
View
218
Download
0
Tags:
Embed Size (px)
Citation preview
NAL-Institutional Repository: A Case StudyCSIR Metadata Harvester
I.R.N. GoudarHead, ICAST, [email protected]
National Symposium on Open Access and Building Institutional Repository
National Aerospace LaboratoriesBangalore- 560 017
21-23 Jan 2009
NAL-IR
• Started in 2003 using GSDL
• Adopted E-Prints in 2005
• Plans to Switch over to DSpace
• Presently about 3000 Documents
IR Download Statistics
6000-10000/PM from more than 120 Countries• USA 40%• India 25%• UK 10%• Canada 6%• Japan 5%• China 3%• Germany 3%
• France 3%
Metadata Harvesting
• Harvesting – in the OAI context, harvesting refers
specifically to the gathering together of metadata from a number of distributed repositories into a combined data store
• OAI-PMH (OAI Protocol for Metadata Harvesting) – OAI-PMH is a harvesting protocol for
sharing metadata between services.
• Data Provider (Ex. Institutional repository)
•Maintain repository
•Expose metadata according to a metadata standard (e.g. DC)
•Register with OAI
•Service provider
•Register with OAI
•Extract metadata from registered repositories (‘harvest’)
•Provide services (e.g. central index)
Interoperability through OAI-PMH Protocol*
* http://www.openarchives.org/
IR-1 IR-2
Harvesting Software
• To harvest metadata from the OAI-compliant repositories (data providers), a harvesting software is needed– PKP Harvester from SFU
• http://pkp.sfu.ca/harvester_download
– Arc from ODU• http://oaiarc.sourceforge.net/
CSIR Knowledge Harvester
• Set up at ICAST, NAL
• PKP Harvester
• Presently Covers 4 CSIR Labs
• About 5500 documents
Harvesting CSIR IRs
Tech Reports Pre-prints Journal Articles
Access & Dissemination
NAL NCL NIO NPL SERC Etc
Deposit
Metadata +Full Pub)
Service Provider
ICAST, NAL
Presentation Thesis, etc
Digital Repository
Local Intranetaccess
Remote Internet access
MetadataOAI-PMH
EPrints and DSpace Widely used IR software
Platform
– EPrints: Unix/ Linux/ Perl/ Apache/ MySQL/
XML/ HTML/
– DSpace: Unix/ Linux/ Java/ Tomcat or
Apache/ XML/ HTML/ Ant/ PostGreSQL
Imply software knowledge required for installing, configuring, and maintaining archives developed using these packages.
OAI-PMH: Structure Model
Se
rvic
e P
rovi
der
e-print
Da
ta
Pro
vid
er e-prints
e-print
Da
ta
Pro
vid
er Images
e-print
Da
ta
Pro
vid
er
OPAC
e-print
Da
ta
Pro
vid
er Museum
e-print
Da
ta
Pro
vid
er Archive
Requests:
Identify
ListMetadataformats
ListSets
ListIdentifiers
ListRecords
GetRecord
Responses:
General information
Metadata formats
Set structure
Record identifier
Metadata
Da
ta
Pro
vid
er Harvester
Repository
Repository
Repository
Repository
Repository
Some Useful References
• http://www.openarchives.org/• To register as data provider
– http://www.openarchives.org/pmh/
• For OAI-related tools– http://www.openarchives.org/pmh/tools/
• OAI Repository Explorer for interactive exploration and validation of OAI repositories– http://re.cs.uct.ac.za/