16
Introducing the Greenstone Introducing the Greenstone Digital Library Software Ian H. Witten Computer Science Department Computer Science Department Waikato University New Zealand http://greenstone.org http://nzdl.org Vittore Casarosa ISTI CNR ISTI-CNR

Greenstone aib 16_feb12_casarosa

Embed Size (px)

DESCRIPTION

AIB Toscana Formazione. Introducing the Greenstone Digital Library Software, Vittore Casarosa, Firenze, 16 febbraio 2012.

Citation preview

Page 1: Greenstone aib 16_feb12_casarosa

Introducing the Greenstone Introducing the Greenstone Digital Library

Software

Ian H. Witten

Computer Science DepartmentComputer Science DepartmentWaikato UniversityNew Zealand

http://greenstone.orghttp://nzdl.orgp g

Vittore CasarosaISTI CNRISTI-CNR

Page 2: Greenstone aib 16_feb12_casarosa

A conceptual view of DLsp

UNIPI BDG 2011-12 Parte 1 -2Vittore Casarosa – Biblioteche Digitali

Page 3: Greenstone aib 16_feb12_casarosa

The three Entities

Digital Library– An organization, which might be virtual, that comprehensively collects,

manages, and preserves for the long term rich digital content, and offers to its user communities specialized functionality on that content, of measurable quality and according to codified policies

Digital Library Systemg y y– A software system that is based on a defined (possibly distributed)

architecture and provides all functionality required by a particular Digital Library. Users interact with a Digital Library through the g y g y gcorresponding Digital Library System

Digital Library Management System– A generic software system that provides the appropriate software– A generic software system that provides the appropriate software

infrastructure both (i) to produce and administer a Digital Library System incorporating the suite of functionality considered foundational for Digital Libraries and (ii) to integrate additional software offering more refinedLibraries and (ii) to integrate additional software offering more refined, specialized, or advanced functionality

UNIPI BDG 2011-12 Parte 1 -3Vittore Casarosa – Biblioteche Digitali

Page 4: Greenstone aib 16_feb12_casarosa

Different types of DLMSyp

Extensible Digital Library System – A complete Digital Library System that is fully operational with respect to basic/

foundational functionality required. It is based on an open software architecture, so that further software components can be incorporated on top of the ones already there with ease (DelosDLMS, Greenstone)with ease (DelosDLMS, Greenstone)

Digital Library System Warehouse – A collection of software components that encapsulate the core suite of DL functionality

and a set of tools that can be used to combine these components in a variety of ways (in Lego-like fashion) to create Digital Library Systems offering a tailored integration of functionalities. New software components can easily be incorporated into the Warehouse for subsequent combination with those already there (BRICKS, DILIGENT)DILIGENT)

Digital Library System Generator– A highly parameterized software system that encapsulates templates covering a broad

range of functionalities, including a defined core suite of DL functionality as well as any advanced functionality that has been deemed appropriate to meet the needs of the specific application domain Through an initialization session, the appropriate parameters are set and configured; at the end of that session, an application is automatically generated, and this constitutes the Digital Library System ready for y g , g y y yinstallation and deployment (MARIAN)

UNIPI BDG 2011-12 Parte 1 -4Vittore Casarosa – Biblioteche Digitali

Page 5: Greenstone aib 16_feb12_casarosa

Greenstone main features

Accessible via any Web browser Server runs on anything (all Windows + Unix + Mac) Server runs on anything (all Windows + Unix + Mac) Collections can be published on CD-ROM/DVD Trivial to install GUI interface for building and publishing collections

Access

GUI interface for building and publishing collections

Collection-specific Full-text and fielded searchSearching/ Full text and fielded search Flexible browsing facilitiesMetadata-based (Dublin Core recommended) Creates all access structures automatically

Searching/browsing

Plugins — new document, metadata formats Classifiers — new metadata browsers

Creates all access structures automatically

Extensible Classifiers new metadata browsers

Multilingual: Documents and interfacesMultimedia: image video audio collections existMulti * Multimedia: image, video, audio collections existMultiformat: Documents and metadata

Multi-UNIPI BDG 2011-12 Vittore Casarosa – Biblioteche Digitali

Page 6: Greenstone aib 16_feb12_casarosa

Supported Standards

Can use any metadata set, Dublin Core supplied Pl i fMetadata Plugins forMetadata

XML ReferMARC OAICDS/ISIS METS ProCite DSpace

Web C bli h G ll i CD ROM

ProCite DSpaceBibTex

Can publish Greenstone collections on CD-ROM Can publish Greenstone collections on OAI Export collections to METS

Serving

Plugins forDocuments

Export collections to DSpace (ready for DSpace’s batch import program)

ug s oocu e sPDFPostScriptWord, RTF

Images (GIF, JPEG, TIFF …)MP3Ogg Vorbis

ZIPExcelPPT

HTMLPlain textLatex

MediaWikiUnknownPlug

(e.g. for audio, MPEG, Midi)

EmailSource codeRealMediaUNIPI BDG 2011-12 Vittore Casarosa – Biblioteche Digitali

Page 7: Greenstone aib 16_feb12_casarosa

Supported Platforms

Operating system: Operating system:– Windows (any version)– Linux (any version)– Unix (most versions, e.g. Solaris) – Mac OS X

Restrictions:– No longer runs under Windows 3.1/3.11– For Librarian interface (GLI), need Java—

which is no longer supported on Windows 95

UNIPI BDG 2011-12 Vittore Casarosa – Biblioteche Digitali

Page 8: Greenstone aib 16_feb12_casarosa

The Greenstone LibrarianInterface (GLI)Interface (GLI)

Building collections Building collections Interactive Java program Runs on anything Build a collection on the computer you are on Build a collection on the computer you are on… plus new applet version Includes metadata editor

C t t d l ith h h t d t ll ti G tCaveat: cannot deal with such huge metadata collections as Greenstone can

(Tutorial exercise: small collection of HTML files)Invoke GLI: build a small collection of HTML filesGatherCreateLook at extracted metadataLook at extracted metadata Set up shortcut in the Librarian interface

Page 9: Greenstone aib 16_feb12_casarosa

Create a new collection

Page 10: Greenstone aib 16_feb12_casarosa

Gather: Gather thefiles togetherfiles together

Page 11: Greenstone aib 16_feb12_casarosa

Create: Build thecollectioncollection

Page 12: Greenstone aib 16_feb12_casarosa

Preview: admirethe resultthe result

Page 13: Greenstone aib 16_feb12_casarosa

Set up environment variables

Building a collection

MakecolCreate a directory for the collection (with subdirectories) put collect cfg

Details aboutthe collection

subdirectories), put collect.cfg file in “etc” subdirectoryPut source

docs into a subdirectory

ImportConvert to archive formatExtract metadata

collect.cfg(plugins)

subd ec o y

Docs inGreenstoneArchive formatArchive format

Build Create indexing & browsing structures, compress …

collect.cfg

Greenstone collection

Search Resultscollect.cfg + macros (main.cfg)UNIPI BDG 2011-12 Vittore Casarosa – Biblioteche Digitali

Page 14: Greenstone aib 16_feb12_casarosa

Documentation

The Greenstone system can be downloaded from the web site (www.greenstone.org)– available for Windows, Mac and Linux– recommended version for Windows is 2 85recommended version for Windows is 2.85– recommended version for Mac and Linux is 2.83

All (most) Greenstone documentation also available at the web site Best way to learn Digital Libraries and Greenstone is:

– Read the book from Ian Witten “How to build a Digital Library”, Morgan Kaufmann, 2010, ISBN 978-0-12-374857-7 (Second edition), , ( )

– execute the recommended exercises (see the 4-days workshop on the download page)

– study the demo collections (see Documented examples on the download– study the demo collections (see Documented examples on the download page)

UNIPI BDG 2011-12 Vittore Casarosa – Biblioteche Digitali

Page 15: Greenstone aib 16_feb12_casarosa

Example collections

UNIPI BDG 2011-12 Vittore Casarosa – Biblioteche Digitali

Page 16: Greenstone aib 16_feb12_casarosa

Example collections

UNIPI BDG 2011-12 Vittore Casarosa – Biblioteche Digitali