Upload
erik-holland
View
220
Download
0
Tags:
Embed Size (px)
Citation preview
1
DELOS Conference (Pisa, Italy –14 Feb 2007)
Digital Libraries:From Proposals to Projects
to Systems to Theoryto Curricula
Edward A. FoxVirginia Tech
Blacksburg, VA 24061 USA
2
Outline
• Acknowledgments• Introduction• Proposals• Projects• Systems• Theory• Curricula• Examples• Summary• Discussion
3
Acknowledgements
• Students
• Faculty, Staff
• Collaborators
• Support
• Mentors
4
Acknowledgements: Students
• Pavel Calado, Yuxin Chen, Fernando Das Neves, Shahrooz Feizabadi, Robert France, Marcos Gonçalves, Doug Gorton, Nithiwat Kampanya, Rohit Kelapure, S.H. Kim, Neill Kipp, Aaron Krowne, Bing Liu, Ming Luo, Paul Mather, Uma Murthy, Sanghee Oh, Ananth Raghavan, Unni. Ravindranathan, Ryan Richardson, Rao Shen, Ohm Sornil, Hussein Suleman, Ricardo da Silva Torres, Srinivas Vemuri, Wensi Xi, Seungwon Yang, Baoping Zhang, Qinwei Zhu, …
5
Acknowledgements: Faculty, Staff
• Lillian Cassel, Lois Delcambre, Debra Dudley, Roger Ehrich, Joanne Eustis, Weiguo Fan, James Flanagan, C. Lee Giles, Sandy Grant, Eric Hallerman, Eberhard Hilf, John Impagliazzo, Filip Jagodzinski, Douglas Knight, Deborah Knox, Alberto Laender, David Maier, Gail McMillan, Claudia Medeiros, Manuel Perez-Quinones, Jeff Pomerantz, Naren Ramakrishnan, Layne Watson, Barbara Wildemuth, …
6
Other Collaborators (Selected)
• Brazil: FUA, UFMG, UNICAMP• Case Western Reserve University• Emory, Notre Dame, Oregon State• Germany: Univ. Oldenburg• Mexico: UDLA (Puebla), Monterrey• College of NJ, Hofstra, Penn State, Villanova• Portland State University• University of Arizona, University of Florida,
Univ. of Illinois, University of Virginia• VTLS (slides on digital repositories, NDLTD)
Acknowledgements: Support
ACM, Adobe, AOL, CAPES, CNI, CONACyT, DFG, IBM, IMLS, Microsoft, NASA, NDLTD, NLM, NSF (IIS-9986089, 0080748, 0086227, 0307867, 0325579, 0532825, 0535057, 0535060; ITR-0325579; DUE-0121679, 0121741, 0136690, 0333531, 0333601, 0435059), OCLC, SOLINET, SUN, SURA, UNESCO, US Dept. Ed. (FIPSE), VTLS, …
8
Acknowledgements - Mentors
• JCR Licklider – undergrad advisor (1969-71)– Author in 1965 of “Libraries of the Future”– Before, at ARPA, funded start of Internet
• Michael Kessler – BS thesis advisor– Project TIP (technical information project)– Defined bibliographic coupling
• Gerard Salton – graduate advisor (1978-83)– “Father of Information Retrieval”– Application of Scientific Methods toward Integration of
Theory, Systems, Experiments, and Education
9
Libraries of the FutureJCR Licklider, 1965, MIT Press
World
Nation
State
City
Community
10
Introduction – Mentor Challenges
• Scientific method– “Leonardo da Vinci: The first scientist”
• Theory-based -> integration– Across computing disciplines– Over content, representations, services
• Experimentally proven– Evaluation: formative, summative
• Practically useful and beneficial– Make the world better (smaller)– Task support, effectiveness, efficiency
Digital Libraries --- Objectives
• World Lit.: 24hr / 7day / from desktop• Integrated “super” information systems: 5S:
Table of related areas and their coverage• Ubiquitous, Higher Quality, Lower Cost • Education, Knowledge Sharing, Discovery• Disintermediation -> Collaboration • Universities Reclaim Property• Interactive Courseware, Student Works• Scalable, Sustainable, Usable, Useful
12
Digital LibrariesShorten the Chain from
Editor
Publisher
A&I
Consolidator
Library
Reviewer
13
DLs Shorten the Chain to
Author
Reader
Digital
LibraryEditor
Reviewer
Teacher
Learner
Librarian
14
Introduction – 1991 Workshop
• ACM SIGIR ’91 (Chicago)
• Workshop on Future Directions in IR
• Report planning with– Michael McGill– Michael Lesk
• How can we accomplish something?– Address society’s needs
• What if all undergrads had info. access?
• Funding lobbying leading to: DLI, NSDL
15
Computing (flops)Digital content
Com
mun
icat
ions
(ban
dwid
th, c
onne
ctiv
ity)
Locating Digital Libraries in Computing andCommunications Technology Space
Digital Libraries technologytrajectory: intellectualaccess to globally distributed information
less moreNote: we should consider 4 dimensions: computing, communications,content, and community (people)
17
Challenges, Apps, Projects
• US-Korea Collaboration on DLs Workshop
• Reagan Moore and Ed Fox report
• Chart Headings:– Application Domain– Related Institutions– Examples– Technical Challenges– Benefit/Impact
18
Application
Domain
Related Institutions
Examples Technical Challenges Benefit / Impact
PublishingPublishers, Eprint
archivesOAI Quality control, openness Aggregation, organization
Education
Schools, colleges, universities
NSDL, NCSTRL Knowledge management,
reuseabilityAccess to data
Art, Culture
Museum AMICO, PRDLA Digitization, describing,
catalogingGlobal understanding
ScienceGovernment,
Academia, Commerce
NVO, PDG, SwissProt, UK
eScience,European Union Commission
Data modelsreproducibility, faster reuse, faster
advance
(e) Governme
nt
Government Agencies (all levels)
Census Intellectual property rights,
privacy, multi-nationalAccountability, homeland security
(e) Commerce
, (e) Industry
Legal institutionsCourt cases,
patents Developing standards
Standardization, economic development
History, Heritage
Foundations American Memory Content, context,
interpretation
Long term view, perspective, documentation, recording, facilitating, interpretation,
understanding
Cross-cutting
Library, Archive
Web, personal collections
Multi-language, preservation, scalability, interoperability, dynamic
behavior, workflow, sustainability, ontologies,
distributed data, infrastructure
Reduced cost, increased access, pereservation, democratization, leveling, peace, competitiveness
Reagan Moore
Ed Fox
June
2002
for
NSF
19
Introduction – Alliteration
• 5S– Societies– Scenarios– Spaces– Structures– Streams
• 3C– Content– Context– Criticism, commentary
20
Introduction – Alliteration
• 5S– Societies
• Users• Collaboration, Web 2.0
– Scenarios• Workflow, Stories• Services, Components
– Spaces: GIS– Structures: DBMS– Streams: DSMS
• 3C– Content
• Content Management Systems
– Context• Link Structure• NLP• Mental models
– Criticism, commentary• Annotation, Talmud• Cataloging, indexing• Abstracting• Summarizing• Secondary literature
21
Introduction – Time to:
• Treat DL as a serious field
• Achieve balance– Research & Development– Systems & Services– Practice, Continuous Quality Improvement– Use, Benefit
• Train digital librarians
• Achieve sustainability
22
Introduction - Approach
1. Proposals
2. Projects
3. Systems
4. Theory
5. Curricula
1. Vision
2. Objectives
3. Generality
4. Abstraction, conceptualization
5. Education– Structure– Pedagogy
23
Introduction - Proposals
• Early visions• Providing rationale for funding, programs• USA• Europe• India, China, New Zealand, Australia, …• Sustainability, follow-on• Technology transfer
– Stanford DLI-1 -> Google
24
Introduction - Projects
• Body of information
• Media type (maps, video, speech, photos)
• Representation (DC, METS, FRBR)
• Architecture (SOA)
• Interoperability (OAI)
• Archiving and Preservation (UVC)
• Devices (SenseCam, PIM)
• Links with other fields
25
Introduction – Projects -2• Body of information
– Person’s works (Cervantes)– Content by organization
• Library (Library of Congress)• Publisher (ACM)• Million books project• Google consortium
– Content by discipline (Physics, CS, Archaeology)– Content by genre (ETDs)– Content by target audience (TEL, Learners)
26
NSDL Information ArchitectureEssentially as developed by the Technical Infrastructure Workgroup
referenceditems &
collections
referenceditems &
collections
Special Databases
NSDLServicesNSDL
ServicesOther NSDLServices
CI Services
annotation
CI Services
discussion
CI Services
personalization
CI Services
authentication
CI Services
browsing
Core Services:information retrieval
Core Collection-Building Services
harvesting
Core Collection-Building Services
protocols
Core Services:metadata gathering
Portals &ClientsPortals &
ClientsPortals &Clients
Usage Enhancement
Collection Building
User Interfaces
NSDLCollections
NSDLCollections
NSDLCollections
CoreNSDL“Bus”
27
D ig ita l L ib ra r y C o n te n t
A rtic le s ,R e p o rts,
B o o ks
T e xtD o cum e n ts
S p ee ch ,M u s ic
V id eoA u d io
(A e ria l)P h o tos
G e og rap h icIn fo rm ation
M o d e lsS im u la tio ns
S o ftw a re ,P ro g ra m s
G e no m eH u m a n,a n im a l,
p la n t
B ioIn fo rm ation
2 D , 3 D ,V R ,C A T
Im ag es a ndG ra p h ics
C o nte n tT yp e s
28
Introduction – Projects - 5
• Links with other fields– Art, sculpture, music, speech– Medicine: images, datasets, genomics– Law, government
• Statutes, regulations• Citations, commentaries
– Supercomputers, Grid– HCI, Cognitive Psychology– IR, HT, MM
29
CC2001 Information Management Areas
IM1. Information models and systems*
IM8. Distributed DBs
IM2. Database systems* IM9. Physical DB design
IM3. Data modeling* IM10. Data mining
IM4. Relational DBs IM11. Information storage and retrieval
IM5. Database query languages
IM12. Hypertext and hypermedia
IM6. Relational DB design IM13. Multimedia information & systems
IM7. Transaction processing IM14. Digital libraries
* Core components
30
Introduction - Systems
• IBM DL -> content management system
• MARIAN, ODL, WS-ODL
• Greenstone
• DSpace
• Fedora
• DELOS– DLMS– ISIS & OSIRIS
31
Introduction - Theory
• Definitions: Key ideas, concepts
• Taxonomy: Groups, clusters
• Abstraction/generalization: Components
• Models, metamodels
• Proofs: relationships, improvements
• Uses, benefits– Interoperability (map, wrap, mediate, harvest)
• User interface: Explore: browse/search/visualize
– Automation (lex/yacc -> 5SGraph, 5SGen)
32
Introduction - Curricula
• Audience– LIKES, LIS, CS – Developer, implementer, systems librarian– D. Librarian (reference, coll. development)
• Core• Tracks
– Libraries: public, school/univ., corporation– Cultural heritage– Science (research, education)– Persons (PIM)
33
Knowledge Society
HCI
Visualization
Knowledge Management
Systems Analysis & Design
Programming
Database
Algorithms
Architecture
Net-Centricity
Intelligent Systems
Social & Ethical
Library Information Science
Simulation
Chemistry
Biology
Communi-
cations
Healthcare
Art
Music
Marketing
Finance
Modeling
Engineering
Sociology
Psychology
Physics
Architecture
History
Political Science
Geography
Knowledge Society
HCI
Visualization
Knowledge
Systems Analysis & Design
Database
Algorithms
Intelligent Systems
Social & Ethical
Library & Information Science
Economics
Simulation
Chemistry
Biology
Healthcare
Art
Music
Marketing
Finance
Engineering
Sociology
Psychology
Physics
Architecture
History
Political Science
Geography
English
Math
Living In the KnowlEdge Society (LIKES):Core surrounded by enabling computing concepts
and problem providing disciplines
34
DL Curricula
• “Curriculum Development for Digital Libraries” – NSF grant to VT, UNC-CH
• Studied body of literature
• Modules: core, related
• Invite collaboration worldwide
35
Digital Librarian:Needed Skills and Knowledge
• Choi, Y., & Rasmussen, E. (2006)
• What is needed to educate future digital librarians: A study of current practice and staffing patterns in academic and research libraries.
• D-Lib Magazine, 12(9)
• doi:10.1045/september2006-choi.
36
D.Librarian Skills & Knowledge:Technology Related
• DL architecture and software
• Technical and quality standards
• Web markup languages
• Database development and DBMS
• Web design skills
37
D.Librarian Skills & Knowledge:Library Related
• The needs of users
• Digital archiving and preservation
• Cataloging, metadata
• Indexing
• Collection development
38
D.Librarian Skills & Knowledge:Other
• Communication and interpersonal skills
• Project management and leadership skills
• Legal issues
• Grant/proposal writing skills
• Teaching and group presentation skills
39
Development & Evaluation Process
· From research team (VT & UNC)
· From current courses at VT & UNC
· From Advisory Board· From CC 2001
Vision/plan
Design
· Modules· Lessons
· Inspection by Advisory Board
· Inspection by external experts
· Inspection by Doctoral Consortium participants
Evaluate
· CC 2001 context· Curricular needs· Student background
Analyze· Specific strengths· Specific weaknesses
Feedback
· At UNC & VT· At additional universities
(in CS & LIS programs)
Revise & Implement
· Modules ready for use
· Lessons ready for use
Products
· Teacher perceptions· Student perceptions· Student outcomes
Evaluatein the field
40
Curriculum framework
41
Modules
1. Collection Development2. Digital objects / Composites / Packages3. Metadata, Cataloging, Author submission4. Architecture, Interoperability5. Data visualization6. Services7. Intellectual property rights management,
Privacy, Protection8. Social issues / Future of DLs9. Archiving and Preservation
42
Conference papers x modules
0
20
40
60
80
100
120
140
160
180
200
1 2 3 4 5 6 7 8 9
Module ID
Nu
mb
er
of
con
fere
nce
pa
per
s
JCDL 05
JCDL 04
JCDL 03
JCDL 02
JCDL 01
ACM DL 00
ACM DL 99
ACM DL 98
ACM DL 97
ACM DL 96
43
Taxonomy of DL Educational Resources
44
2-a: Collection development/selection policies2-b: Digitization
3-a: Text resources3-b: Multimedia3-c (8-b): File formats, transformation, migration
4-a: Metadata, cataloging, metadata markup, metadata harvesting4-b: Ontologies, classification, categorization4-c: Vocabulary control, thesauri, terminologies
4-d: Subject description4-e: Information architecture (e.g., hypertext, hypermedia)4-f: Object description and organization for a specific domain
5-a: Architecture overviews/models5-b: Applications5-c: Identifiers, handles, DOI, PURL
6-a: Info needs, relevance, evaluation6-b: Search strategy, info seeking behavior, user modeling
8-a: Repositories, archives, storage8-b (3-c): File formats, transformation, migration
9-a: Project management9-b: DL case studies9-c: DL evaluation9-d: Usability assessment, user studies
9-e: Bibliometrics, Webometrics9-f: Legal issues (e.g., copyright)9-g: Cost/economic issues9-h: Social issues
10-a: Future of DLs10-b: Education for digital librarians
Digital Objects3
Collection Development
2
Overview1
Architecture (agents, mediators)
5
CORE TOPICS
DL education and research
10
7-a: Search engines, IR, indexing methods7-b: Reference services7-c: Recommender systems
5-d: Protocols5-e: Interoperability5-f: Security
2-c: Harvesting2-d: Document and e-publishing/presentation markup
6-c: Sharing, networking, interchange (e.g., social)6-d: Interaction design, info summarization and visualization, usability assessment
User Behavior/ Interactions
6
7-d: Routing, community filtering7-e: Web publishing (e.g., wiki, rss, Moodle, etc.)Services7
8-c: Sustainability
Management and Evaluation
9
Archiving and Preservation
Integrity8
1-a (10-c): Conceptual frameworks, theories
10-c (1-a): Conceptual framework, theories10-d: DL research initiatives
Info/ Knowledge Organization
4
45
Overview1
1-a (10-c): Conceptual frameworks, theories
46
Collection Development
2
2-a: Collection development/selection policies2-b: Digitization
2-c: Harvesting2-d: Document and e-publishing/presentation markup
47
Digital Objects3
3-a: Text resources3-b: Multimedia3-c (8-b): File formats, transformation, migration
48
Info/ Knowledge Organization
4
4-a: Metadata, cataloging, metadata markup, metadata harvesting4-b: Ontologies, classification, categorization4-c: Vocabulary control, thesauri, terminologies
4-d: Subject description4-e: Information architecture (e.g., hypertext, hypermedia)4-f: Object description and organization for a specific domain
49
Architecture (agents, mediators)
5
5-a: Architecture overviews/models5-b: Applications5-c: Identifiers, handles, DOI, PURL5-d: Protocols5-e: Interoperability5-f: Security
50
User Behavior/ Interactions
6
6-a: Info needs, relevance, evaluation6-b: Search strategy, info seeking behavior, user modeling
6-c: Sharing, networking, interchange (e.g., social)6-d: Interaction design, info summarization and visualization, usability assessment
51
Services7
7-d: Routing, community filtering7-e: Web publishing (e.g., wiki, rss, Moodle, etc.)
7-a: Search engines, IR, indexing methods7-b: Reference services7-c: Recommender systems
52
Archiving and Preservation
Integrity8
8-a: Repositories, archives, storage8-b (3-c): File formats, transformation, migration
8-c: Sustainability
53
Management and Evaluation
9
9-a: Project management9-b: DL case studies9-c: DL evaluation9-d: Usability assessment, user studies9-e: Bibliometrics, Webometrics9-f: Legal issues (e.g., copyright)9-g: Cost/economic issues9-h: Social issues
54
DL education and research
10
10-a: Future of DLs10-b: Education for digital librarians
10-c (1-a): Conceptual framework, theories10-d: DL research initiatives
55
Personalizing A Course Website Using the NSDL
William Cameron2, Boots Cassel2, Edward Fox1, Manuel Perez-Quinones1, Manas
Tungare1, Xiaoyan Yu1
Virginia Tech1, Villanova2
56
Syllabus Collection …Towards an intelligent educational system
Unstructured Syllabus Text
StructuredSyllabus
Text
SearcherRecommender
Crawler
SyllabusClassifier
Extractor
Editor
SyllabusOntology
Services
Publisher
Other NSDL
Resources
Potential Syllabus
Text
Classification Scheme
ResourceClassifier
57
Syllabus Ontology
• Standard, machine understandable
• Ontology Editor: Protégé
• Syllabus Schema: SylVia
• http://doc.cs.vt.edu/ontologies/
58
Creating new syllabus
• Web-based application to support entry of syllabi into collection
• Moodle Plug-in in the works
• Uses CC 2001 to select topics for a course
59
Example: CBIR + SI
• Integration of– CBIR– Superimposed information (annotations …)
• Application to– Biodiversity, fisheries and wildlife– Archaeology
• Systems– CBISC, SIMPEL, SIERRA
60
EKEY: The electronic key for identifying freshwater fishes
61
Biodiversity Information Systems
• Retrieve fish descriptions of all fish whose shape is similar to that shown in Figure below, which belong to genus “Notropis”, which have “large eyes” and “dorsal stripe”, and have been observed within the catchments of the “Tennessee” river
62
Here is another scenario …• An archeologist wants to write
commentaries on artifacts discovered in the field
– Manually annotate images (and parts)
– Search for images (and parts), and annotations
– Automatically annotate/tag similar images (and parts)
– Share annotations and images
• Using an Archeology digital library in his study, he wants to be able to:
Sources: http://www.dorsetforyou.com, http://www.archaeology.org
Source: http://www.bewegende-plaatjes.net
63
Functionality required
• Digital Library (DL) users need, but get little assistance, regarding tasks:– Selecting and Annotating images and parts of
images• Preserve original context of information• Manual and automated annotation
– Content-based image retrieval of images and parts of images
– Combined text- and content-based image retrieval of images and parts of images
– Share selections and annotations
64
Layers in an SI system
Superimposed
Layer
Base Layer
Information Source1
Information Source2
Information Sourcen
…
marks
* Source: ICDE04 presentation by Murthy, et. al
65
Superimposed Applications
SIMPEL: A SuperImposed Multimedia Presentation Editor and pLayer
0 5 10 15 20
A
C
B
Enhanced CMapTools
66
Content-Based Image Retrieval (CBIR)
• Retrieve images similar to a user-defined specification or pattern (e.g., shape sketch, image example)
• Goal: To support image retrieval based on content properties (e.g., shape, color or texture), usually encoded into feature vectors
67
Effective Image Descriptor
Feature Vector
Image descriptors
• Image Descriptor
69
Example: Histogram
Image
Corresponding histogram
• Frequency count of each individual color
• Most commonly used color feature representation
Source: Andrade, D.
70
Texture Descriptors
71
A typical CBIR systemInterface
Query Specification Visualization
Image Database
Ranking
Similarity ComputationQuery-processing
Module
Query Pattern Similar Images
Feature VectorExtraction
FeatureVectors
Images
Data Insertion
72
CBISC Architecture
73
CBISC in ETANA
74
SIERRA
• A tool that allows users to select parts of images and associate them with text annotations.
• Performs information retrieval as annotations and associated marks in two ways, either for:– images or marks similar (in content) to a
specified image or mark– annotations containing specified query terms
75
Annotating an image
76
Searching over annotations
77
Searching over images/sub-images
78
Theory
79
Informal 5S & DL Definitions
DLs are complex systems that
• help satisfy info needs of users (societies)
• provide info services (scenarios)
• organize info in usable ways (structures)
• present info in usable ways (spaces)
• communicate info with users (streams)
80
5Ss
Ss Examples Objectives
Streams Text; video; audio; image Describes properties of the DL content such as encoding and language for textual material or particular forms of multimedia data
Structures Collection; catalog; hypertext; document; metadata
Specifies organizational aspects of the DL content
Spaces Measure; measurable, topological, vector, probabilistic
Defines logical and presentational views of several DL components
Scenarios Searching, browsing, recommending
Details the behavior of DL services
Societies Service managers, learners, teachers, etc.
Defines managers, responsible for running DL services; actors, that use those services; and relationships among them
81
5S
structures (d.10)streams (d.9) spaces (d.18) scenarios (d.21) societies (d. 24)
structural metadataspecification(d.25)
descriptive metadataspecification(d.26)
repository(d. 33)
collection (d. 31)
(d.34)indexingservice
structured stream (d.29)
digitalobject (d.30)
metadata catalog (d.32)
browsingservice
(d.37)
searchingservice (d.35)
digital library(minimal) (d. 38)
services (d.22)
sequence (d. 3)
graph (d. 6)function (d. 2)
measurable(d.12), measure(d.13), probability (d.14), vector (d.15), topological (d.16) spaces
event (d.10)state (d. 18)
hypertext(d.36)
sequence (d. 3)
transmission(d.23)
relation (d. 1) language (d.5)
grammar (d. 7)
tuple (d. 4)*
5S and DL formal definitions and compositions (April 2004 TOIS)
82
5SL – The Minimal DL Metamodel
Index
Actor
Search Manager
Index Manager
Document
Collection Catalog
Metadata
Service
Manager
Interface Manager
Community
Event
Scenario
Service
Browsing Manager
User
Interface
Scenarios (Meta-) Model
Spatial
(Meta-) Model
Meta-Models
Meta-ModelsPrimitives
Stream
(Meta-)ModelStructural (Meta-) Model
Text AudioVideo Image
Societal (Meta-) Model
Retrieval
Model
uses
runs
receiver
Repository Manager
83
Streams
text
audio
image
video digitalobject
Repository
CollectionCatalog
describes
stores
is_version_of/ cites/links_to
Index
Service
Scenario
event
extends
reuses
ServiceManager
Actor
operationexecutes
participates_in
recipient
runs
Scenarios
Societies
inherits_from/includes
association
uses
Topological
ProbabilisticMetric
Measurable
Measure
describes
employsproduces
employsproduces
employs
produces
Structures
Spaces
Vector
contains
metadata specifications
is_a is_a
precedes
happens_before
is_a
redefinesinvokes
contains
contains
84
Browsing Collaborating Customizing Filtering Providing access Recommending Requesting Searching Visualizing
Annotating Classifying Clustering Evaluating Extracting Indexing
Measuring Publicizing
Rating Reviewing (peer)
Surveying Translating
(language)
Conserving Converting
Copying/Replicating Emulating Renewing
Translating (format)
Acquiring Cataloging
Crawling (focused) Describing Digitizing
Federating Harvesting Purchasing Submitting
Preservational Creational
Add Value
Repository-Building
Information Satisfaction
Services
Infrastructure Services
85
Ontology: Applications
86
SearchingBrowsing
queryanchor
Society
actor
Collection, {digital object}
Recommending Filtering Binding Visualizing Expanding query
user model query/category {digital object}
{digital object} {digital object}
binder
InformationSatisfaction Services
space query’
fundamental
Rating Training
Infrastructure
Services (Add_Value)
composite
Requesting
handle
p pp
e e e{(digital object, actor, rate) }
p
e
e
p p p p p
e e
classifier
e ee e
e
p
e
Indexing
Index
p
e
transformer
e
87
Requirements Analysis Design Implementation Test
5S 5SLOO ClassesWorkflow Components
DLEvaluation
5SGraph 5SLGenFormalTheory/Metamodel
DL XMLLog
88
Digital Object
RepositoryCollection Minimal DL
Metadata Catalog
Descriptive Metadata
Specification
A Minimal DL in the 5S Framework
Structural Metadata
Specification
Streams Structures Spaces Scenarios Societies
indexing
browsing searching
services
hypertext
Structured Stream
89
Streams Structures Spaces Scenarios Societies
indexing
browsing searching
services
hypertext
Structured Stream
Descriptive Metadata
specification
SpaTemOrg
StraDia
Arch Descriptive Metadata specification
ArchDO
ArchObj
ArchColl
Arch Metadata catalog
ArchDColl ArchDR Minimal ArchDL
A Minimal ArchDL in the 5S Framework
90
Tools/Applications
5S MetaModel
5SGraphDL
Expert
DL Designer
5SL DL
Model
5SLGen
Practitioner
Researcher
TailoredDL
Teacher
componentpool
ODLSearch,ODLBrowse,ODLRate,ODLReview,
…….
Logging ModuleXMLLog
91
5SGen – Version 2: ODL, Services, Scenarios
5SL-SocietiesModel (1)
XPATH/JDOMTransform (2)
XMI:ClassModel (3)
Xmi2Java (4)
JavaClasses
Model (5)
superclass
DeterministicFSM (10)
SMC (11)
JavaFinite
State MachineClass
Controller (12)
5SL-ScenarioModel (6)
XPath/JDOMTransform (7)
StateChartModel (8)
Scenario Synthesis (9)
ODLSearch
Java
Wrapping
import
ComponentPool
ODLBrowse
Java
Wrapping
import
.
.
.
JSPUser
InterfaceView (13)
Generated DL Services
DLDesigner
DLDesigner
binds
5SLGen
5SL-SocietiesModel (1)
XPATH/JDOMTransform (2)
XMI:ClassModel (3)
Xmi2Java (4)
JavaClasses
Model (5)
superclass
DeterministicFSM (10)
SMC (11)
JavaFinite
State MachineClass
Controller (12)
5SL-ScenarioModel (6)
XPath/JDOMTransform (7)
StateChartModel (8)
Scenario Synthesis (9)
ODLSearch
Java
Wrapping
import
ComponentPool
ODLBrowse
Java
Wrapping
import
.
.
.
ODLSearch
Java
Wrapping
import
ComponentPool
ODLBrowse
Java
Wrapping
import
.
.
.
JSPUser
InterfaceView (13)
Generated DL Services
DLDesigner
DLDesigner
binds
5SLGen
92
5SGraph
Workspace
(instance model)
Structured
toolbox
(metamodel)
93
94
Information model
95
96
Formal Definition of DL Integration
• DLi=(Ri, DMi, Servi, Soci), 1 i n– Ri is a network accessible repository– DMi is a set of metadata catalogs for all collections– Servi is a set of services– Soci is a society
• UnionRep• UnionCat• UnionServices• UnionSociety
• Given n individual libraries, integrate the n DLs to create a UnionDL.
97
Taxonomy of Union Services
Infrastructure Services Information Satisfaction Services
Essential Add_Vaue Essential Add_value
indexing
harvesting
mapping
(Schema registry with analyses & mapping)
(data) cleaning
(focused) crawling
copying (replicating)
logging
(format) translating
(Service to support annotation)
(Metadata validation)
searching
browsing
access control
binding
comparison
(forum) discussion
(query) expansion
filtering
recommendation
visualization
Note: Suggested NSDL services are shown in blue.
98
Union Catalog Integration
VN MetadataFormat
Global MetadataFormat
VNCatalog
HDCatalog
Union Catalog
MappingTool
Wrapper
MappingTool
Wrapper
HD MetadataFormat
Virtual Nimrin(VN)
Halif DigMaster(HD)
Union ArchDL
99
local schema global schema
100
5SQual Tool
Implementing a Tool Aimed
at Automatic Quality Assessment in Digital Libraries
Bárbara Lagoeiro Moreira
101
Quality Base Model
Catalog
Collection
Digital Object
Metadata
Repository
Services
• Accuracy• Completeness• Conformance
• Completeness• Impact Factor
• Completeness• Consistency
• Accessibility• Pertinence• Preservability• Relevance
• Similarity• Significance• Timeliness
• Completeness• Consistency
• Composability• Efficiency• Effectiviness• Extensibility
• Reusability• Reliability
Numeric Indicators
102
satisfaction
performanceexpectancy
(PE)
behavioralIntention to
(re)use
relevance adequacy timeliness
reliability understandability scope
accessibilityease of use
joy of use reliability
userinterface
social influence (SI)
information quality(IQ)
system quality(SQ)
DL Success Model
system quality(SQ)
103
Systems
104
DL Manifesto - 1
• DL Reference Model• In support of the future European Digital Library• Developed by team connected with DELOS
(Candela, Casteli, Ioannidis, Koutrica, Meghini, Pagano, Ross, Schek, Schuldt)
• Draft 2.2 presented in Frescati, near Rome, June 2006 – 79 pages
• Could be integrated with work of DLF, JISC, etc.
105
DL Manifesto – 2: 3 Tiers
106
DL Manifesto – 3: Main Concepts
107
DL Manifesto – 4: Actor Roles
108
109
SIMILEObjectives, Current Status,
and DemonstrationStephen J. Garland, MIT CSAIL
Mick Bass, HP Labs
DSpace User Group Meeting
Cambridge, MA
March 11, 2004
110
Simile Goals
• Make the Semantic Web a reality– For libraries and their users– Support heterogeneous, multi-community metadata– Provide tools for viewing, browsing, searching
• Assess current state of Semantic Web– Explore utility of standards (RDF, RDFS, OWL)– Extend Semantic Web tool stack for libraries– Identify issues, gaps, opportunities, best practices
for digital libraries
111
What is Fedora™?
• Slides courtesy Vinod Chachra of VTLS
Flexible Extensible Digital Object Repository Architecture
112
Fedora™Repository
E x ter n a lC o n ten tS o u r c e
E x ter n a lC o n ten tS o u r c e
HT
TP
E x ter n a l C o n ten tR etr iev er
X M L F ile s
Re la t io n a l D B
S e s s io n M a n a g e me n tU s e r A u th e n t ic a t io n
P o l icies
U s ers /G ro u p s
H T T P
F T P
D atas tr eam s
D ig ita l O b jec tsS to rag e S u b s ys te m
S e c u rityS u b s ys te m
W e b Se r vi c eE xpo s ur eL aye r
SO
AP
R em o teS er v ic e
L o c alS er v ic e
M an ag e A c c e s s S e arc h O A I P ro v id e r
M an ag e m e n tS u b s ys te m
A c c e s sS u b s ys te m
HT
TP
FT
P
H T T PH T T P S O A P H T T P S O A P H T T P S O A P
C lie n tA pplica t io n
B a tchPro g ra m
S e rv e rA pplica t io n
W e bB ro ws e r
Co mp o n e n t M g mt
O b je c t M g mt
O b je c t Va lid a t io n
P ID Ge n e ra t io n
O b je c t D is s e min a t io n
O b je c t Re fle c t io n
P o lic y En fo rc e me n t
P o lic y M g mt
Co n te n t
Web Service Web Service Exposure Exposure LayerLayer
Adapted from Slide by V. Chachra, VTLS
113
VITAL / Fedora Relationship
114
OCKHAM Library Network
NSDL
OCKHAM
Services
NSDLServices
Teachers LearnersLibrarians
OCKHAMLibrary
Network
LibraryServices
115
OCKHAM
• Simplicity (a la OCCAM’s razor)
• Support by Mellon and DLF
• Four main ideas:
1. Components
2. Lightweight protocols
3. Open reference models (e.g., 5S, OAIS)
4. Community perspective and involvement
• Funded by NSF in NSDL, with P2P
116
Summary
• Acknowledgments• Introduction• Proposals• Projects• Systems• Theory• Curricula• Examples• Summary• Discussion
117
Questions?Comments?
See http://fox.cs.vt.edu/talks/