1 DELOS Conference (Pisa, Italy –14 Feb 2007) Digital Libraries: From Proposals to Projects to...

Preview:

Citation preview

1

DELOS Conference (Pisa, Italy –14 Feb 2007)

Digital Libraries:From Proposals to Projects

to Systems to Theoryto Curricula

Edward A. FoxVirginia Tech

Blacksburg, VA 24061 USA

2

Outline

• Acknowledgments• Introduction• Proposals• Projects• Systems• Theory• Curricula• Examples• Summary• Discussion

3

Acknowledgements

• Students

• Faculty, Staff

• Collaborators

• Support

• Mentors

4

Acknowledgements: Students

• Pavel Calado, Yuxin Chen, Fernando Das Neves, Shahrooz Feizabadi, Robert France, Marcos Gonçalves, Doug Gorton, Nithiwat Kampanya, Rohit Kelapure, S.H. Kim, Neill Kipp, Aaron Krowne, Bing Liu, Ming Luo, Paul Mather, Uma Murthy, Sanghee Oh, Ananth Raghavan, Unni. Ravindranathan, Ryan Richardson, Rao Shen, Ohm Sornil, Hussein Suleman, Ricardo da Silva Torres, Srinivas Vemuri, Wensi Xi, Seungwon Yang, Baoping Zhang, Qinwei Zhu, …

5

Acknowledgements: Faculty, Staff

• Lillian Cassel, Lois Delcambre, Debra Dudley, Roger Ehrich, Joanne Eustis, Weiguo Fan, James Flanagan, C. Lee Giles, Sandy Grant, Eric Hallerman, Eberhard Hilf, John Impagliazzo, Filip Jagodzinski, Douglas Knight, Deborah Knox, Alberto Laender, David Maier, Gail McMillan, Claudia Medeiros, Manuel Perez-Quinones, Jeff Pomerantz, Naren Ramakrishnan, Layne Watson, Barbara Wildemuth, …

6

Other Collaborators (Selected)

• Brazil: FUA, UFMG, UNICAMP• Case Western Reserve University• Emory, Notre Dame, Oregon State• Germany: Univ. Oldenburg• Mexico: UDLA (Puebla), Monterrey• College of NJ, Hofstra, Penn State, Villanova• Portland State University• University of Arizona, University of Florida,

Univ. of Illinois, University of Virginia• VTLS (slides on digital repositories, NDLTD)

Acknowledgements: Support

ACM, Adobe, AOL, CAPES, CNI, CONACyT, DFG, IBM, IMLS, Microsoft, NASA, NDLTD, NLM, NSF (IIS-9986089, 0080748, 0086227, 0307867, 0325579, 0532825, 0535057, 0535060; ITR-0325579; DUE-0121679, 0121741, 0136690, 0333531, 0333601, 0435059), OCLC, SOLINET, SUN, SURA, UNESCO, US Dept. Ed. (FIPSE), VTLS, …

8

Acknowledgements - Mentors

• JCR Licklider – undergrad advisor (1969-71)– Author in 1965 of “Libraries of the Future”– Before, at ARPA, funded start of Internet

• Michael Kessler – BS thesis advisor– Project TIP (technical information project)– Defined bibliographic coupling

• Gerard Salton – graduate advisor (1978-83)– “Father of Information Retrieval”– Application of Scientific Methods toward Integration of

Theory, Systems, Experiments, and Education

9

Libraries of the FutureJCR Licklider, 1965, MIT Press

World

Nation

State

City

Community

10

Introduction – Mentor Challenges

• Scientific method– “Leonardo da Vinci: The first scientist”

• Theory-based -> integration– Across computing disciplines– Over content, representations, services

• Experimentally proven– Evaluation: formative, summative

• Practically useful and beneficial– Make the world better (smaller)– Task support, effectiveness, efficiency

Digital Libraries --- Objectives

• World Lit.: 24hr / 7day / from desktop• Integrated “super” information systems: 5S:

Table of related areas and their coverage• Ubiquitous, Higher Quality, Lower Cost • Education, Knowledge Sharing, Discovery• Disintermediation -> Collaboration • Universities Reclaim Property• Interactive Courseware, Student Works• Scalable, Sustainable, Usable, Useful

12

Digital LibrariesShorten the Chain from

Editor

Publisher

A&I

Consolidator

Library

Reviewer

13

DLs Shorten the Chain to

Author

Reader

Digital

LibraryEditor

Reviewer

Teacher

Learner

Librarian

14

Introduction – 1991 Workshop

• ACM SIGIR ’91 (Chicago)

• Workshop on Future Directions in IR

• Report planning with– Michael McGill– Michael Lesk

• How can we accomplish something?– Address society’s needs

• What if all undergrads had info. access?

• Funding lobbying leading to: DLI, NSDL

15

Computing (flops)Digital content

Com

mun

icat

ions

(ban

dwid

th, c

onne

ctiv

ity)

Locating Digital Libraries in Computing andCommunications Technology Space

Digital Libraries technologytrajectory: intellectualaccess to globally distributed information

less moreNote: we should consider 4 dimensions: computing, communications,content, and community (people)

17

Challenges, Apps, Projects

• US-Korea Collaboration on DLs Workshop

• Reagan Moore and Ed Fox report

• Chart Headings:– Application Domain– Related Institutions– Examples– Technical Challenges– Benefit/Impact

18

Application

Domain

Related Institutions

Examples   Technical Challenges Benefit / Impact

PublishingPublishers, Eprint

archivesOAI   Quality control, openness Aggregation, organization

Education

Schools, colleges, universities

NSDL, NCSTRL  Knowledge management,

reuseabilityAccess to data

Art, Culture

Museum AMICO, PRDLA  Digitization, describing,

catalogingGlobal understanding

ScienceGovernment,

Academia, Commerce

NVO, PDG, SwissProt, UK

eScience,European Union Commission

  Data modelsreproducibility, faster reuse, faster

advance

(e) Governme

nt

Government Agencies (all levels)

Census  Intellectual property rights,

privacy, multi-nationalAccountability, homeland security

(e) Commerce

, (e) Industry

Legal institutionsCourt cases,

patents  Developing standards

Standardization, economic development

History, Heritage

Foundations American Memory  Content, context,

interpretation

Long term view, perspective, documentation, recording, facilitating, interpretation,

understanding

Cross-cutting

Library, Archive

Web, personal collections

 

Multi-language, preservation, scalability, interoperability, dynamic

behavior, workflow, sustainability, ontologies,

distributed data, infrastructure

Reduced cost, increased access, pereservation, democratization, leveling, peace, competitiveness

Reagan Moore

Ed Fox

June

2002

for

NSF

19

Introduction – Alliteration

• 5S– Societies– Scenarios– Spaces– Structures– Streams

• 3C– Content– Context– Criticism, commentary

20

Introduction – Alliteration

• 5S– Societies

• Users• Collaboration, Web 2.0

– Scenarios• Workflow, Stories• Services, Components

– Spaces: GIS– Structures: DBMS– Streams: DSMS

• 3C– Content

• Content Management Systems

– Context• Link Structure• NLP• Mental models

– Criticism, commentary• Annotation, Talmud• Cataloging, indexing• Abstracting• Summarizing• Secondary literature

21

Introduction – Time to:

• Treat DL as a serious field

• Achieve balance– Research & Development– Systems & Services– Practice, Continuous Quality Improvement– Use, Benefit

• Train digital librarians

• Achieve sustainability

22

Introduction - Approach

1. Proposals

2. Projects

3. Systems

4. Theory

5. Curricula

1. Vision

2. Objectives

3. Generality

4. Abstraction, conceptualization

5. Education– Structure– Pedagogy

23

Introduction - Proposals

• Early visions• Providing rationale for funding, programs• USA• Europe• India, China, New Zealand, Australia, …• Sustainability, follow-on• Technology transfer

– Stanford DLI-1 -> Google

24

Introduction - Projects

• Body of information

• Media type (maps, video, speech, photos)

• Representation (DC, METS, FRBR)

• Architecture (SOA)

• Interoperability (OAI)

• Archiving and Preservation (UVC)

• Devices (SenseCam, PIM)

• Links with other fields

25

Introduction – Projects -2• Body of information

– Person’s works (Cervantes)– Content by organization

• Library (Library of Congress)• Publisher (ACM)• Million books project• Google consortium

– Content by discipline (Physics, CS, Archaeology)– Content by genre (ETDs)– Content by target audience (TEL, Learners)

26

NSDL Information ArchitectureEssentially as developed by the Technical Infrastructure Workgroup

referenceditems &

collections

referenceditems &

collections

Special Databases

NSDLServicesNSDL

ServicesOther NSDLServices

CI Services

annotation

CI Services

discussion

CI Services

personalization

CI Services

authentication

CI Services

browsing

Core Services:information retrieval

Core Collection-Building Services

harvesting

Core Collection-Building Services

protocols

Core Services:metadata gathering

Portals &ClientsPortals &

ClientsPortals &Clients

Usage Enhancement

Collection Building

User Interfaces

NSDLCollections

NSDLCollections

NSDLCollections

CoreNSDL“Bus”

27

D ig ita l L ib ra r y C o n te n t

A rtic le s ,R e p o rts,

B o o ks

T e xtD o cum e n ts

S p ee ch ,M u s ic

V id eoA u d io

(A e ria l)P h o tos

G e og rap h icIn fo rm ation

M o d e lsS im u la tio ns

S o ftw a re ,P ro g ra m s

G e no m eH u m a n,a n im a l,

p la n t

B ioIn fo rm ation

2 D , 3 D ,V R ,C A T

Im ag es a ndG ra p h ics

C o nte n tT yp e s

28

Introduction – Projects - 5

• Links with other fields– Art, sculpture, music, speech– Medicine: images, datasets, genomics– Law, government

• Statutes, regulations• Citations, commentaries

– Supercomputers, Grid– HCI, Cognitive Psychology– IR, HT, MM

29

CC2001 Information Management Areas

IM1. Information models and systems*

IM8. Distributed DBs

IM2. Database systems* IM9. Physical DB design

IM3. Data modeling* IM10. Data mining

IM4. Relational DBs IM11. Information storage and retrieval

IM5. Database query languages

IM12. Hypertext and hypermedia

IM6. Relational DB design IM13. Multimedia information & systems

IM7. Transaction processing IM14. Digital libraries

* Core components

30

Introduction - Systems

• IBM DL -> content management system

• MARIAN, ODL, WS-ODL

• Greenstone

• DSpace

• Fedora

• DELOS– DLMS– ISIS & OSIRIS

31

Introduction - Theory

• Definitions: Key ideas, concepts

• Taxonomy: Groups, clusters

• Abstraction/generalization: Components

• Models, metamodels

• Proofs: relationships, improvements

• Uses, benefits– Interoperability (map, wrap, mediate, harvest)

• User interface: Explore: browse/search/visualize

– Automation (lex/yacc -> 5SGraph, 5SGen)

32

Introduction - Curricula

• Audience– LIKES, LIS, CS – Developer, implementer, systems librarian– D. Librarian (reference, coll. development)

• Core• Tracks

– Libraries: public, school/univ., corporation– Cultural heritage– Science (research, education)– Persons (PIM)

33

Knowledge Society

HCI

Visualization

Knowledge Management

Systems Analysis & Design

Programming

Database

Algorithms

Architecture

Net-Centricity

Intelligent Systems

Social & Ethical

Library Information Science

Simulation

Chemistry

Biology

Communi-

cations

Healthcare

Art

Music

Marketing

Finance

Modeling

Engineering

Sociology

Psychology

Physics

Architecture

History

Political Science

Geography

Knowledge Society

HCI

Visualization

Knowledge

Systems Analysis & Design

Database

Algorithms

Intelligent Systems

Social & Ethical

Library & Information Science

Economics

Simulation

Chemistry

Biology

Healthcare

Art

Music

Marketing

Finance

Engineering

Sociology

Psychology

Physics

Architecture

History

Political Science

Geography

English

Math

Living In the KnowlEdge Society (LIKES):Core surrounded by enabling computing concepts

and problem providing disciplines

34

DL Curricula

• “Curriculum Development for Digital Libraries” – NSF grant to VT, UNC-CH

• Studied body of literature

• Modules: core, related

• Invite collaboration worldwide

35

Digital Librarian:Needed Skills and Knowledge

• Choi, Y., & Rasmussen, E. (2006)

• What is needed to educate future digital librarians: A study of current practice and staffing patterns in academic and research libraries.

• D-Lib Magazine, 12(9)

• doi:10.1045/september2006-choi.

36

D.Librarian Skills & Knowledge:Technology Related

• DL architecture and software

• Technical and quality standards

• Web markup languages

• Database development and DBMS

• Web design skills

37

D.Librarian Skills & Knowledge:Library Related

• The needs of users

• Digital archiving and preservation

• Cataloging, metadata

• Indexing

• Collection development

38

D.Librarian Skills & Knowledge:Other

• Communication and interpersonal skills

• Project management and leadership skills

• Legal issues

• Grant/proposal writing skills

• Teaching and group presentation skills

39

Development & Evaluation Process

· From research team (VT & UNC)

· From current courses at VT & UNC

· From Advisory Board· From CC 2001

Vision/plan

Design

· Modules· Lessons

· Inspection by Advisory Board

· Inspection by external experts

· Inspection by Doctoral Consortium participants

Evaluate

· CC 2001 context· Curricular needs· Student background

Analyze· Specific strengths· Specific weaknesses

Feedback

· At UNC & VT· At additional universities

(in CS & LIS programs)

Revise & Implement

· Modules ready for use

· Lessons ready for use

Products

· Teacher perceptions· Student perceptions· Student outcomes

Evaluatein the field

40

Curriculum framework

41

Modules

1. Collection Development2. Digital objects / Composites / Packages3. Metadata, Cataloging, Author submission4. Architecture, Interoperability5. Data visualization6. Services7. Intellectual property rights management,

Privacy, Protection8. Social issues / Future of DLs9. Archiving and Preservation

42

Conference papers x modules

0

20

40

60

80

100

120

140

160

180

200

1 2 3 4 5 6 7 8 9

Module ID

Nu

mb

er

of

con

fere

nce

pa

per

s

JCDL 05

JCDL 04

JCDL 03

JCDL 02

JCDL 01

ACM DL 00

ACM DL 99

ACM DL 98

ACM DL 97

ACM DL 96

43

Taxonomy of DL Educational Resources

44

2-a: Collection development/selection policies2-b: Digitization

3-a: Text resources3-b: Multimedia3-c (8-b): File formats, transformation, migration

4-a: Metadata, cataloging, metadata markup, metadata harvesting4-b: Ontologies, classification, categorization4-c: Vocabulary control, thesauri, terminologies

4-d: Subject description4-e: Information architecture (e.g., hypertext, hypermedia)4-f: Object description and organization for a specific domain

5-a: Architecture overviews/models5-b: Applications5-c: Identifiers, handles, DOI, PURL

6-a: Info needs, relevance, evaluation6-b: Search strategy, info seeking behavior, user modeling

8-a: Repositories, archives, storage8-b (3-c): File formats, transformation, migration

9-a: Project management9-b: DL case studies9-c: DL evaluation9-d: Usability assessment, user studies

9-e: Bibliometrics, Webometrics9-f: Legal issues (e.g., copyright)9-g: Cost/economic issues9-h: Social issues

10-a: Future of DLs10-b: Education for digital librarians

Digital Objects3

Collection Development

2

Overview1

Architecture (agents, mediators)

5

CORE TOPICS

DL education and research

10

7-a: Search engines, IR, indexing methods7-b: Reference services7-c: Recommender systems

5-d: Protocols5-e: Interoperability5-f: Security

2-c: Harvesting2-d: Document and e-publishing/presentation markup

6-c: Sharing, networking, interchange (e.g., social)6-d: Interaction design, info summarization and visualization, usability assessment

User Behavior/ Interactions

6

7-d: Routing, community filtering7-e: Web publishing (e.g., wiki, rss, Moodle, etc.)Services7

8-c: Sustainability

Management and Evaluation

9

Archiving and Preservation

Integrity8

1-a (10-c): Conceptual frameworks, theories

10-c (1-a): Conceptual framework, theories10-d: DL research initiatives

Info/ Knowledge Organization

4

45

Overview1

1-a (10-c): Conceptual frameworks, theories

46

Collection Development

2

2-a: Collection development/selection policies2-b: Digitization

2-c: Harvesting2-d: Document and e-publishing/presentation markup

47

Digital Objects3

3-a: Text resources3-b: Multimedia3-c (8-b): File formats, transformation, migration

48

Info/ Knowledge Organization

4

4-a: Metadata, cataloging, metadata markup, metadata harvesting4-b: Ontologies, classification, categorization4-c: Vocabulary control, thesauri, terminologies

4-d: Subject description4-e: Information architecture (e.g., hypertext, hypermedia)4-f: Object description and organization for a specific domain

49

Architecture (agents, mediators)

5

5-a: Architecture overviews/models5-b: Applications5-c: Identifiers, handles, DOI, PURL5-d: Protocols5-e: Interoperability5-f: Security

50

User Behavior/ Interactions

6

6-a: Info needs, relevance, evaluation6-b: Search strategy, info seeking behavior, user modeling

6-c: Sharing, networking, interchange (e.g., social)6-d: Interaction design, info summarization and visualization, usability assessment

51

Services7

7-d: Routing, community filtering7-e: Web publishing (e.g., wiki, rss, Moodle, etc.)

7-a: Search engines, IR, indexing methods7-b: Reference services7-c: Recommender systems

52

Archiving and Preservation

Integrity8

8-a: Repositories, archives, storage8-b (3-c): File formats, transformation, migration

8-c: Sustainability

53

Management and Evaluation

9

9-a: Project management9-b: DL case studies9-c: DL evaluation9-d: Usability assessment, user studies9-e: Bibliometrics, Webometrics9-f: Legal issues (e.g., copyright)9-g: Cost/economic issues9-h: Social issues

54

DL education and research

10

10-a: Future of DLs10-b: Education for digital librarians

10-c (1-a): Conceptual framework, theories10-d: DL research initiatives

55

Personalizing A Course Website Using the NSDL

William Cameron2, Boots Cassel2, Edward Fox1, Manuel Perez-Quinones1, Manas

Tungare1, Xiaoyan Yu1

Virginia Tech1, Villanova2

56

Syllabus Collection …Towards an intelligent educational system

Unstructured Syllabus Text

StructuredSyllabus

Text

SearcherRecommender

Crawler

SyllabusClassifier

Extractor

Editor

SyllabusOntology

Services

Publisher

Other NSDL

Resources

Potential Syllabus

Text

Classification Scheme

ResourceClassifier

57

Syllabus Ontology

• Standard, machine understandable

• Ontology Editor: Protégé

• Syllabus Schema: SylVia

• http://doc.cs.vt.edu/ontologies/

58

Creating new syllabus

• Web-based application to support entry of syllabi into collection

• Moodle Plug-in in the works

• Uses CC 2001 to select topics for a course

59

Example: CBIR + SI

• Integration of– CBIR– Superimposed information (annotations …)

• Application to– Biodiversity, fisheries and wildlife– Archaeology

• Systems– CBISC, SIMPEL, SIERRA

60

EKEY: The electronic key for identifying freshwater fishes

61

Biodiversity Information Systems

• Retrieve fish descriptions of all fish whose shape is similar to that shown in Figure below, which belong to genus “Notropis”, which have “large eyes” and “dorsal stripe”, and have been observed within the catchments of the “Tennessee” river

62

Here is another scenario …• An archeologist wants to write

commentaries on artifacts discovered in the field

– Manually annotate images (and parts)

– Search for images (and parts), and annotations

– Automatically annotate/tag similar images (and parts)

– Share annotations and images

• Using an Archeology digital library in his study, he wants to be able to:

Sources: http://www.dorsetforyou.com, http://www.archaeology.org

Source: http://www.bewegende-plaatjes.net

63

Functionality required

• Digital Library (DL) users need, but get little assistance, regarding tasks:– Selecting and Annotating images and parts of

images• Preserve original context of information• Manual and automated annotation

– Content-based image retrieval of images and parts of images

– Combined text- and content-based image retrieval of images and parts of images

– Share selections and annotations

64

Layers in an SI system

Superimposed

Layer

Base Layer

Information Source1

Information Source2

Information Sourcen

marks

* Source: ICDE04 presentation by Murthy, et. al

65

Superimposed Applications

SIMPEL: A SuperImposed Multimedia Presentation Editor and pLayer

0 5 10 15 20

A

C

B

Enhanced CMapTools

66

Content-Based Image Retrieval (CBIR)

• Retrieve images similar to a user-defined specification or pattern (e.g., shape sketch, image example)

• Goal: To support image retrieval based on content properties (e.g., shape, color or texture), usually encoded into feature vectors

67

Effective Image Descriptor

Feature Vector

Image descriptors

• Image Descriptor

69

Example: Histogram

Image

Corresponding histogram

• Frequency count of each individual color

• Most commonly used color feature representation

Source: Andrade, D.

70

Texture Descriptors

71

A typical CBIR systemInterface

Query Specification Visualization

Image Database

Ranking

Similarity ComputationQuery-processing

Module

Query Pattern Similar Images

Feature VectorExtraction

FeatureVectors

Images

Data Insertion

72

CBISC Architecture

73

CBISC in ETANA

74

SIERRA

• A tool that allows users to select parts of images and associate them with text annotations.

• Performs information retrieval as annotations and associated marks in two ways, either for:– images or marks similar (in content) to a

specified image or mark– annotations containing specified query terms

75

Annotating an image

76

Searching over annotations

77

Searching over images/sub-images

78

Theory

79

Informal 5S & DL Definitions

DLs are complex systems that

• help satisfy info needs of users (societies)

• provide info services (scenarios)

• organize info in usable ways (structures)

• present info in usable ways (spaces)

• communicate info with users (streams)

80

5Ss

Ss Examples Objectives

Streams Text; video; audio; image Describes properties of the DL content such as encoding and language for textual material or particular forms of multimedia data

Structures Collection; catalog; hypertext; document; metadata

Specifies organizational aspects of the DL content

Spaces Measure; measurable, topological, vector, probabilistic

Defines logical and presentational views of several DL components

Scenarios Searching, browsing, recommending

Details the behavior of DL services

Societies Service managers, learners, teachers, etc.

Defines managers, responsible for running DL services; actors, that use those services; and relationships among them

81

5S

structures (d.10)streams (d.9) spaces (d.18) scenarios (d.21) societies (d. 24)

structural metadataspecification(d.25)

descriptive metadataspecification(d.26)

repository(d. 33)

collection (d. 31)

(d.34)indexingservice

structured stream (d.29)

digitalobject (d.30)

metadata catalog (d.32)

browsingservice

(d.37)

searchingservice (d.35)

digital library(minimal) (d. 38)

services (d.22)

sequence (d. 3)

graph (d. 6)function (d. 2)

measurable(d.12), measure(d.13), probability (d.14), vector (d.15), topological (d.16) spaces

event (d.10)state (d. 18)

hypertext(d.36)

sequence (d. 3)

transmission(d.23)

relation (d. 1) language (d.5)

grammar (d. 7)

tuple (d. 4)*

5S and DL formal definitions and compositions (April 2004 TOIS)

82

5SL – The Minimal DL Metamodel

Index

Actor

Search Manager

Index Manager

Document

Collection Catalog

Metadata

Service

Manager

Interface Manager

Community

Event

Scenario

Service

Browsing Manager

User

Interface

Scenarios (Meta-) Model

Spatial

(Meta-) Model

Meta-Models

Meta-ModelsPrimitives

Stream

(Meta-)ModelStructural (Meta-) Model

Text AudioVideo Image

Societal (Meta-) Model

Retrieval

Model

uses

runs

receiver

Repository Manager

83

Streams

text

audio

image

video digitalobject

Repository

CollectionCatalog

describes

stores

is_version_of/ cites/links_to

Index

Service

Scenario

event

extends

reuses

ServiceManager

Actor

operationexecutes

participates_in

recipient

runs

Scenarios

Societies

inherits_from/includes

association

uses

Topological

ProbabilisticMetric

Measurable

Measure

describes

employsproduces

employsproduces

employs

produces

Structures

Spaces

Vector

contains

metadata specifications

is_a is_a

precedes

happens_before

is_a

redefinesinvokes

contains

contains

84

Browsing Collaborating Customizing Filtering Providing access Recommending Requesting Searching Visualizing

Annotating Classifying Clustering Evaluating Extracting Indexing

Measuring Publicizing

Rating Reviewing (peer)

Surveying Translating

(language)

Conserving Converting

Copying/Replicating Emulating Renewing

Translating (format)

Acquiring Cataloging

Crawling (focused) Describing Digitizing

Federating Harvesting Purchasing Submitting

Preservational Creational

Add Value

Repository-Building

Information Satisfaction

Services

Infrastructure Services

85

Ontology: Applications

86

SearchingBrowsing

queryanchor

Society

actor

Collection, {digital object}

Recommending Filtering Binding Visualizing Expanding query

user model query/category {digital object}

{digital object} {digital object}

binder

InformationSatisfaction Services

space query’

fundamental

Rating Training

Infrastructure

Services (Add_Value)

composite

Requesting

handle

p pp

e e e{(digital object, actor, rate) }

p

e

e

p p p p p

e e

classifier

e ee e

e

p

e

Indexing

Index

p

e

transformer

e

87

Requirements Analysis Design Implementation Test

5S 5SLOO ClassesWorkflow Components

DLEvaluation

5SGraph 5SLGenFormalTheory/Metamodel

DL XMLLog

88

Digital Object

RepositoryCollection Minimal DL

Metadata Catalog

Descriptive Metadata

Specification

A Minimal DL in the 5S Framework

Structural Metadata

Specification

Streams Structures Spaces Scenarios Societies

indexing

browsing searching

services

hypertext

Structured Stream

89

Streams Structures Spaces Scenarios Societies

indexing

browsing searching

services

hypertext

Structured Stream

Descriptive Metadata

specification

SpaTemOrg

StraDia

Arch Descriptive Metadata specification

ArchDO

ArchObj

ArchColl

Arch Metadata catalog

ArchDColl ArchDR Minimal ArchDL

A Minimal ArchDL in the 5S Framework

90

Tools/Applications

5S MetaModel

5SGraphDL

Expert

DL Designer

5SL DL

Model

5SLGen

Practitioner

Researcher

TailoredDL

Teacher

componentpool

ODLSearch,ODLBrowse,ODLRate,ODLReview,

…….

Logging ModuleXMLLog

91

5SGen – Version 2: ODL, Services, Scenarios

5SL-SocietiesModel (1)

XPATH/JDOMTransform (2)

XMI:ClassModel (3)

Xmi2Java (4)

JavaClasses

Model (5)

superclass

DeterministicFSM (10)

SMC (11)

JavaFinite

State MachineClass

Controller (12)

5SL-ScenarioModel (6)

XPath/JDOMTransform (7)

StateChartModel (8)

Scenario Synthesis (9)

ODLSearch

Java

Wrapping

import

ComponentPool

ODLBrowse

Java

Wrapping

import

.

.

.

JSPUser

InterfaceView (13)

Generated DL Services

DLDesigner

DLDesigner

binds

5SLGen

5SL-SocietiesModel (1)

XPATH/JDOMTransform (2)

XMI:ClassModel (3)

Xmi2Java (4)

JavaClasses

Model (5)

superclass

DeterministicFSM (10)

SMC (11)

JavaFinite

State MachineClass

Controller (12)

5SL-ScenarioModel (6)

XPath/JDOMTransform (7)

StateChartModel (8)

Scenario Synthesis (9)

ODLSearch

Java

Wrapping

import

ComponentPool

ODLBrowse

Java

Wrapping

import

.

.

.

ODLSearch

Java

Wrapping

import

ComponentPool

ODLBrowse

Java

Wrapping

import

.

.

.

JSPUser

InterfaceView (13)

Generated DL Services

DLDesigner

DLDesigner

binds

5SLGen

92

5SGraph

Workspace

(instance model)

Structured

toolbox

(metamodel)

93

94

Information model

95

96

Formal Definition of DL Integration

• DLi=(Ri, DMi, Servi, Soci), 1 i n– Ri is a network accessible repository– DMi is a set of metadata catalogs for all collections– Servi is a set of services– Soci is a society

• UnionRep• UnionCat• UnionServices• UnionSociety

• Given n individual libraries, integrate the n DLs to create a UnionDL.

97

Taxonomy of Union Services

Infrastructure Services Information Satisfaction Services

Essential Add_Vaue Essential Add_value

indexing

harvesting

mapping

(Schema registry with analyses & mapping)

(data) cleaning

(focused) crawling

copying (replicating)

logging

(format) translating

(Service to support annotation)

(Metadata validation)

searching

browsing

access control

binding

comparison

(forum) discussion

(query) expansion

filtering

recommendation

visualization

Note: Suggested NSDL services are shown in blue.

98

Union Catalog Integration

VN MetadataFormat

Global MetadataFormat

VNCatalog

HDCatalog

Union Catalog

MappingTool

Wrapper

MappingTool

Wrapper

HD MetadataFormat

Virtual Nimrin(VN)

Halif DigMaster(HD)

Union ArchDL

99

local schema global schema

100

5SQual Tool

Implementing a Tool Aimed

at Automatic Quality Assessment in Digital Libraries

Bárbara Lagoeiro Moreira

101

Quality Base Model

Catalog

Collection

Digital Object

Metadata

Repository

Services

• Accuracy• Completeness• Conformance

• Completeness• Impact Factor

• Completeness• Consistency

• Accessibility• Pertinence• Preservability• Relevance

• Similarity• Significance• Timeliness

• Completeness• Consistency

• Composability• Efficiency• Effectiviness• Extensibility

• Reusability• Reliability

Numeric Indicators

102

satisfaction

performanceexpectancy

(PE)

behavioralIntention to

(re)use

relevance adequacy timeliness

reliability understandability scope

accessibilityease of use

joy of use reliability

userinterface

social influence (SI)

information quality(IQ)

system quality(SQ)

DL Success Model

system quality(SQ)

103

Systems

104

DL Manifesto - 1

• DL Reference Model• In support of the future European Digital Library• Developed by team connected with DELOS

(Candela, Casteli, Ioannidis, Koutrica, Meghini, Pagano, Ross, Schek, Schuldt)

• Draft 2.2 presented in Frescati, near Rome, June 2006 – 79 pages

• Could be integrated with work of DLF, JISC, etc.

105

DL Manifesto – 2: 3 Tiers

106

DL Manifesto – 3: Main Concepts

107

DL Manifesto – 4: Actor Roles

108

109

SIMILEObjectives, Current Status,

and DemonstrationStephen J. Garland, MIT CSAIL

Mick Bass, HP Labs

DSpace User Group Meeting

Cambridge, MA

March 11, 2004

110

Simile Goals

• Make the Semantic Web a reality– For libraries and their users– Support heterogeneous, multi-community metadata– Provide tools for viewing, browsing, searching

• Assess current state of Semantic Web– Explore utility of standards (RDF, RDFS, OWL)– Extend Semantic Web tool stack for libraries– Identify issues, gaps, opportunities, best practices

for digital libraries

111

What is Fedora™?

• Slides courtesy Vinod Chachra of VTLS

Flexible Extensible Digital Object Repository Architecture

112

Fedora™Repository

E x ter n a lC o n ten tS o u r c e

E x ter n a lC o n ten tS o u r c e

HT

TP

E x ter n a l C o n ten tR etr iev er

X M L F ile s

Re la t io n a l D B

S e s s io n M a n a g e me n tU s e r A u th e n t ic a t io n

P o l icies

U s ers /G ro u p s

H T T P

F T P

D atas tr eam s

D ig ita l O b jec tsS to rag e S u b s ys te m

S e c u rityS u b s ys te m

W e b Se r vi c eE xpo s ur eL aye r

SO

AP

R em o teS er v ic e

L o c alS er v ic e

M an ag e A c c e s s S e arc h O A I P ro v id e r

M an ag e m e n tS u b s ys te m

A c c e s sS u b s ys te m

HT

TP

FT

P

H T T PH T T P S O A P H T T P S O A P H T T P S O A P

C lie n tA pplica t io n

B a tchPro g ra m

S e rv e rA pplica t io n

W e bB ro ws e r

Co mp o n e n t M g mt

O b je c t M g mt

O b je c t Va lid a t io n

P ID Ge n e ra t io n

O b je c t D is s e min a t io n

O b je c t Re fle c t io n

P o lic y En fo rc e me n t

P o lic y M g mt

Co n te n t

Web Service Web Service Exposure Exposure LayerLayer

Adapted from Slide by V. Chachra, VTLS

113

VITAL / Fedora Relationship

114

OCKHAM Library Network

NSDL

OCKHAM

Services

NSDLServices

Teachers LearnersLibrarians

OCKHAMLibrary

Network

LibraryServices

115

OCKHAM

• Simplicity (a la OCCAM’s razor)

• Support by Mellon and DLF

• Four main ideas:

1. Components

2. Lightweight protocols

3. Open reference models (e.g., 5S, OAIS)

4. Community perspective and involvement

• Funded by NSF in NSDL, with P2P

116

Summary

• Acknowledgments• Introduction• Proposals• Projects• Systems• Theory• Curricula• Examples• Summary• Discussion

117

Questions?Comments?

See http://fox.cs.vt.edu/talks/