ADT Program Workshop Meeting (Sydney, Australia – September 26, 2005)

“ADT and the Future of NDLTD”

Edward A. Fox
Executive Director, NDLTD
[email protected]
Virginia Tech
Blacksburg, VA 24061 USA
http://fox.cs.vt.edu
Mobile: +1-540-230-6266


Outline

• Acknowledgements

• Digital Libraries

• NDLTD

• Recent and Future Work

• Conclusions

Acknowledgements

• All those working with ETDs

• NDLTD, including Board, Committees, Members, and Associate Members

• ETD 2005 Conference Team

• ADT, including Committees and Members

• Faculty, Staff, Students at Virginia Tech and at institutions collaborating with us

• Sponsors, including SURA, FIPSE, NSF, IBM, Microsoft, VTLS, Adobe, …

Digital Libraries (DLs) -- Objectives

• World Lit.: 24hr / 7day / from desktop
• Ubiquitous
• Integrated “super” information systems
• Usable, Useful
• Higher Quality, Lower Cost
• Education, Knowledge Sharing, Discovery
• Disintermediation -> Collaboration
• Universities Reclaim Property
• Interactive Courseware, Student Works

Digital Library Content

[Figure: content types in a digital library: Articles, Reports, Books; Text Documents; Speech, Music; Video, Audio; (Aerial) Photos; Geographic Information; Models, Simulations; Software, Programs; Genome (human, animal, plant); Bio Information; 2D, 3D, VR, CAT; Images and Graphics.]

DL Overview: Why of Global Interest?

• National projects can preserve antiquities and heritage: cultural, historical, linguistic, scholarly

• Knowledge and information are essential to economic and technological growth, education

• DL - a domain for international collaboration
– wherein all can contribute and benefit
– which leverages investment in networking
– which provides useful content on Internet & WWW
– which will tie nations and peoples together more strongly and through deeper understanding

Synchronous Scholarly Communication

Same time, same or different place

Asynchronous, Digital Library Mediated Scholarly Communication

Different time and/or place

Information Life Cycle

[Figure: the information life cycle. Stages shown: Authoring / Modifying; Organizing / Indexing; Storing / Retrieving; Distributing / Networking; Accessing / Filtering; Using / Creating; Retention / Mining.]

DL Challenges

• Preservation - so people will trust DLs

• Scalability, sustainability, interoperability

• Well-trained, confident, digital librarians

• DL community-oriented industry -- critical mass through covering libraries, archives, museums, corporate info, govt info, personal info -> “quality WWW” integrating IR, HT, MM, …

• Quality: content, access, services, …

Quality and the Information Life Cycle

[Figure: the information life cycle annotated with quality indicators. Life-cycle labels: Creation, Authoring, Modifying, Describing, Organizing, Indexing, Storing, Archiving, Networking, Distribution, Seeking, Searching, Browsing, Recommending, Accessing, Filtering, Utilization, Retention / Mining, Discard. States: Active, Semi-Active, Inactive. Quality indicators: Accuracy, Completeness, Conformance, Significance, Similarity, Pertinence, Relevance, Timeliness, Accessibility, Preservability.]

DL Curriculum Framework

COURSE STRUCTURE
• Semester 1: DL collections: development/creation
• Semester 2: DL services and sustainability

CORE DL TOPICS
• Digitization, Storage, Interchange
• Digital objects, Composites, Packages
• Metadata, Cataloging, Author submission
• Naming, Repositories, Archives
• Spaces (conceptual, geographic, 2/3D, VR)
• Architectures (agents, buses, wrappers/mediators), Interoperability
• Services (searching, linking, browsing, etc.)
• Intellectual property rights mgmt., Privacy, Protection (watermarking)
• Archiving and preservation, Integrity

RELATED TOPICS
• Documents, E-publishing, Markup
• Info. Needs, Relevance, Evaluation, Effectiveness
• Thesauri, Ontologies, Classification, Categorization
• Bibliographic information, Bibliometrics, Citations
• Routing, Filtering, Community filtering
• Search & search strategy, Info seeking behavior, User modeling, Feedback
• Info summarization, Visualization
• Multimedia streams/structures, Capture/representation, Compression/coding
• Content-based analysis, Multimedia indexing
• Multimedia presentation, rendering

Outline

• Acknowledgements

• Digital Libraries

• NDLTD

• Recent and Future Work

• Conclusions


NDLTD Incorporation

• Networked Digital Library of Theses and Dissertations incorporated May 20, 2003 in Virginia, USA

• Charitable and educational purposes (501 c 3)

• Officers– Executive Director (Ed Fox)– Secretary (Gail McMillan)– Treasurer (Scott Eldredge)

Board of Directors

• Suzie Allard (ETD 2004, U. Kentucky)
• Denise A. D. Bedford (World Bank)
• Julia C. Blixrud (ARL, SPARC)
• José Luis Borbinha (Natl Lib Portugal)
• Alex Byrne (ETD 2005, ADT: Australia)
• Tony Cargnelutti (ETD 2005, Australia)
• Vinod Chachra (VTLS)
• Susan Copeland (RGU, UK)
• Jude Edminster (Bowling Green St. U.)
• Scott Eldredge (Treasurer, ETD 2002, BYU)
• Edward A. Fox (Exec Director, Virginia Tech)
• John H. Hagen (West Virginia U.)
• Thomas B. Hickey (OCLC)
• Christine Jewell (U. Waterloo, Canada)
• Delphine Lewis (ProQuest)
• Joan K. Lippincott (CNI)
• Mike Looney (Adobe)
• Gail McMillan (Secretary, Virginia Tech)
• Joseph Moxley (ETD 2000, USF)
• Eva Müller (U. Uppsala, Sweden)
• Ana Pavani (PUC Rio, Brazil)
• Axel Plathe (UNESCO, Paris)
• Sharon Reeves (National Library Canada)
• Peter Schirmbacher (ETD 2003, Humboldt)
• Hussein Suleman (U. Cape Town, S. Africa)
• Shalini R. Urs (U. Mysore, India)
• Eric F. Van de Velde (ETD 2001, Caltech)

NDLTD Committees (Chairs)

• Awards (John Hagen)
• Conferences (Tony Cargnelutti)
• Development (Peter Schirmbacher)
• Executive (Edward Fox)
• Finance (Scott Eldredge)
• Implementation (Ana Pavani)
• Membership (Shalini Urs)
• Nominating (Sharon Reeves)
• Standards (Thomas B. Hickey)
• Union Catalog (Vinod Chachra)

Selected Projects / Sponsors

• Australia (ADT)
• Brazil (BDT, IBICT)
• Canada
• Catalunya
• Chile (Cybertesis)
• Germany
• India (Vidyanidhi)
• Korea
• OhioLINK: 79 colleges/univs
• Portugal (National Library)
• South Africa
• UK (British Library, JISC, Edinburgh, …)
• UNESCO (especially Latin America, Eastern Europe, Africa)
• Venezuela

Some Countries

• Australia
• Belgium
• Brazil
• Canada
• Chile
• China, Hong Kong
• Colombia
• Finland
• France
• Germany
• Greece
• India
• Italy
• Jamaica
• Korea
• Lithuania
• Malaysia
• Mexico
• Namibia
• Netherlands
• Norway
• Poland
• Russia
• Singapore
• S. Africa
• S. Korea
• Spain
• Sudan
• Sweden
• Switzerland
• Taiwan
• Thailand
• Turkey
• UK
• USA
• Venezuela
• Yugoslavia

Some Institutional Members

• British Library
• Cinemedia
• Coalition for Networked Information (CNI)
• Committee on Institutional Cooperation (CIC)
• Consorci de Biblioteques Universitàries de Catalunya
• Diplomica.com
• Dissertation.com
• Dissertationen Online (Germany)
• ETDweb, a Division of Answer4.com
• Ibero-American Science & Technology Education Consortium (ISTEC)
• National Documentation Centre (NDC), Greece
• National Library of Portugal (for all universities)
• OCLC Online Computer Library Center
• OhioLINK
• Organization of American States (SEDI/OAS)
• Southeastern Library Network (SOLINET)
• UNESCO (www.unesco.org/webworld/etd)

UNESCO and ETDs (by Axel Plathe at ETD2003)

• Promoting the use of the Internet as a tool for disseminating scientific knowledge
• Facilitating the transfer of ETD expertise from developed to developing countries

• 1998: Member of the NDLTD Steering Committee
• 1999: First UNESCO ETD meeting on ETD internationalisation
• 2002: “UNESCO Guide to Electronic Theses and Dissertations”
• 2003: Model training programmes and training courses
• 2003: Sponsor pilot projects
• 2003: Pilot projects (Africa, Europe, Latin-America)

Union catalog: OCLC

• OCLC will expand OAI data provider on TDs.

• Is getting data from WorldCat (so, from many sites!).

• Will harvest from all others who contact them.

• Need DC and either ETD-MS or MARC.

• Has a set for ETDs.
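The metadata requirement above (DC plus either ETD-MS or MARC) can be checked against a repository's OAI-PMH `ListMetadataFormats` response. The following is a minimal sketch, not OCLC's procedure: the sample response is made up for illustration, and the prefixes `oai_etdms` and `marc21` are assumed names for ETD-MS and MARC (a given repository may advertise others). Only `oai_dc` is fixed by the OAI-PMH specification.

```python
import xml.etree.ElementTree as ET

# Namespace used by OAI-PMH 2.0 responses.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Hypothetical ListMetadataFormats response, for illustration only.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListMetadataFormats>
    <metadataFormat><metadataPrefix>oai_dc</metadataPrefix></metadataFormat>
    <metadataFormat><metadataPrefix>oai_etdms</metadataPrefix></metadataFormat>
  </ListMetadataFormats>
</OAI-PMH>"""

# Assumed prefixes for the richer formats (ETD-MS or MARC).
DETAIL_PREFIXES = {"oai_etdms", "marc21"}


def advertised_prefixes(xml_text):
    """Return the set of metadataPrefix values found in a response."""
    root = ET.fromstring(xml_text)
    return {
        el.text.strip()
        for el in root.findall(".//oai:metadataPrefix", OAI_NS)
        if el.text
    }


def meets_requirement(prefixes):
    """True if Dublin Core plus at least one richer format is offered."""
    return "oai_dc" in prefixes and bool(prefixes & DETAIL_PREFIXES)


if __name__ == "__main__":
    print(meets_requirement(advertised_prefixes(SAMPLE_RESPONSE)))
```

A repository that offers only `oai_dc` would fail this check under the stated assumptions.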


OCLC SRU Interface

ETD Union Search Mirror Site in China (CALIS) (http://ndltd.calis.edu.cn – popular site!)

VTLS Service’s Support of Content Languages

The VTLS service for NDLTD supports different languages. These are:
• English
• German
• Greek
• Korean
• Portuguese
• Others – which are most needed ???

ETDs: Library Goals

• Improve library services
– Better turn-around time
– Always available
• Reduce work
– Catalog from e-text
– Eliminate handling: mailing to ProQuest, bindery prep, check-out, check-in, reshelving, etc.
• Save space

Why ETD? Short Answer

• For Students:
– Gain knowledge and skills for the Information Age
– Richer communication (digital information, multimedia, …)
• For Universities:
– Easy way to enter the digital library field and benefit thereby
• For the World:
– Global digital library – large, useful, many services
• General:
– Save time and money
– Increased visibility for all associated with research results

What are we doing?

• Aiding universities to enhance graduate education, publishing and IPR efforts
• Helping improve the availability and content of theses and dissertations
• Educating ALL future scholars so they can publish electronically and effectively use digital libraries (i.e., are Information Literate and can be more expressive)

Outline

• Acknowledgements

• Digital Libraries

• NDLTD

• Recent and Future Work

• Conclusions

Recent and Future Work

• OAI

• Quality Analysis

• Preservation (LOCKSS)

• Automatic Classification to Identify Sub-collections and Categories

• Logging Standards

• Other Services and Enhancements


OAI - Open Archives Initiative

• Advocacy for interoperability

• Standard for transferring metadata among digital libraries
– Protocol for Metadata Harvesting (PMH)
  • Simplicity
  • Generality
  • Extensibility

• Support for PMH => Open Archive (OA)
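To make the harvesting idea concrete, here is a minimal sketch of a PMH harvester that pages through `ListRecords` responses using resumption tokens. The verb and argument names (`ListRecords`, `metadataPrefix`, `set`, `resumptionToken`) come from the OAI-PMH 2.0 specification; the base URL, the set name, and the `fetch` callable (which takes a URL and returns the response XML as text) are assumptions for illustration, not part of any NDLTD service.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def list_records_url(base_url, metadata_prefix="oai_dc",
                     set_spec=None, resumption_token=None):
    """Build a ListRecords request URL.

    A resumptionToken is an exclusive argument in OAI-PMH: when it is
    present, only the verb and the token are sent.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def harvest_identifiers(fetch, base_url, set_spec=None):
    """Yield record identifiers, following resumption tokens until done.

    `fetch` is a caller-supplied function (hypothetical here) that
    performs the HTTP GET and returns the XML body as a string.
    """
    token = None
    while True:
        url = list_records_url(base_url, set_spec=set_spec,
                               resumption_token=token)
        root = ET.fromstring(fetch(url))
        for ident in root.findall(".//oai:header/oai:identifier", OAI_NS):
            yield ident.text
        token_el = root.find(".//oai:resumptionToken", OAI_NS)
        has_token = token_el is not None and token_el.text
        token = token_el.text.strip() if has_token else None
        if not token:  # missing or empty token marks the last page
            break
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular HTTP library and lets the paging logic be exercised without a network connection.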

OAI – Black Box Perspective

[Figure: black-box diagram of seven open archives, OA 1 through OA 7.]

The World According to OAI

[Figure: Service Providers (Discovery, Current Awareness, Preservation) and Data Providers, linked by metadata harvesting.]

Metadata Specifications and Metadata Format: Conformance

• Based on ETD-MS

[Figure: chart of conformance scores per archive, axis from 0.75 to 1. Archives shown: GWUD, LSU, VTETD, MIT, UBC, PHYSNET, VTINDIV, VANDERBILT, NCSU, USASK, PITT, HKU, HUMBOLT, OCLC, BGMYU, DRESDEN, VIENNA, GATECH, ETSU, USF, MUENCHEN, UTENN, CCSD, WATERLOO, NSYSU, LAVAL, UPSALLA, CALTECH, UCL, WagUniv.]

LOCKSS

• Lots of copies keep stuff safe
• Stanford (Vicky Reich)
• Initial focus on lower levels
• Initial content: journals
• Emory (Martin Halbert)
– Help deploy and adapt
– Help apply in other contexts
• Another registry
• Set of publisher manifests (information providers)
• Set of storage systems (archival storage)
– NDIIPP: AmericanSouth, MetaArchive

NDLTD Sub-collections, PACS Automatic Classification

XML-based DL Log Standard

• Log analysis
– is a source of information on:
  • How patrons really use DL services
  • How systems behave while supporting user information seeking activities
• Used to:
– Evaluate and enhance services
– Guide allocation of resources
• Common practice in the web setting
– Supported by web servers, proxy caches
• DL Logging can be more detailed

The XML Log Format

[Figure: element tree of the XML log format. Elements shown: Log, Transaction, SessionId, MachineInfo, Statement, Timestamp, SessionInfo, RegisterInfo, Event, Action, Search, Browse, Update, StoreSysInfo, SearchBy, QueryString, Catalog, Collection, PresentationInfo, StatusInfo, Timeout.]
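As a rough illustration of how such a log might be written and then analyzed, the sketch below builds a small log with Python's standard `xml.etree.ElementTree` and tallies actions by type. The element names are taken from the slide, but the nesting used here (Log > Transaction > Statement > Event > Action > Search or Browse) is an assumption made for the example and may differ from the actual standard; the session IDs, host names, and field values are invented.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def add_transaction(log, session_id, machine, timestamp, action_name, **fields):
    """Append one Transaction with a single Event/Action to the log.

    The nesting is an assumed layout, not the published schema.
    """
    tx = ET.SubElement(log, "Transaction")
    ET.SubElement(tx, "SessionId").text = session_id
    ET.SubElement(tx, "MachineInfo").text = machine
    ET.SubElement(tx, "Timestamp").text = timestamp
    statement = ET.SubElement(tx, "Statement")
    event = ET.SubElement(statement, "Event")
    action = ET.SubElement(event, "Action")
    kind = ET.SubElement(action, action_name)  # e.g. Search or Browse
    for name, value in fields.items():         # e.g. SearchBy, QueryString
        ET.SubElement(kind, name).text = value
    return tx


def count_actions(log):
    """Count logged actions by type (the tag directly under each Action)."""
    counts = Counter()
    for action in log.iter("Action"):
        for kind in action:
            counts[kind.tag] += 1
    return counts


def build_sample_log():
    """Build a three-entry log with invented values, for illustration."""
    log = ET.Element("Log")
    add_transaction(log, "s1", "host-a", "2005-09-26T10:00:00", "Search",
                    SearchBy="title", QueryString="digital libraries",
                    Collection="ETD")
    add_transaction(log, "s1", "host-a", "2005-09-26T10:01:00", "Browse",
                    Collection="ETD")
    add_transaction(log, "s2", "host-b", "2005-09-26T10:02:00", "Search",
                    SearchBy="author", QueryString="fox", Collection="ETD")
    return log


if __name__ == "__main__":
    sample = build_sample_log()
    print(ET.tostring(sample, encoding="unicode"))
    print(count_actions(sample))
```

Counting actions per type is the simplest form of the log analysis the slides describe; the same traversal could be grouped by `SessionId` to study how individual patrons use the services.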


Future Work – 1 of 3

• Collection
– Cover all universities in Australia, the region, and the world
– Expand from dissertations to theses to undergrad theses to reports to e-portfolios
– Add in page images of back files or at least bib records (retrospective)

Future Work – 2 of 3

• Promote use of ETDs
– Collaborate with Scirus, Google Scholar, …
– Support students so they have DOIs, resolved references, XML works
– Support cross language, multilingual, multimedia search/browse
– Provide aids to classification into DDC or domain-specific classification

Future Work – 3 of 3

• Enhance services for students
– Help students with electronic submission, e-pub, multimedia, hypermedia, electronic data sets, electronic lab notebooks
– Support students with annotations, threaded discussions, recommenders
– Support work with early versions of ETDs for collaboration, with limited access and chat groups
– Encourage submission of PPT, video of defense (or a rerun)

Conclusions

• Review

• History

• Significant Progress

• Many Challenges

• Key Role for ADT

Review

• Acknowledgements

• Digital Libraries

• NDLTD

• Recent and Future Work

• Conclusions


Conclusions

• Review

• History

• Significant Progress

• Many Challenges

• Key Role for ADT

Questions? Discussion?

Recommendations?

Thank You!