ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC


Marthie de Kock, The Hong Kong Institute of Education

9 December 2002

Education Imaging System (EdIS)

Hong Kong Institute of Education Library

Points for discussion

• Scope and functions

• EdIS Phase I

• EdIS Phase II

• Background

• Different document classes

• Data retrieval & searching

• INNOPAC and the Z server

Scope

• Provide a sophisticated system to manage the growing volume of electronic media, including text, black-and-white scanned images, colour photos, audio, video and multimedia presentations, available to and in the HKIEd Library.

• Provide an effective web interface to retrieve on-line digitised materials.

System Functions

• Capture of content, storage & management

• Scanning & OCR

• Supports both English and Chinese indexing and full text searching
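
The slides do not say how EdIS indexes mixed English and Chinese text. As a rough illustration only, one common approach is to index English as lowercase words and Chinese as overlapping character bigrams, since Chinese has no spaces between words. The function names and the bigram choice below are assumptions for this sketch, not a description of the actual system.

```python
import re
from collections import defaultdict

# Assumption: ASCII letters/digits form words; CJK Unified Ideographs form Chinese runs.
_TOKEN_RE = re.compile(r"[A-Za-z0-9]+|[\u4e00-\u9fff]+")


def tokenize(text):
    """Split text into lowercase English words and overlapping Chinese bigrams."""
    tokens = []
    for run in _TOKEN_RE.findall(text):
        if run[0].isascii():
            tokens.append(run.lower())
        elif len(run) == 1:
            tokens.append(run)
        else:
            tokens.extend(run[i:i + 2] for i in range(len(run) - 1))
    return tokens


def build_index(documents):
    """Build an inverted index: token -> set of document ids."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every token of the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical sample records, for illustration only.
documents = {
    1: "Dance education 舞蹈教育",
    2: "Music education 音樂教育",
}
index = build_index(documents)
```

With these sample records, `search(index, "舞蹈")` matches only the first document, while `search(index, "education")` matches both.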

Background

First digital library initiative of the HKIEd Library

• Joint project between IBM and the Library, with technical support from ITS

• July 1997 - signed contract with IBM and its Digital Library

• 23 June 1998 - the system was launched

Search Interface of EdIS > The Main Screen

Contents of EdIS Phase I: Four Document Types

Document types          Digitised items
Newspaper clippings     Image scanning & OCR
Examination papers      Image scanning & OCR
Curriculum materials    Multimedia objects
Student projects        Multimedia objects

Document Types: News Clippings & Exam Papers

• News clippings:
  • Past newspaper clippings
  • Scanning, OCR, indexing
  • Wiser News indexing & CMC operations

• Exam papers:
  • Departments
  • Scanning, OCR, indexing

Document Types: Curriculum Materials & Student Projects

• Digitising procedures included:
  • Content analysis
  • Categorise multimedia objects
  • Write a summary
  • Digitise materials, save files with logical file names, design web pages and prepare scripts for uploading
  • Upload documents and test
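
The slides do not spell out the "logical file names" convention used during digitising. A minimal sketch of what such a convention could look like is given below; the pattern (class code, year, sequence number, title slug) and the function name are hypothetical and chosen only to illustrate the idea of predictable, script-friendly names.

```python
import re


def logical_file_name(doc_class, year, sequence, title, extension):
    """Build a predictable file name from catalogue-style fields.

    Hypothetical pattern: <class>_<year>_<4-digit sequence>_<title-slug>.<ext>
    """
    # Lowercase the title and collapse anything that is not a letter or digit into "-".
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    ext = extension.lower().lstrip(".")
    return f"{doc_class}_{year}_{sequence:04d}_{slug}.{ext}"


name = logical_file_name("curr", 2002, 7, "Dance in Primary Schools", "JPG")
```

Names built this way sort naturally by class, year and sequence, which makes the upload scripts mentioned above easier to write.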

Basic Search Screen of Curriculum Materials

Search results screen of [Title = dance]

Select the target page from the hit list.

EdIS Phase II

• Include Archive materials

• Improve multimedia searching

• Search Archive materials via INNOPAC

• No response – IBM’s DL and CMC

• June 2001 new Tender specifications

• Vitova

EdIS Phase II Development

• Customise system

• Project development – July 2001

• Z server

• System delivered – April 2002

• Interface – uploading of Wiser news

System Architecture

Three subsystems:

• Client subsystem
  • Front-end PC workstations with a Netscape or Microsoft web browser are available for record retrieval and viewing.

• Capturing subsystem
  • Used for content preparation (scanning, OCR and indexing)

• Server subsystem
  • The production server stores records and manages the system's operations

Configuration

• Hardware:
  • SUN Enterprise 250 server
  • 36 GB data storage space
  • Configured as RAID 1 (disk mirror)

• Operating software:
  • ORACLE Database 8i for SUN Sparc Solaris Unix 2.7
  • Z39.50 server for document searching
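
To make the Z39.50 searching concrete, the sketch below builds a query string in Prefix Query Format (PQF), a notation used by Z39.50 toolkits such as YAZ, using Bib-1 Use attributes (4 = title, 21 = subject heading, 1003 = author, 1016 = any). Which attributes the EdIS Z server actually supports is not stated in the slides, so the field mapping and helper names here are assumptions for illustration.

```python
# Bib-1 Use attribute numbers; the selection of fields is an assumption for this sketch.
BIB1_USE = {"title": 4, "subject": 21, "author": 1003, "any": 1016}


def pqf_term(field, term):
    """Return a PQF clause such as '@attr 1=4 "dance"' for one field/term pair."""
    if '"' in term or "\\" in term:
        # Keep the sketch simple: reject terms that would need escaping.
        raise ValueError("quotes and backslashes are not supported in this sketch")
    return f'@attr 1={BIB1_USE[field]} "{term}"'


def pqf_and(*clauses):
    """Combine clauses with the binary prefix operator @and, nesting to the right."""
    if not clauses:
        raise ValueError("at least one clause is required")
    query = clauses[-1]
    for clause in reversed(clauses[:-1]):
        query = f"@and {clause} {query}"
    return query


# A title search for "dance", and the same search narrowed by a hypothetical subject term.
title_query = pqf_term("title", "dance")
combined_query = pqf_and(title_query, pqf_term("subject", "education"))
```

A Z39.50 client would send a query like `title_query` to the server and receive a hit list of matching records in return.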

Hardware and software

• Application software:
  • VitalDoc Document Imaging system - 40-user license
  • Two VitalScan licenses for desktop scanning and OCR
  • Chinese OCR - Tsinghua Wintone ver. 8.0

Other hardware

• Two scanning/OCR workstations

• Minolta PS7000 scanner

• Ricoh IS330DC DF and flatbed scanner

Typical Searching Procedure

• Enter searching criteria

• Browse hit list

• View result/content

• Review history

• New search

• Select class/database
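
The procedure above can be pictured as a small search session. The sketch below is a toy in-memory model, not the EdIS implementation: the class name, the title-substring matching and the sample records are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class SearchSession:
    """Toy model of one user's session: choose a class, search, browse, view."""

    databases: dict                      # class name -> {record id: title}
    current: str = ""                    # currently selected class/database
    hit_list: list = field(default_factory=list)
    history: list = field(default_factory=list)

    def select_class(self, name):
        """Select the document class/database to search; clears the old hit list."""
        if name not in self.databases:
            raise KeyError(name)
        self.current = name
        self.hit_list = []

    def search(self, title_term):
        """Enter criteria: case-insensitive title match; records the search in history."""
        if not self.current:
            raise RuntimeError("select a class/database first")
        term = title_term.lower()
        records = self.databases[self.current]
        self.hit_list = sorted(
            rid for rid, title in records.items() if term in title.lower()
        )
        self.history.append((self.current, title_term, len(self.hit_list)))
        return self.hit_list

    def view(self, record_id):
        """View a record chosen from the current hit list."""
        if record_id not in self.hit_list:
            raise KeyError(record_id)
        return self.databases[self.current][record_id]


# Hypothetical sample data.
session = SearchSession({
    "Curriculum materials": {"c1": "Folk Dance for Schools", "c2": "Music Appreciation"},
})
session.select_class("Curriculum materials")
hits = session.search("dance")
```

Starting a new search simply calls `search` again (or `select_class` to switch database), and `session.history` keeps a list of earlier searches for review.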

Future?

End