21
DIGITIZATION IN THEORY AND PRACTICE WEBSITE: www.helenokpala.com E-MAIL: [email protected] Helen Nneka Okpala Presentation done at University of Abuja Library Staff Training, 3 rd May – 6 th May, 2016.

Digitization in theory and practice

Embed Size (px)

Citation preview

Page 1: Digitization in theory and practice

D I G I T I Z AT I O N I N T H E O RY A N D P RAC T I C E

W E B S I T E : w w w. h e l e n o k p a l a . c o mE - M A I L : h e l e n . o k p a l a @ u n n . e d u . n g

Helen Nneka OkpalaPresentation done at University of Abuja Library Staff Training, 3rd May – 6th

May, 2016.

Page 2: Digitization in theory and practice

WHAT IS DIGITIZATION?• The process of converting analogue objects to

digital • The process of capturing analog materials as digital images

• Optical Character Recognition (OCR) programs “read” these images and convert them to text documents which can be easily searched, copied, edited, or used for computational text analysis methods.

Page 3: Digitization in theory and practice

DIGITIZATION CONTD…• Conversion of analog information in any form (text, photographs, voice, etc.) to digital form with suitable electronic devices (such as a scanner or specialized computer chips) so that the information can be processed, stored, and transmitted through digital circuits, equipment, and networks.

Read more: http://www.businessdictionary.com/definition/digitization.html#ixzz47bCzzTsg

Page 4: Digitization in theory and practice

OCR – OPTICAL CHARACTER RECOGNITION

• OCR programs process scanned documents (e.g. books, newspapers) to extract text.

• The quality of the text extraction will depend on the resolution of the scanned document, the format and quality of the print materials, and how well the OCR program deals with other languages or diacritical marks.

• No OCR program is 100% accurate.

Page 5: Digitization in theory and practice

BEFORE YOU DIGITIZE, DECIDE:

• Does digitization fit the organization's mission?

• Is there a known potential audience for the materials that are planned to be digitized?

• Will digitization increase access?

• Will it serve a purpose?

Page 6: Digitization in theory and practice

WHY DIGITIZE?

•??? ??? ??? ??? ??? ??? ??? ??? ??? ???

Page 7: Digitization in theory and practice

DIGITIZATION WORKFLOWIdentify a project

Selection criteria(old, institutional)

Copyright(OA; Restricted)

Manipulation(GIMP; Photoshop)

File formats(Jpeg; Tiff)

ScanningFlatbed/ Robotic

Web Ready(Adobe Acrobat

Pro)

Submit(IR – DSpace)

Database

Page 8: Digitization in theory and practice

IDENTIFY A PROJECT• Know your collections

– what is valuable

– what others need to “see”

– core business of institution

– what is used often

– benefit of such a project

Page 9: Digitization in theory and practice

SELECTION CRITERIA• Know the history and rationale behind selection of sources

• Start with items that are often used• Special attention to brittle material

• Published between a certain time-line• Language limitations• Forming part of a certain collection• Make sure no doubles are included or already online

available

Page 10: Digitization in theory and practice

COPYRIGHT• Stay clear of copyright• Try to avoid material still in CR and not owned by

institution• Where necessary start with copyright clearance first

– may take long to sort out• Note every step along the way – keep the evidence

Page 11: Digitization in theory and practice

PHYSICAL PRESERVATION

• Basic cleaning of material– dust

– tears / broken corners

– mould

– remove selotype / glue / pritt

– remove staplers, gem clips, anything that can cause rust marks

– store in acid free containers if possible

Page 12: Digitization in theory and practice

SCANNERS

i2S DigiBook bookscanner

PlusDeck 2cKodak/Minolta Microfiche scanner

Nikon 9000 Coolscan

Epson 1640X

Scribe scannersiTTUSB Turntable

Page 13: Digitization in theory and practice

IMAGE MANIPULATION

• Abbyy Finereader

• Adobe Photoshop

• Adobe Acrobat

Page 14: Digitization in theory and practice

WEB READY• Resolution of Jpeg derivative: 300dpi

• PDF can be compressed into smaller file sizes for the web

• OCR (Optical Character Recognition)

• Bookmark – For Easy navigation

• File Background (Optional)

Page 15: Digitization in theory and practice

SCANNING PROCESS• Install Scanner on your Computer(s), example EPSON GT-15000

•Select EPSON GT-15000. Select Office Mode, Grayscale, Document Table, 300 dpi

•Preview

Page 16: Digitization in theory and practice
Page 17: Digitization in theory and practice
Page 18: Digitization in theory and practice
Page 19: Digitization in theory and practice
Page 20: Digitization in theory and practice
Page 21: Digitization in theory and practice

ETCETERA…!