Nikola Tesla Museum Clipping Library. Saša Malkov, Nenad Mitić, Žarko Mijajlović. 3rd SEEDI Int. Conf., Cetinje, Montenegro, 14 September 2007.


Nikola Tesla Museum Clipping Library

Saša Malkov

Nenad Mitić

Žarko Mijajlović

3rd SEEDI Int.Conf.

Cetinje, Montenegro

14. September 2007.


Clipping Library

The Nikola Tesla Museum holds a rich collection of newspaper clippings on the life and work of Nikola Tesla

The clipping library was collected by Nikola Tesla himself, with the support of his personal secretary

Part of the library is organized in books, while many clippings remain unorganized


Digital Library Prototype

The Digitization Group at the Faculty of Mathematics undertook the development of a prototype digital clipping library

Primary goals:
– Analysis of the problem
– Identification of appropriate solutions


Problems

Significant variation in the sources and quality of the materials

Organization and modeling of the data and metadata

Data access


Differences in sources and preservation level

Different digitization techniques give different results, depending on the paper and print type and on the level of preservation

Several target formats are considered:
– Digital image formats
– PDF
– DjVu


Data organization

File systems are not appropriate:
– Complex access to data and metadata
– Limited search capabilities

Databases allow:
– Simpler access
– Advanced searching
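The contrast above can be made concrete with a minimal sketch of a clipping store, written in Python against the standard sqlite3 module as a stand-in for a full DBMS. The table and column names, the helper functions, and the choice of SQLite are all assumptions made for illustration; the slides do not describe the prototype's actual schema.

```python
import sqlite3

# Hypothetical schema: all table and column names are illustrative only.
SCHEMA = """
CREATE TABLE publication (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE clipping (
    id             INTEGER PRIMARY KEY,
    caption        TEXT NOT NULL,
    publication_id INTEGER REFERENCES publication(id),
    language       TEXT,
    published_on   TEXT,   -- ISO date string
    scan           BLOB,   -- digitized image bytes
    content        TEXT    -- extracted text after manual correction
);
CREATE TABLE keyword (
    clipping_id INTEGER NOT NULL REFERENCES clipping(id),
    word        TEXT NOT NULL,
    PRIMARY KEY (clipping_id, word)
);
"""


def create_library(path=":memory:"):
    """Open a database and create the (assumed) clipping tables."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def add_clipping(conn, caption, publication, language, published_on,
                 scan, content, keywords):
    """Store one clipping with its metadata, scan and key words."""
    conn.execute("INSERT OR IGNORE INTO publication(name) VALUES (?)",
                 (publication,))
    pub_id = conn.execute("SELECT id FROM publication WHERE name = ?",
                          (publication,)).fetchone()[0]
    cur = conn.execute(
        "INSERT INTO clipping(caption, publication_id, language,"
        " published_on, scan, content) VALUES (?, ?, ?, ?, ?, ?)",
        (caption, pub_id, language, published_on, scan, content))
    clipping_id = cur.lastrowid
    # Key words are lower-cased and de-duplicated before insertion.
    conn.executemany(
        "INSERT INTO keyword(clipping_id, word) VALUES (?, ?)",
        [(clipping_id, w) for w in {k.lower() for k in keywords}])
    conn.commit()
    return clipping_id
```

Keeping the scan, the corrected text and the metadata in one place means a single query can combine them, which is the "simpler access" a directory tree of image files does not offer.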


Automatic text extraction

Primary problems:
– Different languages
– A wide variety of highly stylized fonts used in the period
– Very low material quality, caused by aging

Different OCR systems were evaluated:
– No OCR software proved satisfactory, primarily because of the low readability of the material
– A significant amount of manual correction is necessary
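The slides do not say how the OCR systems were compared. One common way to quantify how much manual correction an OCR result needs is the character error rate: the edit distance between the OCR output and the corrected text, divided by the length of the corrected text. The sketch below is an illustration of that measure under this assumption, not the evaluation procedure used for the prototype; the function names are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance between strings a and b (row-by-row DP)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def character_error_rate(ocr_text, corrected_text):
    """Edits needed per character of the manually corrected text."""
    if not corrected_text:
        raise ValueError("corrected_text must not be empty")
    return edit_distance(ocr_text, corrected_text) / len(corrected_text)
```

A lower rate means less manual work per clipping, so the measure could be used to rank OCR systems on a small hand-corrected sample before committing to one.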


Searching

Multi-criteria searching is essential, including searching by:
– Metadata
    – Caption
    – Key words
    – Publication
    – Language
    – Period
– The clipping content
    – Manual correction of the text is essential
    – Efficiency requires some indexing method
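As a rough illustration of the criteria listed above, the following Python sketch combines metadata filters with a simple inverted index over the clipping text. The `Clipping` record, its field names, and the choice of an in-memory inverted index are assumptions for the example; the slides name neither the data model nor the indexing method actually used.

```python
import re
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Clipping:
    # Hypothetical record layout, for illustration only.
    id: int
    caption: str
    publication: str
    language: str
    year: int
    keywords: set = field(default_factory=set)
    content: str = ""  # text after manual correction


def tokenize(text):
    """Split text into lower-case word tokens."""
    return re.findall(r"\w+", text.lower())


def build_index(clippings):
    """Inverted index: word -> set of ids of clippings containing it."""
    index = defaultdict(set)
    for c in clippings:
        for word in tokenize(c.content):
            index[word].add(c.id)
    return index


def search(clippings, index, *, caption=None, keyword=None,
           publication=None, language=None, period=None, words=None):
    """Return clippings that satisfy every criterion that is not None."""
    candidates = None
    if words:
        # Intersect posting sets: every word must occur in the content.
        for w in tokenize(words):
            ids = index.get(w, set())
            candidates = ids if candidates is None else candidates & ids
    results = []
    for c in clippings:
        if candidates is not None and c.id not in candidates:
            continue
        if caption is not None and caption.lower() not in c.caption.lower():
            continue
        if keyword is not None and keyword.lower() not in {k.lower() for k in c.keywords}:
            continue
        if publication is not None and publication != c.publication:
            continue
        if language is not None and language != c.language:
            continue
        if period is not None and not (period[0] <= c.year <= period[1]):
            continue
        results.append(c)
    return results
```

For example, `search(clippings, index, language="en", period=(1890, 1900), words="wireless")` would return English clippings from that decade whose text contains the word. The index only finds words that are spelled correctly in the stored text, which is why manual correction of the OCR output matters for content search.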


The solution – DBMS

The prototype is based on the IBM DB2 DBMS:
– Advanced SQL implementation
– Efficient handling of binary content
– High level of concurrency
– High reliability
– Good prior experience
– Free licensing terms


The solution – User interface

The Web application concept is:
– Rich in content and visual presentation
– Customizable
– Portable
– Relatively simple to implement


The solution – Application

The library prototype is implemented in the functional programming language Wafl:
– Wafl is designed for automatic document generation and is particularly tailored to Web development
– It features very simple and efficient database access

