LETA Digitization Services


1. New informative solution: CONTENT DIGITIZATION

2. What is content digitization?

  • The possibility to convert physical documents and scanned materials into easy-to-classify and electronically searchable information.

3. Digitization process workflow

  1. Scanning/Import
  2. Verify Page Frames
  3. Verify Layout Elements
  4. Verify Page Numbers, Page Hierarchy
  5. Verify Hierarchy
  6. OCR
  7. Export

4. Step1 Scanning/Import

  • Material type:
    • Newspapers
    • Books
    • Magazines
    • Booklets
    • etc.
  • Access type:
    • Original images
    • Digital images
    • Microfilm
    • etc.
  • Before importing, it is important to define each document's metadata, which the program then captures automatically
    • Each document's metadata may include:

5. Step2 Verify Page Frames (VPF)

  • Basic operations:
    • Page deskew
    • Page cropping
  • Advantages:
    • Possibility to correct errors made during scanning:
      • Straighten pages that were scanned rotated or skewed
      • Cut off black scan borders
      • Divide double spreads in half
      • And more
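The cropping and spread-splitting operations above can be sketched in a few lines. This is only an illustration under stated assumptions, not the method used by the software described here: the page is assumed to be a grayscale image held as nested lists (0 = black, 255 = white), the threshold of 40 is arbitrary, and the function names `crop_black_borders` and `split_double_spread` are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # rows of grayscale pixels, 0 = black, 255 = white


def crop_black_borders(img: Image, threshold: int = 40) -> Image:
    """Keep the smallest rectangle containing every pixel brighter than threshold."""
    rows = [y for y, row in enumerate(img) if any(p > threshold for p in row)]
    if not rows:
        return []  # the image is empty or entirely dark
    cols = [x for x in range(len(img[0])) if any(row[x] > threshold for row in img)]
    top, bottom = rows[0], rows[-1]
    left, right = cols[0], cols[-1]
    return [row[left:right + 1] for row in img[top:bottom + 1]]


def split_double_spread(img: Image) -> Tuple[Image, Image]:
    """Cut a two-page spread down the vertical middle (assumes a centred gutter)."""
    mid = len(img[0]) // 2
    return [row[:mid] for row in img], [row[mid:] for row in img]


# A tiny 4x6 "scan": a bright 2x4 spread surrounded by a black border.
scan = [
    [0, 0, 0, 0, 0, 0],
    [0, 200, 210, 220, 230, 0],
    [0, 201, 211, 221, 231, 0],
    [0, 0, 0, 0, 0, 0],
]
spread = crop_black_borders(scan)
left_page, right_page = split_double_spread(spread)
```

A production tool would also need to tolerate scanner noise and find the real gutter position; the fixed midpoint here is the simplest possible choice.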

6. Step3 Processing page layout elements (VLE)

  • Basic operations:
    • Combining or revising zones
    • Deletion of unneeded zones
    • Definition of zone types
  • Advantages:
    • Possibility to work in fullscreen mode
    • Almost all operations may be performed with the mouse and a few keyboard keys

7. Step4 Verify Page Numbers / Verify Page Hierarchy

  • Basic operations:
    • Verify page numbers
    • Verify page hierarchy
    • Verify issue date, number, volume and edition
  • Advantages:
    • If the metadata include correct information on a given publication, this step may be skipped
    • Possibility to replace missing pages
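As a rough illustration of the page-number check, the sketch below lists the page numbers that are absent from a detected sequence, which is the kind of gap an operator would then fill with a replacement page. The function name `find_missing_pages` and the sample numbers are hypothetical, and the program's actual check is not described in this presentation.

```python
from typing import Iterable, List


def find_missing_pages(detected: Iterable[int], first: int, last: int) -> List[int]:
    """Return every page number in first..last that was not detected."""
    present = set(detected)
    return [n for n in range(first, last + 1) if n not in present]


# Example: an 8-page issue where pages 3, 6 and 7 were not found.
missing = find_missing_pages([1, 2, 4, 5, 8], first=1, last=8)
```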

8. Step5 Verify Hierarchy (VH)

  • Basic operations:
    • Structuring the publication's articles and sections
    • Checking that picture/table captions are attached to the correct picture/table
  • Book structure is different from newspaper structure
  • Advantages:
    • Possibility to correct mistakes that may have been made during any of the previous steps

9. Step6 OCR

  • Basic operations:
    • Correcting the required textual zone type
  • Advantages:
    • Possibility to correct any text zone (headings, authors, picture captions, etc.), if the integrated ABBYY program finds errors in the text
    • Possibility to change language or typeface

10. Step7 Export

  • The program automatically exports the files as defined in the client's specification
  • Material for publishing on the Internet
    • File formats:
      • JPEG, PNG, TIFF, etc.
      • PDF, structured PDF
      • XML (METS, ALTO, MODS, etc.)
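To give a feel for the XML export, here is a heavily simplified sketch that writes recognized words into an ALTO-like structure (`Layout` > `Page` > `PrintSpace` > `TextBlock` > `TextLine` > `String`). It is an assumption-laden illustration rather than the exporter used by the program: the XML namespace and other elements a schema-valid ALTO file needs are omitted, and the function name `words_to_alto`, the IDs, and the input dictionary keys are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List


def words_to_alto(words: List[Dict], page_width: int, page_height: int) -> str:
    """Serialize OCR words (text plus bounding box) as simplified ALTO-like XML."""
    alto = ET.Element("alto")  # namespace omitted for brevity (not schema-valid)
    layout = ET.SubElement(alto, "Layout")
    page = ET.SubElement(
        layout, "Page", ID="P1", WIDTH=str(page_width), HEIGHT=str(page_height)
    )
    print_space = ET.SubElement(page, "PrintSpace")
    block = ET.SubElement(print_space, "TextBlock", ID="TB1")
    line = ET.SubElement(block, "TextLine")
    for word in words:
        # Each word becomes one String element carrying its text and position.
        ET.SubElement(
            line,
            "String",
            CONTENT=word["text"],
            HPOS=str(word["x"]),
            VPOS=str(word["y"]),
            WIDTH=str(word["w"]),
            HEIGHT=str(word["h"]),
        )
    return ET.tostring(alto, encoding="unicode")


xml_text = words_to_alto(
    [{"text": "LETA", "x": 10, "y": 20, "w": 50, "h": 12}],
    page_width=1000,
    page_height=1400,
)
```

Storing coordinates next to each word is what allows a viewer to highlight search hits on the page image.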

11. We offer our services for

  • Libraries
    • Searchable archive
    • Digital preservation of cultural and historical heritage
    • Ease of adaptation for the Internet
  • Publishers
    • Searchable archive
    • Repeated use of content
    • Ease of adaptation for the Internet
    • Business opportunity to sell content

12. Our experience

  • Digitization project of the National Library of Latvia
    • Period: July 2010 to June 2012
    • Volume: 1 562 500 book pages, 2 750 000 periodical pages
    • Daily export: 10 000 pages
    • In cooperation with software provider CCS
  • Digitization project of the Italian newspaper LA STAMPA
    • Period: May 2009 to March 2010
    • Electronic archive covering the last 120 years
    • 800 000 pages
    • In cooperation with software provider Zissor
  • LETA Media monitoring
    • In operation since 2006
    • Each month we scan and electronically process more than 30 000 A4 pages of printed press
    • In cooperation with software provider Zissor
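The figures quoted for the National Library of Latvia project can be put in proportion with a short calculation. It uses only the numbers from the slide and assumes, purely for illustration, that the stated daily export rate is sustained on every export day.

```python
import math

book_pages = 1_562_500
periodical_pages = 2_750_000
daily_export = 10_000  # pages exported per day, as stated on the slide

total_pages = book_pages + periodical_pages
# Round up: a partly filled final day still counts as an export day.
export_days = math.ceil(total_pages / daily_export)

print(total_pages, export_days)
```

Under that assumption the 4 312 500 pages correspond to 432 export days, which fits within the roughly two-year project period.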

13. Our advantages

  • Successful cooperation with the largest European content conversion software providers, Zissor and CCS
  • Professional and highly qualified team of 50 employees
  • Competitive and affordable pricing policy

14. Pricing policy

  • Pricing options:
    • Buy each phase separately (scanning, segmentation, content portal)
    • Buy the whole digitization package
  • Price:
    • The price is set per processed page, depending on material difficulty (material quality, number of pictures/captions, columns, etc.)
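Per-page pricing by difficulty amounts to a simple weighted sum. The sketch below shows the shape of such a calculation only: the difficulty tiers, the rates, and the function name `quote_cents` are placeholders invented for illustration and are not LETA's actual tiers or prices.

```python
from typing import Dict

# Hypothetical per-page rates in cents; real rates are not given in this presentation.
RATE_CENTS: Dict[str, int] = {"simple": 10, "medium": 20, "complex": 35}


def quote_cents(page_counts: Dict[str, int]) -> int:
    """Total price in cents for a job given page counts per difficulty tier."""
    return sum(RATE_CENTS[tier] * pages for tier, pages in page_counts.items())


# Example job: 1000 simple pages and 200 complex pages.
total = quote_cents({"simple": 1000, "complex": 200})
```

Working in integer cents avoids floating-point rounding when many page prices are added together.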

15. Contacts

  • Ieva Portnaja
  • Head of Digitization Department
  • Phone: +371 22322733
  • E-mail: [email_address]
  • www.leta.lv