Plans for 2015 Tallinn, Jan 29 th, 2015 Ditte Laursen, dla@statsbiblioteket.dk Sabine Schostag,...

Preview:

Citation preview

Plans for 2015

Tallinn, Jan 29th, 2015

Ditte Laursen, dla@statsbiblioteket.dkSabine Schostag, sas@statsbiblioteket.dk

Plans for 2015 – Overview of presentation

Organisation Collection Preservation Access and dissemination Research collaborations

SB

Royal Library

IT developments

IT operations

Curator group

Steering Commitee

Directors

Coordinating Management Group

State Library

Collection

Analysis of broad crawls in order to achieve better collection methods

Analysis af web danica to identify Danish web content outside TLD .dk

Implementation of ISO statistics migration til heritrix3 test of tools

Preservation

upgrading of the preservation system replace the existing archive

module/components with a bit storage system called ”Bitmagasin”

Develop system with mass (big data) processing facilities (hadoop))

Collection management: locate and map the state af special collections, donations etc in order to integrate them into Netarchivet (as Warc files)

Access and dissimination (1)

Full text search with SOLR ( + restriction terms)

Access and dissimination (2)

10th anniversary (internal and external event)

Broader access E.g. for students (give acces to a chorpus

consisting of copyright free and screened web sites such as the Danish parliaments or the miniteries’ web sites

open wayback

Research collaboration: Netlab

An internet research infrastructure within the Danish research infrastructure for the humanities Digital Humanities Lab Research-driven projects to contribute to the

establishment, test and development of a research infrastructure for the study of online as well as archived internet materials

Develop a workspace (e.g., for searching and visualization) and build the relevant skills for using software-supported methods

Describe and evaluate the strengths and weaknesses of the Danish internet archive Netarkivet as part of a research infrastructure

8

9

Digital Humanities Lab

Language Media Experiments

Radio/tv NetLab

Online Archive

Netarkivet

NetLabForumIT developer

NeedsWays to filter search resultsA way to select and bookmark pagesA way to choose and isolate a corpusA flexible interface showing metadata

11

12

13

14

RESAW: A Research Infrastructure for the Study of Archived Web Materials’

A larger network of relevant institutions and researchers, European as well as international (app. 40 participants)

The basis for an application to EU’s Horizon 2020 within ’Research infrastructure – integrating activities’

Promote the establishing of a collaborative transnational European research infrastructure for the study of archived web materials

Research collaboration: RESAW

???

PWA

6. What is RESAW?

16

NetLab

Netarkivet

BUDDAH

BL

WebART

KB

Web90

BnF/INA

Activities PhD seminars, workshops, seminar An international conference 'Web Archives as scholarly Sources: Issues, Practices, and Perspectives', Aarhus, 8-10 June 2015 Small pilot projects (e.g. national webspheres, how the internet domain .eu can be archived, Eurovision Song Contest...)

Resaw.eu, and Newsletter

Research collaboration: RESAW