A closing talk I gave at the JISC/DPC 'Missing Links' conference on web archiving in July 2009. The talks were on the DPC site but ironically the link is now broken.
What we want with web-archives: will we win?
Kevin Ashley
ULCC Digital Archives Department
http://dablog.ulcc.ac.uk/
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
Past histories
• Tom Standage – The Victorian Internet
• Not just what was said, but how
http://vimeo.com/2312662
Thinking about use cases
• Not just document-centred
• Content
• Properties of content
• The web of data
• The web as data
• Stuff about the web as well as from the web
Document-centred is useful
• For many academic uses, still central
• Sometimes content, sometimes presentation, sometimes both
• Timeslices or places over time:
• Brian Kelly's history of University of Bath homepage
Content in aggregate
• Textual analysis
• Contrasting use of language
• Tracking spread of neologisms
• Word clouds
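As a rough illustration of tracking a neologism, the sketch below counts how often a term appears per 1,000 words in each year of a set of archived page texts. The function name `term_frequency_by_year`, the `(year, text)` input shape, and the sample snapshots are all assumptions made up for this example; a real archive would need its own extraction step to produce such pairs.

```python
import re
from collections import Counter


def term_frequency_by_year(snapshots, term):
    """Return {year: occurrences of `term` per 1,000 words}.

    `snapshots` is an iterable of (year, text) pairs; this shape is an
    assumption for the sketch, not a format defined by any archive.
    """
    hits = Counter()
    totals = Counter()
    target = term.lower()
    for year, text in snapshots:
        # Crude tokenisation: lower-case runs of letters and apostrophes.
        words = re.findall(r"[a-z']+", text.lower())
        totals[year] += len(words)
        hits[year] += words.count(target)
    return {
        year: 1000 * hits[year] / totals[year]
        for year in sorted(totals)
        if totals[year]
    }


# Hypothetical sample data, invented purely for illustration.
snapshots = [
    (1998, "Read my online diary and sign the guestbook"),
    (2003, "New blog post: why every blog needs a feed"),
    (2003, "My diary has moved to a blog"),
]
rates = term_frequency_by_year(snapshots, "blog")
```

Normalising by total words per year matters because archives rarely hold the same volume of text for every year; raw counts would mostly reflect crawl size.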
Properties of content
• How quickly was PNG adopted?
• Was take-up uniform across countries and types of site?
• What did it replace?
• What happened to XPM?
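Questions like these could, in principle, be answered from per-resource metadata alone. The sketch below is one possible approach, assuming the archive can export `(year, MIME type)` pairs for captured resources: it computes each image format's share of all images per year. The function name `format_share_by_year` and the sample records are hypothetical and exist only for illustration.

```python
from collections import Counter, defaultdict


def format_share_by_year(records, prefix="image/"):
    """Return {year: {mime_type: share of that year's matching resources}}.

    `records` is an iterable of (year, mime_type) pairs; this input shape
    is an assumption for the sketch.
    """
    counts = defaultdict(Counter)
    for year, mime in records:
        # Only compare like with like, e.g. image formats against each other.
        if mime.startswith(prefix):
            counts[year][mime] += 1
    shares = {}
    for year in sorted(counts):
        total = sum(counts[year].values())
        shares[year] = {m: n / total for m, n in counts[year].items()}
    return shares


# Hypothetical records, invented purely for illustration.
records = [
    (1997, "image/gif"), (1997, "image/gif"),
    (1997, "image/x-xpixmap"), (1997, "text/html"),
    (2004, "image/gif"), (2004, "image/png"),
    (2004, "image/png"), (2004, "image/jpeg"),
]
shares = format_share_by_year(records)
```

Adding a country or site-type field to each record and grouping on it would extend the same idea to the question of whether take-up was uniform.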
Searching the past
The web as data
Hidekazu Shiozawa and Yutaka Matsushita – “Natto”
The web of data
• Linked data:
“a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web”
http://taggalaxy.de/
APIs that allow alternate views
• Archives collect, protect and provide permanent references for content
• APIs allow many views and uses to emerge
• They permit intelligent intermediaries to do our work, or to assist
• Important as archive space fragments
Other stuff on or about the web
• Traditional media about the web
• Usage logs, server configs, server software
• Browsers, plugins, validators, …
Thanks to Martin Dodge’s cyber-geography pages