Upload
arcomem
View
378
Download
0
Embed Size (px)
Citation preview
ARCOMEMSocial media archiving
Dominik Frey (SWR) | Cosmin Cabulea (DW)
DIATA12, 21.03.2012
ARchive COMmunity MEMories: How to identify and preserve relevant social media content?
2
Social media archiving
Project consortium
01/2011 - 12/2013, funded by the EC
3
Use cases
Broadcaster: Rock festivals
Parliament: Euro Crisis
4
Talk about Rock am Ring
5
News, opinions, facts, rumors, … Links to videos, images, blogs, …
What content is relevant?
8
Social web anlysis: popularity, influence, trust, diversity
Semantic analysis: entities, topics, events, opinions
Usage scenarios
For archivists
support content selection & contextualize web archives
For journalists
find relevant content for their stories & follow the discussions about it
9
Two stage archiving strategy: web analyzing storage archive
Archivist describes targetHTML and API crawlers fetch content
Archiving workflow
10
Different modules analyse semantic information & social context to filter relevant content
HBase and RDF triple storage
Archiving workflow
11
Only relevant content is preserved in (W)ARC format
Semiautomatic content selectionHeritrix and Wayback compatible
Archiving workflow
12
Fulltext search and facet browsingSemantic and social contextualization
Visualizations to be developed on top (not in ARCOMEM sope)
Archiving workflow
13
The Journalistic Scenario
14
The Journalistic Use Case
15
The Story
16
Data
17
The Challenges
18
The Data Layers
19
Social web
The Challenges
20
Vox Civitas User Interface
21
SRSR (Seriously Rapid Source Review)
22
Riot rumours: how misinformation spread on Twitter during a time of crisis
23
ARCOMEM Graphic User Interface (Draft)
24
Third-Party-Brain
25