26
ARCOMEM Social media archiving Dominik Frey (SWR) | Cosmin Cabulea (DW) DIATA12, 21.03.2012

Diata12 ARCOMEM

  • Upload
    arcomem

  • View
    378

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Diata12 ARCOMEM

ARCOMEMSocial media archiving

Dominik Frey (SWR) | Cosmin Cabulea (DW)

DIATA12, 21.03.2012

Page 2: Diata12 ARCOMEM

ARchive COMmunity MEMories: How to identify and preserve relevant social media content?

2

Social media archiving

Page 3: Diata12 ARCOMEM

Project consortium

01/2011 - 12/2013, funded by the EC

3

Page 4: Diata12 ARCOMEM

Use cases

Broadcaster: Rock festivals

Parliament: Euro Crisis

4

Page 5: Diata12 ARCOMEM

Talk about Rock am Ring

5

News, opinions, facts, rumors, … Links to videos, images, blogs, …

Page 8: Diata12 ARCOMEM

What content is relevant?

8

Social web anlysis: popularity, influence, trust, diversity

Semantic analysis: entities, topics, events, opinions

Page 9: Diata12 ARCOMEM

Usage scenarios

For archivists

support content selection & contextualize web archives

For journalists

find relevant content for their stories & follow the discussions about it

9

Page 10: Diata12 ARCOMEM

Two stage archiving strategy: web analyzing storage archive

Archivist describes targetHTML and API crawlers fetch content

Archiving workflow

10

Page 11: Diata12 ARCOMEM

Different modules analyse semantic information & social context to filter relevant content

HBase and RDF triple storage

Archiving workflow

11

Page 12: Diata12 ARCOMEM

Only relevant content is preserved in (W)ARC format

Semiautomatic content selectionHeritrix and Wayback compatible

Archiving workflow

12

Page 13: Diata12 ARCOMEM

Fulltext search and facet browsingSemantic and social contextualization

Visualizations to be developed on top (not in ARCOMEM sope)

Archiving workflow

13

Page 14: Diata12 ARCOMEM

The Journalistic Scenario

14

Page 15: Diata12 ARCOMEM

The Journalistic Use Case

15

Page 16: Diata12 ARCOMEM

The Story

16

Page 17: Diata12 ARCOMEM

Data

17

Page 18: Diata12 ARCOMEM

The Challenges

18

Page 19: Diata12 ARCOMEM

The Data Layers

19

Social web

Page 20: Diata12 ARCOMEM

The Challenges

20

Page 21: Diata12 ARCOMEM

Vox Civitas User Interface

21

Page 22: Diata12 ARCOMEM

SRSR (Seriously Rapid Source Review)

22

Page 23: Diata12 ARCOMEM

Riot rumours: how misinformation spread on Twitter during a time of crisis

23

Page 24: Diata12 ARCOMEM

ARCOMEM Graphic User Interface (Draft)

24

Page 25: Diata12 ARCOMEM

Third-Party-Brain

25

Page 26: Diata12 ARCOMEM

26

THANK YOU

CONTACT DETAILS

Dominik [email protected]

Cosmin [email protected]

www.arcomem.eu