Linked Data Publication of Live Music Archives

Hello Cleveland!

Linked Data Publication of Live Music ArchivesSean Bechhofer*, Kevin Page+, David De Roure+

*School of Computer Science, University of Manchester+Oxford eResearch Centre, University of Oxford

@seanbechhofer

DMRN+7, QMUL, December 2012

The Proposition๏ Publication of structured metadata describing an audio

collection

๏ Links to external resources provide additional context and information

๏ Rich query to allow the extraction of “interesting” subcollections

The Players• The Internet Archive Live Music Archive

✦ Community contributed live audio recordings

• Semantic Technologies✦ RDF, Ontologies, SPARQL and Linked Data

• Additional resources✦ Artist DBs, Geographical Information, Venue information, etc.

• Some ruby scripts.....

The etree Collection• Internet Archive Live Music Archive• Community contributed live performance recordings

✦ “Legal bootlegs”

• Approx 4,000 artists,✦ 100,000 performances

• Why is it interesting?✦ Audio available in various formats

✤ mp3, ogg, shn, flac....✦ Multiple performances by artists✦ Cover versions

Semantic Technologies• Semantic Technologies aim to provide structured, machine

readable representations of content✦ Unified frameworks for (meta)data

• RDF: Resource Description Framework✦ Triple based representation of information

• OWL/SKOS: Ontologies & Vocabularies for content description✦ Shared vocabularies plus definitional capabilities

• SPARQL✦ A query language for RDF data✦ A generic API

Semantic TechnologiesRDF

• Triple Based Representation• Common Data Model• Identification via URIs • Easy Integration

✦ Graph Merging

• Query via SPARQL✦ A flexible, generic API

OWL/SKOS• Shared Vocabularies for

content description✦ Facilitating interoperation and

exchange✦ Everybody talks the same

language

• OWL allows for rich expressions and definitions

• SKOS supports simpler thesauri/controlled vocabularies

Linked Data• A set of common principles for data publication

• Common infrastructure facilitates construction of applications.• Use of content negotiation to supply “appropriate”

representations

1. Use URIs for identification2. Use HTTP URIs (that will dereference)3. Return useful information when dereferenced 4. Include links in that information

Linked Data Resources• MusicBrainz

✦ RDF conversions of MusicBrainz data

• Geonames✦ Information about locations

• DBpedia✦ Structured representation of Wikipedia content

• BBC✦ Programme information, artist information

Data mangling• Download of etree metadata files• Simple data conversion

✦ XML to RDF✦ etree data model

• Alignments✦ String matching plus bespoke

methods for locations✦ Explicit capture of alignments

• Publication Infrastructure✦ fuseki server + pubby front end

Modelling

Music OntologyEvent Ontology

Data Alignment• MusicBrainz

✦ Artist alignment via simple name queries

• Geographical Locations✦ Query against Geonames✦ Query against last.fm✦ Combination of string matching and lat/long

Layering• Alignments are captured in an additional layer of data on top of

the underlying source facts• Preserving original metadata

✦ Allows clients to make their own judgements✦ Preserves subjectivity

• Explicitly exposing the source of the mappings✦ Use of Provenance vocabularies

sameAs

Modelling

Similarity Ontology

Big Picture

Discussion• So far entirely metadata based

✦ No processing of underlying audio

• Alignment is a little messy✦ But has to be automated

• Dataset itself is an interesting artefact✦ Contrasts with some other LD activities.

• Is this actually useful?

Do artists really get a better reception when they play in their home town?

The Future• Better alignment

✦ Beyond simple string queries

• More alignment✦ Adding in, e.g. MusicBrainz track/work resources✦ Other collections?✦ Modelling questions

• Characterising Alignments• Audio Fingerprinting

✦ Identifying further track level matches

• Crowdsourcing corrections• Extracting subcollections

✦ What would you want?? 30

Thanks! You’ve been a great audience!

http://etree.linkedmusic.org31

Linked Data Publication of Live Music Archives

Documents

Linked Data as an enabling framework for resource discovery across libraries, museums and archives

Preserving the intelligibility of digital archives of contemporary music with live electronics

A linked open data architecture for contemporary historical archives

Report on the International Linked Open Data for Libraries, Archives and Museums Summit

Linked in live presentation hall nashville tech council 2 oct 14

Linked Data Approaches for Archival Description August 12, 2014 | Washington DC Digital Collections and Archives SAA Research Forum Eliot Wilczek

Adrian Stevenson (UKOLN) – Linked Open Copac Archives Hub (LOCAH) project – use of Timemap for visualising linked data

Linked Open Data: Opportunities & Barriers for Archives Adrian Stevenson LOCAH Project Manager UKOLN, University of Bath, UK Archives 360, Society of American

Sandra Collins - Building a linked data based content discovery service for the RTÉ Archives

Expressing language resource metadata as Linked Data: The ...€¦ · Simons and Bird, Expressing language resource metadata as Linked Data, Dec 2016 2 The Open Language Archives

Using Web Archives to Enrich the Live Web Experience Through Storytelling

OLA Superconference 2017 - Bridging the Gap: Linked Open Data for Libraries, Archives and Museums

Using Web Archives to Enrich the Live Web Experience Through Storytelling

Drawing Context from the Linked Data Web: The 20th Century Press Archives (Joachim Neubert)

Exploring the Use of Linked Data to Bridge State and Federal Archives

Increasing Access to Archives Through Linked Open Datafiles.archivists.org/researchform/2012/PDFS/Gracy... · Increasing Access to Archives Through Linked Open Data Karen F. Gracy

2010 Get Linked Madison Archives

A linked open data architecture for contemporary ...ceur-ws.org/Vol-1091/paper5.pdf · A linked open data architecture for contemporary historical archives ... learning and natural

usc foundation endowments | webinars on demand | wtfsw on ... Talk Archives...bers: complimentary live webinars. Members are eligible for at least one live webinar registration per

Describing web archives: a standard with an identity crisis?netpreserve.org/ga2019/wp-content/uploads/2019/07/...Yasmin AlNoamany. Using Web Archives to Enrich the Live Web Experience