40
Hideaki Takeda / National Institute of Informatics Connecting Museums with Linked Data 以鏈結資料連結博物館 Hideaki Takeda 武田英明 [email protected] National Institute of Informatics 国立情報学研究所 When Culture Encounters Internet Conference, December 14th-15th, 2010, Taipei With LODAC project team I. Ohmukai, F. Kato, T. Kamura, T. Takahashi, H. Ueda

Takeda 101214short-d

Embed Size (px)

Citation preview

Page 1: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Connecting Museums with Linked Data以鏈結資料連結博物館

Hideaki Takeda  武田英明[email protected] Institute of Informatics国立情報学研究所

When Culture Encounters Internet Conference, December 14th-15th, 2010, Taipei

With LODAC project teamI. Ohmukai, F. Kato, T. Kamura, T. Takahashi, H. Ueda

Page 2: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Outline

l Information Cycle l Linked Data and Museum Datal LODAC Museum

Page 3: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Information Cycle

Create&

l Information can be created only based on existing informationn No information can be created out of nothingn Collect – Use & Create

l Value of information is how much it is usedn No value for information without usen Use & Create – Publish

l Accumulation of information is the wealth of societyn Distribution of information is the health of societyn Publish – Share -- Collect

Page 4: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Information Cycle

Create&

l Before Gutenbergn Media

uHand-writing booksuOral communication

n Information Cycle isuSlowuSmall amountuFew People

l After Gutenberg, the age of Mass media arrived …

Page 5: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Two social layers on information cycle with Mass Media

Create

Writer, Artist, ScholarMass media

Government

&

Page 6: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Two social layers on information cycle with Mass media

Create

Writer, Artist, ScholarMass media

Government

&OrdinaryPeople

Collect

UseCreate

&

Page 7: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Two social layers on information cycle with Mass Media

Create

Writer, Artist, ScholarMass media

Government

OrdinaryPeople &

Page 8: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

WebInternet

Web Server

Web BrowserCreate

& HTML Editor

Search Engine

Information Cycle with Web

Open Door to Information Cycle for Ordinary People

Page 9: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

WebInformation Cycle

Create&

l Web accelerate Information Cycle inn Speedn Quantityn People

Page 10: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Web

Create&

Internet

Web Server

Web Browser HTML Editor

Search Engine

Information Cycle with Web

Page 11: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Metadata is the platform of Information Cycle

&

Metadata

&Create

Page 12: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Linked Data will be the platform of Information Cycle on the content layer

&

Metadata

&Create

Linked Data

Page 13: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

LOD Cloud(Linking Open Data)

Page 14: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Linked Data – Four Rules

l Linked Data is “Web of Data”n (Traditional) Web is “Web of Documents”

l What is Linked Data?n RDF triplesn Can refer othersn Can be referred by others,

l Four Rules for Linked Datan Use URIs as names for things n Use HTTP URIs so that people can look up those names. n When someone looks up a URI, provide useful information, using

the standards (RDF*, SPARQL) n Include links to other URIs. so that they can discover more things

Linked Data, TBL, http://www.w3.org/DesignIssues/LinkedData.html

Page 15: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Importance of data in public sector as Linked Data

l In principle, it should be shared l It is the basic knowledge of our societyl Data in public sector

n Library n Museumn Archiven Government

Page 16: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Challenges for Linked Data in Japan

l Lack of culture of sharingl Immature community for linked datal Lack of central data Setl Difficulty of multi-lingual data

Anyway let’s start!

Page 17: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

LODAC Project

l Open Social Semantic Web Platform for Academic Resourcesn Providing platforms for Linked Datan Practicing data accumulation and publishing

l Interested Areasn Museum informationn Geographical information, especially geographical namesn Local informationn …

Page 18: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Museum data as LOD

l The state-of-the-art of museum information in Japann Distributed

uSelf maintainedu Isolated

n OpaqueuSelf designeduMessy

l Aggregating and associating museum informationn LODAC-Museum (tentative)

Page 19: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Over 1.4 billion collectionsOver 1,000 organizations

Page 20: Takeda 101214short-d

Hideaki Takeda / National Institute of Informaticshttp://lod.ac/ (open on December 11)

Page 21: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

LODAC Museum – Main work

l Gathering of datan Thesaurus, museum collections, etc

l Standardization of datan Representing data from different sources in a unique form

l Integration of datan Identifying datan Associating the same data

l Publishing and share of data

Page 22: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Data sources

l Thesaurus and authority sourcesn 日本美術シソーラス DB 絵画編

(Thesaurus of Japanese Art)n 国指定文化財データベース

(DB for National Designated Cultural Property)n 文化遺産オンライン

(Cultural Heritage Online)l Museum Collection (14 museums)

n 国立美術館所蔵作品総合目録検索システム ( 国立国際美術館,京都国立近代美術館,東京国立近代美術館 ) (4 Nat’l Museums)

n 国立西洋美術館 (Nat’l M. Western Art)n 京都国立博物館 (Kyoto Nat’l Museum)n 奈良国立博物館 (Nara Nat’l Museum)n 福島県立美術館 (Fukushima Pref. M. of Art)

l Other sourcesn DBPedia Japann GIS data

n 栃木県立美術館

n 秋田県立近代美術館

n 岩手県立美術館

n 徳島県立近代美術館

n 山梨県立美術館

n 東京都現代美術館

n 香川県立東山魁夷せとうち美術館

Page 23: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Metadata design

l Basic Structuren Work – Creator – Museum

l Interoperability is more considered than correctness in the domainn DC> DCTerm> FOAF> iCal >SKOS>NDLSH> RDA> CIDOC

CRMn Keep it flat as long as possiblePREFIX URI

crm http://purl.org/NET/cidoc-crm/core#dc http://purl.org/dc/terms/dc11 http://purl.org/dc/elements/1.1/foaf http://xmlns.com/foaf/0.1/skos http://www.w3.org/2004/02/skos/core#rdfs http://www.w3.org/2000/01/rdf-

schema#ical http://www.w3.org/2002/12/cal/ical#rda2 http://RDVocab.info/ElementsGr2lodac http://lod.ac/ns/lodac#

lodac:Work Property( 一部項目省略 )資料分類 lodac:genre文化財 lodac:culturalAssets制作者 dc:creator / dc11:creator国籍 crm:P7_took_place_at作品名 dc:title / skos:prefLabel作品名読み dc:title @ja-hrkt / skos:altLabel作品名英語 dc:title @en / skos:altLabel銘文 crm:P62I_is_depicted_by印章 crm:P65_shows_visual_item員数 crm:P57_has_number_of_partsコレクション dc:isPartOf制作年 dc:created推定始年 lodac:estimatedStartYear材質 dc:medium / crm:P45_consists_of

Metadata elementsWork:   46Person:   23Org.   13Bib.   12

Page 24: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Integration Policy

l How to integrate data from different sources n sharing of responsibility

uEach source is responsible for its data l Identifying IDs for data and managing data with the IDs

uLODAC is only responsible for integrationl Assigning original IDs and associating other IDs to them

Data from Source B

24

Integrated data

dc:references dc:references

dc:references dc:references

dc:references dc:references

dc:creatordc:creator

crm:P55_has_current_location crm:P55_has_current_location

crm:P55_has_current_location dc:creatorData from Source A

Work

Museum

Creator

Page 25: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Integration of Person Data

l Matching of Creatorsn Base: List of Artists from Thesaurus of Japanese Artn Target: Creators of collection in museums + Dbpedian Method: String match of namesn Results: Links from artist nodes to work nodes are added

LODAC data

Link to Work

DBpedia

Basic Information for Creators

Links

Page 26: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Page 27: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Page 28: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Page 29: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Page 30: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Page 31: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

徳島県立美術館  Tokushima Pref. Museum

Page 32: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

東京近代美術館  National Museum of Modern Art, Tokyo

Page 33: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

国指定文化財データベース  DB for National Designated Cultural Property

Page 34: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Tokushima Pref. Museum Thesaurus for Japanese Art DB for National Designated Cultural Property

National Museum of Modern Art, Tokyo

Fukui Pref. Museum

Page 35: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Data size and Integration Results

Source Type No.

国立美術館 ( 西美を除く 3 館 ) Work 25180国立西洋美術館 Work 4373京都国立博物館 Work 5819奈良国立博物館 Work 431福島県立美術館 Work 20栃木県立美術館 Work 32秋田県立近代美術館 Work 22岩手県立美術館 Work 1558徳島県立近代美術館 Work 18482山梨県立美術館 Work 262東京都現代美術館 Work 5416香川県立東山魁夷せとうち美術館 Work 266Thesaurus for J. art Work 3800Thesaurus for J. art Person 1332Thesaurus for J. art Group 289Thesaurus for J. art Museum 648Cultural Heritage Online Museum 915Designated Cultural Property DB Work 10115

合計 103096

Type for Integration

Sources No. Results

Museum Thesaurus for J. art 648 77Cultural Heritage Online 915

Designated Cultural Property

Thesaurus for J. art (work) 3800 74Designated Cultural Property DB

10115

work Thesaurus for J. art (work) 1332 15020Museum collections (work) 61861

Person Thesaurus for J. art (artist) 1332 615Museum collections (work) 61861

Museum collections

Page 36: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

What can LOD give Museum Data?

Connectivity!!

l Open Connectivity makes new values for museum datan Connect to data in other areasn Connect to UGC (User Generated Contents)

Page 37: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Local Information with Museum data

l Museum LOD + Local LOD / Sightseeing LOD / Geo LOD

l e.g., n Tour visiting museums with a focus

n Joint event with local festivals

n Tour for food related historical events

n …

37

Page 38: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

User Generated Contents for Museum Information

1. Statue of Sarasvati 2. Ryohoji Temple

3. Theme Song for Ryohoji 4. Event

l Contributions by non-experts

l e.g.,

n Personal comments for Buddha statues

n Records of visiting museums

n Media-mix events

弁財天像

Page 39: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Publish museum data as LOD

l Let’s make museum data open and shareable

l Change “cultural heritage” to “cultural resources”

l (art/culture) * information = Promotion of the Nation

l Beyond collaboration of Museum Library Archives(MLA)

n MLA3(Museum Library Archives, Arts and Academia) l More users, more various types of usage

Page 40: Takeda 101214short-d

Hideaki Takeda / National Institute of Informatics

Make arts and culture more dynamic and more energetic

Pop Culture