
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


DESCRIPTION

European Data Forum 2014, Athens (Greece), presented during the session "Data Challenges I Languages, Governance, Business Models"


Page 1: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

Daniel Vila-Suero¹, Victor Rodríguez-Doncel¹, Asunción Gómez-Pérez¹, Philipp Cimiano², John P. McCrae², and Guadalupe Aguado-de-Cea¹

¹ Ontology Engineering Group, Facultad de Informática, UPM. Madrid, Spain. {dvila, vrodriguez, asun, lupe}@fi.upm.es

² Forschungsbau Intelligente Systeme (FBIIS). Universität Bielefeld. Bielefeld, Germany. {cimiano, jmccrae}@cit-ec.uni-bielefeld.de

Page 2: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

07/04/2023 2 Daniel Vila-Suero

Context: Lider project

• Ecosystem of linguistic resources (corpora, lexico-semantic data, etc.) as Linked Data, and NLP services to support content analytics.

Join us! http://lider-project.eu

Linked Data for Language Technologies Community Group (LD4LT)

Page 3: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Licensing Linked Data, why?

Open Data:
• Gain visibility
• Encourage re-use

Proprietary Data:
• Protect your data
• Enable ways to track usage
• Think about new business models

Page 4: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


How open is the LOD cloud?

[1] Rodríguez-Doncel, Victor et al., 2013. Rights declaration in Linked Data. In Proc. of the 3rd Int. Workshop on Consuming Linked Data, O. Hartig et al. (Eds.), CEUR vol. 1034.

Page 5: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


How open is the LOD cloud?

• 338 datasets in :

[1] Rodríguez-Doncel, Victor et al., 2013. Rights declaration in Linked Data. In Proc. of the 3rd Int. Workshop on Consuming Linked Data, O. Hartig et al. (Eds.), CEUR vol. 1034.

Page 6: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Linguistic Linked Data

Language resources as Linked Data:
• Lexica
• Language descriptions
• Corpora
• …

Linguistic LOD (LLOD) cloud

¹ "Open Data and Linguistics" working group, Open Knowledge Foundation; see more at http://linguistics.okfn.org/

Page 7: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


How open is the LLOD cloud?

Page 8: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


What is 3LD?

3LD Linguistic Linked Licensed Data

Page 9: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


What is 3LD?

3LD Linguistic Linked Licensed Data

Language resources such as:
- Lexica
- Corpora
- Dictionaries
- ..

Page 10: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


What is 3LD?

3LD Linguistic Linked Licensed Data

Linguistic data as Linked Data using RDF and standard data models (vocabularies):
- Lexica
- Corpora
- ..

NIF: NLP Interchange Format

Page 11: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


What is 3LD?

3LD Linguistic Linked Licensed Data

Linguistic Linked Data published along with a machine-readable license.

ODRL: Open Digital Rights Language

NIF: NLP Interchange Format

Page 12: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Guideline: Licensing models & mechanisms

1 Add "rights" metadata in the dataset description (e.g., VoID, DCAT)

DCAT: Data Catalog Vocabulary

Page 13: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Guideline: Licensing models & mechanisms

1 Add "rights" metadata in the dataset description (e.g., VoID, DCAT)

2 Use standard predicates to declare "rights" statements (e.g., Dublin Core terms: dc:rights, dct:license)

DCAT: Data Catalog Vocabulary

Page 14: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Guideline: Licensing models & mechanisms

1 Add "rights" metadata in the dataset description (e.g., VoID, DCAT)

2 Use standard predicates to declare "rights" statements (e.g., Dublin Core terms: dc:rights, dct:license)

Standard license available? → 3a

DCAT: Data Catalog Vocabulary

Page 15: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Guideline: Licensing models & mechanisms

1 Add "rights" metadata in the dataset description (e.g., VoID, DCAT)

2 Use standard predicates to declare "rights" statements (e.g., Dublin Core terms: dc:rights, dct:license)

Standard license available?
• Yes → 3a Use URI of standard license, e.g., CC0

DCAT: Data Catalog Vocabulary

Page 16: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Guideline: Licensing models & mechanisms

1 Add "rights" metadata in the dataset description (e.g., VoID, DCAT)

2 Use standard predicates to declare "rights" statements (e.g., Dublin Core terms: dc:rights, dct:license)

Standard license available?
• Yes → 3a Use URI of standard license, e.g., CC0
• No → 3b Use rights declaration language, e.g., ODRL

ODRL: Open Digital Rights Language

DCAT: Data Catalog Vocabulary
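
The guideline steps can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' tooling: the function name `rights_statement`, its parameters, and the choice to attach an ODRL policy URI with dct:license are all hypothetical. It emits a small Turtle description for a DCAT dataset (step 1), uses the Dublin Core predicate dct:license (step 2), and points it either at a standard license URI such as CC0 (step 3a) or at the URI of an ODRL policy (step 3b).

```python
# Minimal sketch of the guideline's decision flow (hypothetical helper).
CC0_URI = "http://creativecommons.org/publicdomain/zero/1.0/"


def rights_statement(dataset_uri, standard_license_uri=None, odrl_policy_uri=None):
    """Return a Turtle snippet declaring rights for a DCAT dataset.

    Step 3a: if a standard license fits, reference its URI.
    Step 3b: otherwise reference a policy written in a rights
             declaration language such as ODRL (assumed to be
             published at odrl_policy_uri).
    """
    if standard_license_uri:
        target = standard_license_uri
    elif odrl_policy_uri:
        target = odrl_policy_uri
    else:
        raise ValueError("a standard license URI or an ODRL policy URI is required")

    return (
        "@prefix dcat: <http://www.w3.org/ns/dcat#> .\n"
        "@prefix dct: <http://purl.org/dc/terms/> .\n"
        "\n"
        f"<{dataset_uri}> a dcat:Dataset ;\n"
        f"    dct:license <{target}> .\n"
    )


if __name__ == "__main__":
    # example.org URIs are placeholders, not real datasets.
    print(rights_statement("http://example.org/dataset/lexicon",
                           standard_license_uri=CC0_URI))
```

Calling the function with only `odrl_policy_uri` covers the "No" branch; the policy document itself would be authored separately.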

Page 17: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Demo: Conditional access to Linked Data

• Prototype developed at the Ontology Engineering Group.

• A license-aware Linked Data server and a manager for data policies and licenses

• Using Web standards (DCAT descriptions, SPARQL constructs, ODRL RDF policies, etc.)

Victor Rodríguez

Page 18: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Demo: Use case

• Spanish geographical data: Administrative units, geopositions, links to DBpedia

1 Browse the data (user)

2 Set policies for parts of the dataset (admin)

3 Gain access to the restricted data (user)

Page 19: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Conditional.linkeddata.es

Demo available at:

http://conditional.linkeddata.es

Page 20: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Browse data: resource Barcelona (user)

Page 21: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Browse data: resource Barcelona (machine)

<http://localhost:99/ldr/resource/Provincia/Barcelona>
    a <http://localhost:99/ldr/ontology/Provincia> ;
    <http://www.w3.org/2000/01/rdf-schema#label> "Barcelona"^^<http://www.w3.org/2001/XMLSchema#string> ;
    <http://localhost:99/ldr/ontology/formadoPor> <http://localhost:99/ldr/resource/Municipio/Barcelona> ;
    <http://localhost:99/ldr/ontology/tieneCapital> <http://localhost:99/ldr/resource/Municipio/Barcelona> ;
    <http://www.w3.org/2003/01/geo/wgs84%2C%20pos#geometry> <http://localhost:99/ldr/policy/cdaddba4-fc2e-4ee0-a784-e62f1db259bc> ;

Page 22: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Set some policies (admin)

Page 23: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Set some policies (admin)

Page 24: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Browse data: resource Barcelona (user)

Page 25: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Browse data: resource Barcelona (machine)

<http://localhost:99/ldr/resource/Provincia/Barcelona>
    a <http://localhost:99/ldr/ontology/Provincia> ;
    <http://www.w3.org/2000/01/rdf-schema#label> "Barcelona"^^<http://www.w3.org/2001/XMLSchema#string> ;
    <http://localhost:99/ldr/ontology/formadoPor> <http://localhost:99/ldr/resource/Municipio/Barcelona> ;
    <http://localhost:99/ldr/ontology/tieneCapital> <http://localhost:99/ldr/resource/Municipio/Barcelona> ;
    <http://www.w3.org/2003/01/geo/wgs84%2C%20pos#geometry> <http://localhost:99/ldr/resource/wgs84/41.3948528938705%2C%202.17465899138105> ;
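
The two machine views of the Barcelona resource in this deck differ only in the object of the geometry property: one shows a policy URI, the other a geoposition resource. A server could produce that difference roughly as sketched below. This is an assumption-laden illustration, not the prototype's code: the function `visible_triples`, the idea of keying policies by predicate, and the example.org URIs are all hypothetical.

```python
# Minimal sketch of conditional access to triples (hypothetical design).
from typing import Dict, List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def visible_triples(
    triples: List[Triple],
    restricted: Dict[str, str],
    granted: Set[str],
) -> List[Triple]:
    """Return the triples a given user may see.

    restricted maps a predicate URI to the URI of the policy protecting it
    (assumption: policies are attached per predicate).
    granted holds the policy URIs whose conditions the user has satisfied.
    Objects of protected triples are replaced by the policy URI until the
    corresponding policy has been satisfied.
    """
    result = []
    for s, p, o in triples:
        policy = restricted.get(p)
        if policy is not None and policy not in granted:
            result.append((s, p, policy))  # hide the value, expose the policy
        else:
            result.append((s, p, o))
    return result


# Placeholder data; none of these URIs come from the demo.
EX = "http://example.org/"
DATA = [
    (EX + "Barcelona", EX + "label", "Barcelona"),
    (EX + "Barcelona", EX + "geometry", EX + "point/1"),
]
RESTRICTED = {EX + "geometry": EX + "policy/1"}
```

With no granted policies, `visible_triples(DATA, RESTRICTED, set())` returns the geometry triple pointing at the policy; once `EX + "policy/1"` is in `granted`, the original object is returned.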

Page 26: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Gain access to restricted data (user)

Page 27: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Gain access to restricted data (user)

<http://localhost:99/ldr/policy/ee32f675-ccae-4ca9-a544-3c07abf0b16e>
    a <http://www.w3.org/ns/odrl/2/Policy> , <http://www.w3.org/ns/odrl/2/Set> ;
    <http://www.w3.org/2000/01/rdf-schema#comment> "Individual triples are available upon payment of 1 euro cent" ;
    <http://www.w3.org/ns/odrl/2/permission> ….
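
The policy comment states that individual triples are available upon payment of 1 euro cent. How the prototype evaluates that condition is not shown in the slides; the sketch below is one possible reading, with a hypothetical `PayPerTriplePolicy` class and helper functions, and amounts kept as integer euro cents to avoid floating-point rounding.

```python
# Minimal sketch of a pay-per-triple check (hypothetical names throughout).
from dataclasses import dataclass


@dataclass(frozen=True)
class PayPerTriplePolicy:
    uri: str          # URI identifying the policy
    price_cents: int  # price of one triple, in euro cents


def cost_cents(policy: PayPerTriplePolicy, n_triples: int) -> int:
    """Total price, in euro cents, for n_triples under the policy."""
    if n_triples < 0:
        raise ValueError("n_triples must be non-negative")
    return policy.price_cents * n_triples


def grants_access(policy: PayPerTriplePolicy, n_triples: int, paid_cents: int) -> bool:
    """True when the amount paid covers the requested number of triples."""
    return paid_cents >= cost_cents(policy, n_triples)


# Placeholder policy mirroring the "1 euro cent per triple" comment.
POLICY = PayPerTriplePolicy(uri="http://example.org/policy/pay-per-triple",
                            price_cents=1)
```

A request for 250 triples under `POLICY` would cost 250 cents, so `grants_access(POLICY, 250, 249)` is false and `grants_access(POLICY, 250, 250)` is true.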

Page 28: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


Gain access to restricted data (user)

Page 29: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data


THANK YOU FOR YOUR ATTENTION

QUESTIONS?

TWITTER: @dvilasuero

Slideshare: /DanielVilaSuero