42
LOD2 Webinar . 24.06.2014 . Page 1 http:// lod2.eu Creating Knowledge out of Interlinked Data

LOD2 Webinar: SIREn

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 1 http://lod2.eu

Creating Knowledge out of Interlinked Data

Page 2: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 2 http://lod2.eu

Creating Knowledge out of Interlinked Data

http://lod2.eu

LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Programme. This 4-year project comprises leading Linked Open Data technology researchers, companies, and service providers. Coming from across 12 countries the partners are coordinated by the Agile Knowledge Engineering and Semantic Web Research Group at the University of Leipzig, Germany.

LOD2 will integrate and syndicate Linked Data with existing large-scale applications. The project shows the benefits in the scenarios of Media and Publishing, Corporate Data intranets and eGovernment.

Page 3: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 3 http://lod2.eu

Creating Knowledge out of Interlinked Data

http://lod2.eu

Once per month the LOD2 webinar series offer a free webinar about tools and services along the Linked Open Data Life Cycle.

Stay with us and learn more about acquisition, editing, composing, connected applications – and finally publishing Linked Open Data.

Page 4: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 4 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Nested Data Model• SIREn Overview• Getting Started with the SIREn Elasticsearch Plugin• Demo

Agenda

Page 5: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 5 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Model becoming prevalent: JSON, XML, Avro, …– Can be arbitrarily nested and large

– No strict schema / structure enforced

• Schema-less brings– Flexibility

– Ease of development

• Developers do not have to invest significant modelling effort upfront

Schema-Less Nested Data Model

Page 6: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 6 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Lucene, Solr and Elasticsearch plugin for indexing and searching JSON

• Rich data model (JSON)– Nested objects, nested arrays, datatypes

– Generic architecture compatible with various nested data models: JSON, JSON-LD, XML, Avro, ...

• Schema-agnostic– SIREn does not require any schema definition to index and search data

– Schema definition can change across records

• Designed from the ground up for high performance and scalability

Introducing SIREn

Page 7: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 7 http://lod2.eu

Creating Knowledge out of Interlinked Data

Introducing SIREn

Page 8: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 8 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Document-oriented search and analytics engine– JSON Document– Based on Apache Lucene

• Distributed, Replication– High Performance and Availability

• REST API

Elasticsearch - Overview

Page 9: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 9 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Index = Collection of Documents– Can have multiple shards and replicas

• Type = A set of documents sharing the same schema– Similar to a DB table

• Document = JSON object– Uniquely identified (index/type/id)– Similar to a DB record

Elasticsearch – Basic Concepts

Page 10: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 10 http://lod2.eu

Creating Knowledge out of Interlinked Data

SIREn – Basic Concepts

• JSON object = Tree– Different mapping available

• Node = An element of the tree– Can have a parent and one or more children– Contains data: text, numeric, boolean

Page 11: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 11 http://lod2.eu

Creating Knowledge out of Interlinked Data

SIREn – Basic Concepts

JSON Object Tree

{ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { "name" : "Data Collective", "type" : "financial-org" }, … ] }, … ] }

{ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { "name" : "Data Collective", "type" : "financial-org" }, … ] }, … ] }

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 12: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 12 http://lod2.eu

Creating Knowledge out of Interlinked Data

http://sirendb.com/downloads/

(Elasticsearch Distribution Coming Soon)

Getting Started with Elasticsearch & SIREn

Page 13: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 13 http://lod2.eu

Creating Knowledge out of Interlinked Data

SIREn Elasticsearch Distribution

├── dist│   ├── siren-core-1.3.jar│   └── siren-qparser-1.3.jar├── docs│   ├── apidoc│   └── siren-1.3-javadoc.jar├── example├── CHANGES.txt├── LICENSE.md├── README.md└── THIRD-PARTY.txt

├── dist│   ├── siren-core-1.3.jar│   └── siren-qparser-1.3.jar├── docs│   ├── apidoc│   └── siren-1.3-javadoc.jar├── example├── CHANGES.txt├── LICENSE.md├── README.md└── THIRD-PARTY.txt

SIREn LibrariesSIREn

Libraries

SIREn JavadocSIREn

Javadoc

Elasticsearch Distribution

Elasticsearch Distribution

Page 14: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 14 http://lod2.eu

Creating Knowledge out of Interlinked Data

SIREn Elasticsearch Distribution

├── bin│   ├── elasticsearch│   └── ...├── config│   ├── elasticsearch.yml│   └── logging.yml├── lib│   ├── elasticsearch-1.2.0.jar│   └── ...├── plugins│   └── siren-plugin│      ├── siren-elasticsearch-1.3.jar│ └── ...├── datasets├── NOTICE.txt└── README.md

├── bin│   ├── elasticsearch│   └── ...├── config│   ├── elasticsearch.yml│   └── logging.yml├── lib│   ├── elasticsearch-1.2.0.jar│   └── ...├── plugins│   └── siren-plugin│      ├── siren-elasticsearch-1.3.jar│ └── ...├── datasets├── NOTICE.txt└── README.md

ExecutablesExecutables

ConfigurationConfiguration

SIREn PluginSIREn Plugin

Page 15: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 15 http://lod2.eu

Creating Knowledge out of Interlinked Data

Starting Elasticsearch

$ ./bin/elasticsearch

[INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], …[INFO ][node ] [Valerie Cooper] initializing ...[INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites [][INFO ][node ] [Valerie Cooper] initialized[INFO ][node ] [Valerie Cooper] starting ...…[INFO ][node ] [Valerie Cooper] started

$ ./bin/elasticsearch

[INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], …[INFO ][node ] [Valerie Cooper] initializing ...[INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites [][INFO ][node ] [Valerie Cooper] initialized[INFO ][node ] [Valerie Cooper] starting ...…[INFO ][node ] [Valerie Cooper] started

Page 16: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 16 http://lod2.eu

Creating Knowledge out of Interlinked Data

Create an Index and Map SIREn’s Field

$ curl -XPUT 'http://localhost:9200/test/' –d '{ "mappings" : { "companies" : { "properties" : { "_siren_source" : { "index" : "analyzed", "analyzer" : "hybrid-json", "postings_format" : "Siren10AFor", "store" : "no", "type" : "string" } }, "_siren" : {} } }}'

$ curl -XPUT 'http://localhost:9200/test/' –d '{ "mappings" : { "companies" : { "properties" : { "_siren_source" : { "index" : "analyzed", "analyzer" : "hybrid-json", "postings_format" : "Siren10AFor", "store" : "no", "type" : "string" } }, "_siren" : {} } }}'

SIREn’s Field

SIREn’s Analyzer

SIREn’s Posting Format

Page 17: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 17 http://lod2.eu

Creating Knowledge out of Interlinked Data

Indexing a Document

$ curl -XPUT 'http://localhost:9200/test/companies/1' –d '{ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { "name" : "Data Collective", "type" : "financial-org" } ] } ]}'

$ curl -XPUT 'http://localhost:9200/test/companies/1' –d '{ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { "name" : "Data Collective", "type" : "financial-org" } ] } ]}'

Page 18: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 18 http://lod2.eu

Creating Knowledge out of Interlinked Data

• SIREn distinguishes 5 types of values in the JSON model:– Field, String, Long, Double, Boolean

Customising Datatype Analysis

{ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { "name" : "Data Collective", "type" : "financial-org" } ] } ]}

{ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { "name" : "Data Collective", "type" : "financial-org" } ] } ]}

json:field

xsd:long

xsd:string

Page 19: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 19 http://lod2.eu

Creating Knowledge out of Interlinked Data

Customising Datatype Analysis

# config/elasticsearch.yml

siren.analysis.datatype:http://json.org/field:

index_analyzer: keywordhttp://www.w3.org/2001/XMLSchema#string:

index_analyzer: simplesearch_analyzer: simpleFolding

http://www.w3.org/2001/XMLSchema#long:index_analyzer: long

index.analysis.analyzer:simpleFolding:

type: customtokenizer: letterfilter: [lowercase, asciifolding]

# config/elasticsearch.yml

siren.analysis.datatype:http://json.org/field:

index_analyzer: keywordhttp://www.w3.org/2001/XMLSchema#string:

index_analyzer: simplesearch_analyzer: simpleFolding

http://www.w3.org/2001/XMLSchema#long:index_analyzer: long

index.analysis.analyzer:simpleFolding:

type: customtokenizer: letterfilter: [lowercase, asciifolding]

Page 20: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 20 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Node Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

SIREn’s Query Type

SIREn’s Query

Full-Text Query:•Boolean, Phrase, Range, Fuzzy, Regexp, Proximity, ...

Page 21: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 21 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Node Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 22: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 22 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Node Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", "attribute" : "category_code", } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", "attribute" : "category_code", } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 23: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 23 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Node Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", "attribute" : "category_code", } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "node" : { "query" : "search", "attribute" : "category_code", } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 24: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 24 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Query operators for Ancestor-Descendant and Parent-Child relationships

Searching: Twig Query

Page 25: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 25 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Query operators for Ancestor-Descendant and Parent-Child relationships

• Consists of a root query

Searching: Twig Query

Boolean

Page 26: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 26 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Query operators for Ancestor-Descendant and Parent-Child relationships

• Consists of a root query and one or more child

Searching: Twig Query

Boolean

PhraseMUST

Page 27: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 27 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Query operators for Ancestor-Descendant and Parent-Child relationships

• Consists of a root query and one or more child and descendant queries

Searching: Twig Query

Boolean

PhraseMUST

BooleanSHOULD

Page 28: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 28 http://lod2.eu

Creating Knowledge out of Interlinked Data

• Query operators for Ancestor-Descendant and Parent-Child relationships

• Consists of a root query and one or more child and descendant queries

• Can be nested to form complex tree structure

Searching: Twig Query

Boolean

PhraseMUST

TwigNOT

RangeMUST

BooleanSHOULD

Page 29: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 29 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Twig Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Twig Query Operator Root Query

Child Query

Page 30: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 30 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Twig Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 31: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 31 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Twig Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 32: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 32 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Twig Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 33: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 33 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Twig Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "twig" : { "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" } }] } }}'

name : Elasticsearch

funding_rounds

round_code : a

raised_amount : 10000000

funded_year : 2012

investments

name : Data Collective

type : financial-org

category_code : search

investments

name : ...

Page 34: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 34 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Proximity Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

name : Elasticsearch

funding_rounds

investments

name : Data Collective

type : financial-org

category_code : search

funding_rounds

investments

name : Index Ventures

type : financial-org

Order Constraint

33

2211

44

Proximity Constraint

Boolean Clauses

Page 35: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 35 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Proximity Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

name : Elasticsearch

funding_rounds

investments

name : Data Collective

type : financial-org

category_code : search

funding_rounds

investments

name : Index Ventures

type : financial-org

33

2211

44

Page 36: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 36 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Proximity Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

name : Elasticsearch

funding_rounds

investments

name : Data Collective

type : financial-org

category_code : search

funding_rounds

investments

name : Index Ventures

type : financial-org

33

2211

44

Page 37: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 37 http://lod2.eu

Creating Knowledge out of Interlinked Data

Searching: Proximity Query

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } }] } },{ "twig" : { "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } }] } }]}}}}'

name : Elasticsearch

funding_rounds

investments

name : Data Collective

type : financial-org

category_code : search

funding_rounds

investments

name : Index Ventures

type : financial-org

33

2211

44

Page 38: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 38 http://lod2.eu

Creating Knowledge out of Interlinked Data

Demo

Page 39: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 39 http://lod2.eu

Creating Knowledge out of Interlinked Data

• SIREn’s Key Feature– Dynamic/Schema-less Data Management– Nested Data– High performance and scalability– Powerful search operators– Elasticsearch/Solr integration

• Contact– http://sirendb.com– SindiceTech– [email protected]

Conclusion

Page 40: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 40 http://lod2.eu

Creating Knowledge out of Interlinked Data

Credits

Jingle R.E.M., Martin Kaltenböck, Florian Kondert

Coordination Thomas Thurner

Martin Kaltenböck

Moderation Martin Kaltenböck

Presented by Renaud Delbru, Harish Kumar

Page 41: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 41 http://lod2.eu

Creating Knowledge out of Interlinked Data

http://lod2.eu

Hope you enjoyed staying with us – if you need more detailed information, visit us at www.lod2.eu and let us know how we can improve to meet your expectations!

Don’t forget to register for our next webinar

20.12. 2011 - Virtuoso (Open Link Software) 24.01. 2012 - OntoWiki (University of Leipzig, Germany)

Have a great day and don’t forget ...

Page 42: LOD2 Webinar: SIREn

LOD2 Webinar . 24.06.2014 . Page 42 http://lod2.eu

Creating Knowledge out of Interlinked Data

http://lod2.eu