Virtuoso: Semantics that scales!


Presentation about OpenLink Virtuoso, an RDF database.


Øystein Isaksen, 27 October 2011

Virtuoso: Semantics that scales!

Universal Server

Multi-platform

• Virtuoso supports a range of 32- and 64-bit platforms, including Windows, UNIX, Linux, and Mac OS X.

Data in and out

• ODBC, JDBC, ADO.NET and OLE DB
• Conductor, the administration interface
• ISQL (Interactive SQL)
• SPARQL
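As an illustration of the SPARQL route in the list above, the sketch below builds a SPARQL Protocol GET request URL. It only constructs the URL and does not contact a server. The endpoint address is an assumption (a local Virtuoso instance on its usual default port), and `sparql_get_url` is a hypothetical helper name, not part of any Virtuoso client library.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumption: a local Virtuoso instance on its usual default port; adjust as needed.
ENDPOINT = "http://localhost:8890/sparql"


def sparql_get_url(query, endpoint=ENDPOINT):
    """Return a SPARQL Protocol GET URL carrying the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


query = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"
url = sparql_get_url(query)

# Round-trip check: decoding the URL gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(url)
print(decoded == query)
```

Any HTTP client can then fetch the URL. The result format is normally negotiated through the `Accept` header, which this sketch leaves out.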

RDF triple store (actually a quad store)
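The difference between a triple and a quad can be sketched in a few lines: a quad is a triple plus the named graph it belongs to. The `Quad` type, the example IRIs, and the `triples_in` helper below are all hypothetical and say nothing about how Virtuoso stores data internally.

```python
from typing import NamedTuple


class Quad(NamedTuple):
    graph: str       # the named graph the statement belongs to
    subject: str
    predicate: str
    obj: str


# Hypothetical example data spread over two named graphs.
quads = [
    Quad("http://example.org/g/customers", "http://example.org/c/1",
         "http://xmlns.com/foaf/0.1/name", "Kari"),
    Quad("http://example.org/g/meters", "http://example.org/m/7",
         "http://example.org/vocab#owner", "http://example.org/c/1"),
]


def triples_in(graph, data):
    """Return the plain triples stored in one named graph."""
    return [(q.subject, q.predicate, q.obj) for q in data if q.graph == graph]


print(triples_in("http://example.org/g/customers", quads))
```

Keeping the graph as a fourth column is what lets one store hold data from several sources side by side and still query or replace each source separately.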

Tools

Administration interface

Search

SPARQL interface

ISQL

Much more

The Sesam project at Hafslund

Why did we choose Virtuoso?

Performance

SPARQL benchmark

Number of Triples                100M           200M
Number of Products               284,826        570,000
Number of Producers              5,618          11,240
Number of Product Features       47,884         94,259
Number of Product Types          2,011          3,949
Number of Vendors                2,896          5,758
Number of Offers                 5,696,520      11,400,000
Number of Reviewers              146,093        292,095
Number of Reviews                2,848,260      5,700,000
Total Number of Instances        9,034,108      18,077,301
Exact Total Number of Triples    100,000,748    200,031,975
File Size Turtle (unzipped)      8.7 GB         18 GB

Queries per second

            4store    BigData    BigOwlim    TDB      Virtuoso
Query 1     117.6     64.2       112.5       75.1     200.7
Query 2     49.0      33.6       159.3       41.0     71.1
Query 3     102.4     12.4       125.0       82.2     201.4
Query 4     43.4      38.4       97.9        62.1     103.9
Query 5     7.8       2.3        3.0         2.0      15.2
Query 6
Query 7     41.3      31.3       32.6        22.6     24.9
Query 8     49.1      48.5       38.0        24.4     54.0
Query 9     233.0     54.8       141.8       124.6    379.1
Query 10    49.2      61.6       48.5        33.5     113.7
Query 11    145.3     43.8       51.3        30.0     73.6
Query 12    46.5      54.8       65.4        33.3     68.0

Queries per Second by Dataset Size and Query: Running 500 query mixes against the different stores led to the following query throughput for each type of query over all 500 runs (in queries per second). The best performance figure for each query is set in bold in the tables.

100 Million triples dataset.
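To read the throughput figures above at a glance, the following sketch counts how often each store has the highest queries-per-second value. The numbers are copied from the slide. Query 6 is left out because the slide gives no figures for it. The names `QPS`, `fastest`, and `wins` are illustrative only.

```python
from collections import Counter

STORES = ["4store", "BigData", "BigOwlim", "TDB", "Virtuoso"]

# Queries per second, one row per query, columns in the order of STORES.
QPS = {
    1: [117.6, 64.2, 112.5, 75.1, 200.7],
    2: [49.0, 33.6, 159.3, 41.0, 71.1],
    3: [102.4, 12.4, 125.0, 82.2, 201.4],
    4: [43.4, 38.4, 97.9, 62.1, 103.9],
    5: [7.8, 2.3, 3.0, 2.0, 15.2],
    7: [41.3, 31.3, 32.6, 22.6, 24.9],
    8: [49.1, 48.5, 38.0, 24.4, 54.0],
    9: [233.0, 54.8, 141.8, 124.6, 379.1],
    10: [49.2, 61.6, 48.5, 33.5, 113.7],
    11: [145.3, 43.8, 51.3, 30.0, 73.6],
    12: [46.5, 54.8, 65.4, 33.3, 68.0],
}


def fastest(row):
    """Return the name of the store with the highest throughput in a row."""
    return STORES[row.index(max(row))]


winners = {query: fastest(row) for query, row in QPS.items()}
wins = Counter(winners.values())

for store, count in wins.most_common():
    print(f"{store}: fastest on {count} of {len(QPS)} queries")
```

On these figures Virtuoso is fastest on 8 of the 11 queries with data, 4store on 2 (queries 7 and 11), and BigOwlim on 1 (query 2).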

Our own test

Max User Load                  25
Tests/Sec                      153
Tests Failed                   0
Avg. Test Time (sec)           0.12
Transactions/Sec               0
Avg. Transaction Time (sec)    0
Pages/Sec                      153
Avg. Page Time (sec)           0.11
Requests/Sec                   153
Requests Failed                0
Requests Cached Percentage     0
Avg. Response Time (sec)       0.11
Avg. Content Length (bytes)    94.0

Virtuoso (RDF)

Functionality

What do we use it for?

Weblogic ESB

Hafslund architecture

Architecture diagram. Labels: CMIS client; SDshare client and server; SPARQL; Public 360 and SharePoint; IFS; CMIS server; metadata extension; Virtuoso RDF server; OKS; JDBC; Office client; scanning; 360 connector; search engine; OpenSearch; CMIS-360 adapter; IFS document connector; API.

Exposes data from line-of-business systems

Virtuoso (UMIC)

Diagram labels: Virtuoso; SPARQL; SPARUL; SDshare client; SDshare.
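SPARUL (SPARQL/Update) is the write-side counterpart to SPARQL queries. The sketch below builds one update statement as a string, using the standard SPARQL 1.1 Update `INSERT DATA` form. This is an assumption about syntax: the SPARUL dialect accepted by the Virtuoso version used in this talk may differ. The helper name `insert_data` and the IRIs are hypothetical.

```python
def insert_data(graph, subject, predicate, literal):
    """Build a SPARQL 1.1 Update statement adding one literal-valued triple to a graph."""
    # Escape backslashes and double quotes so the literal stays well-formed.
    escaped = literal.replace("\\", "\\\\").replace('"', '\\"')
    return (
        f'INSERT DATA {{ GRAPH <{graph}> '
        f'{{ <{subject}> <{predicate}> "{escaped}" . }} }}'
    )


statement = insert_data(
    "http://example.org/g/customers",
    "http://example.org/c/1",
    "http://xmlns.com/foaf/0.1/name",
    'Kari "K" Nordmann',
)
print(statement)
```

The sketch only handles plain string literals. IRIs as objects, typed literals, and language tags would need their own formatting.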

SDshare: what is it?

Diagram labels: SDshare client; System B; System A; SDshare client.
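SDshare publishes changes as Atom feeds that a client polls to keep one system in sync with another. The sketch below covers only the polling step and uses standard Atom elements: given a feed document and the time of the last sync, it returns the links of entries changed since then. The feed content, its URLs, and the function name `changed_since` are made up for illustration, and SDshare-specific elements are deliberately left out.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone

ATOM = "{http://www.w3.org/2005/Atom}"

# Hypothetical feed with two change entries.
FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Changes (hypothetical)</title>
  <entry>
    <id>urn:example:change:1</id>
    <updated>2011-10-01T10:00:00+00:00</updated>
    <link href="http://example.org/fragments/1"/>
  </entry>
  <entry>
    <id>urn:example:change:2</id>
    <updated>2011-10-20T10:00:00+00:00</updated>
    <link href="http://example.org/fragments/2"/>
  </entry>
</feed>"""


def changed_since(feed_xml, last_sync):
    """Return the link of every feed entry updated after last_sync."""
    root = ET.fromstring(feed_xml)
    links = []
    for entry in root.findall(ATOM + "entry"):
        updated = datetime.fromisoformat(entry.findtext(ATOM + "updated"))
        if updated > last_sync:
            links.append(entry.find(ATOM + "link").get("href"))
    return links


last_sync = datetime(2011, 10, 10, tzinfo=timezone.utc)
print(changed_since(FEED, last_sync))
```

A real client would then fetch each returned link and apply the data it contains to the target system, remembering the newest `updated` value for the next poll.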

What happens BEFORE it becomes SDshare?

Bus

Talk to the people responsible for the line-of-business system

Web services

Web service client SDshare

What's in it for me? ... "One of the biggest challenges in integration is the willingness of system owners to make the necessary adaptations."

Virtuoso

SQL table

Our approach

DB (Oracle)

SQL linked table → RDF view → SDshare
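Conceptually, an RDF view presents the rows of a relational table as triples without copying the data. The sketch below imitates that mapping in plain Python: one subject per row, one predicate per column. It is not Virtuoso's RDF view definition language, and the table contents, the IRI patterns, and the name `row_to_triples` are assumptions for illustration.

```python
# Hypothetical rows, as they might come from a linked SQL table.
ROWS = [
    {"id": 1, "name": "Kari", "city": "Oslo"},
    {"id": 2, "name": "Ola", "city": "Bergen"},
]

BASE = "http://example.org/customer/"   # assumed subject IRI pattern
VOCAB = "http://example.org/vocab#"     # assumed predicate namespace


def row_to_triples(row):
    """Map one row to triples: the key column forms the subject, other columns become predicates."""
    subject = f"{BASE}{row['id']}"
    return [
        (subject, VOCAB + column, str(value))
        for column, value in row.items()
        if column != "id"
    ]


triples = [triple for row in ROWS for triple in row_to_triples(row)]
for triple in triples:
    print(triple)
```

Once the data is visible as triples, the same SDshare and SPARQL machinery can be used for it as for natively stored RDF, which is the point of the pipeline on this slide.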

This sounds veeeery sensible…

Experiences so far

Test it yourself

Open source

Into the clouds

.NET support, dotNetRDF

Where is it used?

Questions?
