Upload
bertrand-delacretaz
View
1.755
Download
5
Tags:
Embed Size (px)
DESCRIPTION
Introduction to the IKS FISE RESTful semantic engine - http://wiki.iks-project.eu/index.php/FISE , for the first IKS Early Adopters Workshop, Salzburg, June 22nd, 2010 - http://wiki.iks-project.eu/index.php/Workshops/EAworkshop
Citation preview
IKS – Interactive Knowledge Stack for Semantic Content Management Systems
veryveryalpha!
IKS Early Adopters WorkshopSalzburg, June 22-23, 2010
Introducing FISE, the IKS RESTful semantic engine
Bertrand Delacrétaz Senior Developer, Day Software (www.day.com)
Member, Apache Software Foundation (apache.org)
NOTa semantic
guru!
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE:Furtwangen
IKSSemanticEngine
(«phase»)What?does FISE do and
How?
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE server and plug-ins
FISE server (OSGi-based)
FISE EnhancementEngine API
Autotagging
Named EntityRecognition
FISE EnhancementEngine APIPlug-in APIs for more engines...
Storage (in-memory/Clerezza)
SPARL queries as well
RESTfulHTTP
interfacePOST content
GET enhanced contentGET query results
Workingcode + APIs
Queries (Apache Clerezza)
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE demo: SACE client
1
2b
3
4
5
SACEclient
FISEserver
http
2a
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
SACE client GETs RDF from FISE<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:j.0="http://purl.org/dc/terms/" xmlns:j.1="http://rdfs.org/sioc/ns#" xmlns:j.2="http://fise.iks-project.eu/ontology/" > <rdf:Description rdf:about="urn:enhancement-ce190f1b-698c-5038-3057-88da26c3f722"> <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/Enhancement"/> ... <j.2:extracted-from rdf:resource="urn:content-item-sha1-8559dbc3ce78b24"/> <j.0:creator rdf:datatype="http://www.w3.org/2001/XMLSchema#string">eu.iksproject....EntityMentionEnhancementEngine</j.0:creator>
<j.2:entity-reference rdf:resource="http://dbpedia.org/resource/Berkeley%2C_California"/> <j.2:entity-label rdf:datatype="http://www.w3.org/2001/XMLSchema#string">Berkeley, California</j.2:entity-label> <j.2:confidence rdf:datatype="http://www.w3.org/2001/XMLSchema#double">3.4116923809051514</j.2:confidence> <j.2:entity-type rdf:resource="http://www.w3.org/2002/07/owl#Thing"/> <j.2:entity-type rdf:resource="http://dbpedia.org/ontology/Place"/> <j.2:entity-type rdf:resource="http://dbpedia.org/ontology/PopulatedPlace"/> <j.2:entity-type rdf:resource="http://dbpedia.org/ontology/Area"/> </rdf:Description>
The EntityMentionEnhancementEngine thinks that content item is related to the «Berkeley, California» Place from dbpedia.
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
The FISE scenario
FISE server (OSGi-based)
RESTfulHTTP
interface
Enhancement Engines
1. POST raw content
2. Enhancement engine plug-ins
3. GET enhanced content
4. Add moreengines...
FISE extension APIs
HTTPclient
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
SPARQL endpointpowered by Apache Clerezza
FISE server (OSGi-based)
httpGET
<sparql xmlns="http://www.w3.org/2005/sparql-results#">...<results><result><binding name="content"> <uri>/11</uri></binding><binding name="engine"> <literal datatype="http://www.w3.org/2001/XMLSchema#string"> eu.iksproject.fise.EntityMentionEnhancementEngine</literal></binding><binding name="extraction_time"> <literal datatype="http://www.w3.org/2001/XMLSchema#dateTime"> 2010-06-21T23:22:15.802+02:00</literal></binding><binding name="enhancement"> <uri>urn:enhancement-d7c755a7-09d0-8a1f-5a15-ff177c1d3593</uri></binding></result><result>
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
the
Architectureof FISE
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE vision: RESTful semantic engineadd-on for existing CMS
Persistence (CRX, Apache Jackrabbit)
JSR-170
Apache Sling Component Framework
Javascript Widgets
Rendering Components
Editing Dialogs
Workflow Engine
CQ5 web contentmanager
CQ5 Applications (Day or custom)
JS API
HTTPA
pache Felix / OSG
i framew
ork
CRX Explorer
OSGiConsole
HTTP
WebDAV
Semantic EngineConcept extractionIntelligent taggingSemantic search
Day’s cq5 platform, as an example CMS
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE: minimal core with plugins
RESTful HTTP front-end
EnhancementsJob Manager
...
EnhancementEngine plugin
EnhancementEngine plugin
EnhancementsStore plugin
Query Engine plugin
Reasoning plugins?
Java
VM
OS
Gi f
ram
ewor
k
indicates plugins
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
OSGi for modularization
Maven plugins and OSGi console provided by Apache Felix.
Launcher provided by Apache Sling for standalone server.
Semantic components provided by Apache Clerezza
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
the FISE
Services APIexcerpts
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE ContentItem API
FIXME
Adapter patterninstead of getInputStream?Helpers to add Enhancements to the Graph?
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE EnhancementEngine API
FIXME
Collaboration between engines: Android-like intents, event-based tuple space?
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
current FISE
EnhancementEngines
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
FISE EnhancementEngine pluginsearly June 2010
FIXME
Fat bundles, need to optimize dependencies.
NamedEntityExtractionEnhancementEngine:Uses OpenNLP sentence detector and name finder to detect persons, places and organisations.
EntityMentionEnhancementEngine:Creates EntityAnnotations using an index of DBpedia entities.
LocationEnhancementEngine:Creates fise:EntityAnnotations based on the http://geonames.org dataset.
Tagging using the Zemanta API:Simple wrapper for Zemanta
Language identification:Identifies a text as: German(de), English(en), ...., Slovenian(sl), Danish(da), Hungarian(hu)
Metaxa text extractor:
Extracts metadata and text from document formats like PDF, Word, etc.
SemanticEngine
Contest?
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
EnhancementEngines collaboration
FIXME
Currently based on engine ordering.Need a better mechanism, intents, tuple space etc.
Entity extraction dbpedia entities Geonames locations
content entities dbpedia geonames
Engines use metadata from other engines as input.Iteratively enhance incoming content.
Resulting output:
data flow
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
GettingStarted
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
Getting started with FISEStart at http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE».Binary downloads are available there.Build from source code, not for the faint of heart currently (Maven snapshots!)
java -Xmx512M -jar!iks-fise-server-rNNN.jar
(add -h at the end to see more options)
Then openhttp://localhost:8080
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
What’snext?
See http://wiki.iks-project.eu/index.php/FISE or Google «IKS FISE»
Conclusions...and What’s Next?
Please
Let us know if FISE would add value to your CMS!
Our main goal is to build a community of FISE users, which provides feedback on semantic features that add value to existing CMS. And contributes to FISE!
Ideas from the full IKS stack will percolate into FISE based on actual use cases from the field.
Let’s keep FISE simple, understandable and robust, while allowing our researcher’s «wild ideas» plugins to be used for more experimental work.
Project Lead and Coordination
Salzburg Research
Wernher BehrendtSalzburg Research Forschungsgesellschaft m.b.H.
Jakob Haringer Straße 5/3 | 5020 Salzburg, Austria
T +43.662.2288-409 | F +43.662.2288-222
www.salzburgresearch.at
Deutsches Forschungsinstitut für Künstliche Intelligenz (DFKI)
Universität St. Gallen
Consiglio Nationale delle Ricerche (CNR)
Software Quality LabUnversität Paderborn
Software Research and Development Consultancy Ltd (SRDC)
Hochschule Furtwangen
Nuxeo Sa. Alkacon Software GmbH
TXT Polymedia Pisano Holding GmbH
Nemein Oy Day Software AG
The IKS Consortium