14
Publishing XBRL as Linked Open Data Roberto García, Rosa Gil GRIHO (HCI & Data Integration) Research Group Universitat de Lleida, Spain {rgarcia,rgil}@diei.udl.cat © 2009, http://rhizomik.net/~roberto

Publishing XBRL as Linked Open Data

Embed Size (px)

DESCRIPTION

The XML Business Reporting Language (XBRL) is a standard for business and financial information reporting. It is based on XML so instance documents based on XBRL, e.g. a quarterly report, are highly constrained by the XML document-oriented nature. This makes more difficult to perform queries that mix information from filings from different dates, companies, or accounting principles than with a formalism based on a graph model instead of a tree model. Semantic Web technologies provide a graph model that facilitates mashing-up different XBRL sources. We have put into practice this approach mapping the XBRL filings available from the SEC’s EDGAR program to Resource Description Framework (RDF) and the XML Schema taxonomies these filings are based on to Web Ontology Language (OWL). The resulting semantic metadata, though highly tied to the XML structure it is mapped from, benefits from Semantic Web technologies and tools in order to facilitate integration and cross-querying, even together with other parts of the Web of Linked Data.

Citation preview

Page 1: Publishing XBRL as Linked Open Data

Publishing XBRL as Linked Open Data

Roberto García, Rosa Gil

GRIHO (HCI & Data Integration) Research Group

Universitat de Lleida, Spain

{rgarcia,rgil}@diei.udl.cat

© 2009, http://rhizomik.net/~roberto

Page 2: Publishing XBRL as Linked Open Data

Introduction• Contribute to Web of Linked Open Data

• Public financial data available as XBRL– XML Business Reporting Language– E.g. SEC’s EDGAR voluntary1 program

• Reuse this data, map XBRL to RDF– Apply generic mapping: ReDeFer

• XSD2OWL, XML2RDF

• Publish LOD using Rhizomer

1 From 2009, 3 year phased-in schedule, by end 2011 all U.S. public companies plus 8,000 mutual funds

Page 3: Publishing XBRL as Linked Open Data

XBRL• Instance data

– Facts: e.g. “the long term debt is 350,000”, plus contextRef and unitRef

– Contexts: entity (company or individual), time, [scenario]

– Units: unit of measure– References: link to schema defining taxonomy

of facts

Publishing XBRL as Linked Open Data Linked Open Data Workshop – WWW’2009

Page 4: Publishing XBRL as Linked Open Data

XBRL XML to RDF• ReDeFer XML2RDF,

model XML tree using triples– xsd:element and xsd:atribute rdf:Property

– xbrli:id and xbrli:identifier rdf:Resource ID

– Other resources, anonymous

AsOf20061201_Consolidated_Unaudited

772500000rdf:value

-3

xbrli:decimals

USD

xbrli:unitRef

xbrli:period

CIK0000796343

xbrli:entity

2006-12-01xbrli:instant

usfr-pte:CashCashEquivalents

adbe-20080616

xbrli:contextRef

Rendered by RDF2SVG

Page 5: Publishing XBRL as Linked Open Data

XBRL XSDs to OWL• EDGAR Schemas: XBRL 2.1, US GAAP1,…• ReDeFer XSD2OWL

ReportType ItemType UnitType

ContextType

EntityType

PeriodType

ScenarioType

fact unitRef

contextRef

period

entity

scenario

value decimals

Literal Literal

XML Schema OWL

element | attributerdf:Propertyowl:DatatypePropertyowl:ObjectProperty

element@substitutionGroup rdfs:subPropertyOfelement@type rdfs:rangecomplexType owl:ClasscomplexType//element owl:Restrictionextension@base | restriction@base

rdfs:subClassOf

@maxOccurs, @minOccurs

owl:maxCardinality, owl:minCardinality

sequence, choice

owl:intersectionOf, owl:unionOf

Core classes and properties for XBRL Instance

XSD2OWL mappings

1Generally Accepted Accounting PrinciplesOntologies available from http://rhizomik.net/ontologies/bizontos

Page 6: Publishing XBRL as Linked Open Data

XML2RDF plus XSD2OWL• Resources typed with classes in ontologies

corresponding to their XSD complexTypes:

xbrli:monetaryItemType:

xbrli:contextType:AsOf20061201_Consolidated_Unaudited

772500000rdf:value

-3

xbrli:decimals

USD

xbrli:unitRef

xbrli:contextPeriodType:xbrli:period

xbrli:contextEntityType:CIK0000796343

xbrli:entity

2006-12-01xbrli:instant

usfr-pte:CashCashEquivalents

xbrli:xbrlType:adbe-20080616

xbrli:contextRef

Page 7: Publishing XBRL as Linked Open Data

Semantic XBRL• Dataset size

– April 2009: 1,34 million triples from 612 XBRL filings.

• Links to LODW:– Entities: companies in DBPedia, use name or

Central Index Key (CIK)– Units: e.g. USD

http://dbpedia.org/resource/United_States_dollar

Publishing XBRL as Linked Open Data Linked Open Data Workshop – WWW’2009

Page 8: Publishing XBRL as Linked Open Data

Architecture

XML2RDF

XSD2OWL

XBRL Schemas

.xml

XBRL filings

XBRL OntologiesReDeFer

Rhizomer

+1M triples

LOD

Linking

RDF2Form

XHTML+RDFa

RDF2HTML

SPARQL

edit

post

Rhizomer AJAX

get

.xsd

EDGARRSS Feed

download

EDGAR Standard

Taxonomies

.US

Publishing XBRL as Linked Open Data Linked Open Data Workshop – WWW’2009

Page 9: Publishing XBRL as Linked Open Data

Prototype Demo• Rhizomer: publish, query, browse, edit and

mashup (map, timeline,…) Linked Data:– Ex.1: get a context description from its URL

• Link, Capture

– Ex.2: get all mentions of entity CIK796343• Link, Capture

– Ex.3: all Investment[In]LeaseReceivable integrated from different filings

• Link, Capture

Publishing XBRL as Linked Open Data Linked Open Data Workshop – WWW’2009

Page 10: Publishing XBRL as Linked Open Data

Conclusions and Future Work• Straightforward mapping XBRL to LOD• Resulting RDF is too “XML-styled”…

…rules/CONSTRUCT to tailor to needs– E.g. facts as resources instead of properties

facilitate querying and browsing

• Semantic mappings might facilitate cross-querying financial data…– Across filings, companies, accounting

principles,…

Publishing XBRL as Linked Open Data Linked Open Data Workshop – WWW’2009

Page 11: Publishing XBRL as Linked Open Data

Thank you for your attention

Roberto García González

http://rhizomik.net/~roberto

Publishing XBRL as Linked Open Data Linked Open Data Workshop – WWW’2009

© 2009, http://rhizomik.net/~roberto

Page 12: Publishing XBRL as Linked Open Data

Get a context description from its URL

Back

Page 13: Publishing XBRL as Linked Open Data

Get all mentions of entity CIK796343

Back

Page 14: Publishing XBRL as Linked Open Data

Get all Investment[In]LeaseReceivable

Back