Upload
moises-stonestreet
View
217
Download
3
Embed Size (px)
Citation preview
Open DataLinked Data
Big Data
Christopher Gutteridge
Linked Data? Open Data?
• Linked data – a technique for creating data which joins up with data from other data provider’s unique identifiers.
• Open data - publishing data with a license which permits re-use, akin to creative commons for creative arts and to 'open source' software.
• RDF - a way of structuring data as a list of facts and globally unique identities for things. Easy to merge lists of facts from multiple sources.
LINKS: http://is.gd/WmZfq1
URIs to identify everything• URIs look like a web address, but (sometimes) identify a real world thing such as
• Equipment– http://id.southampton.ac.uk/equipment/E0672
• Facilities– http://id.southampton.ac.uk/facility/F0041
• Campuses– http://id.southampton.ac.uk/site/1
• Organisations (and parts thereof)– http://id.southampton.ac.uk/– http://id.southampton.ac.uk/org/FP
LINKS: http://is.gd/WmZfq1
URL
URI
data.southampton.ac.uk
Data Source
Auth-ority
API
Website
AppData Source
Auth-ority
Open World
API
Website
App
Website
Data Source
Auth-ority
Some other Aggregator
Open World
API
Website
App
Website
Data Source
Auth-ority
Some other Aggregator
API
Website
App
Website
Data Source
Auth-ority
Some other Aggregator
Uniquip: equipment.data.ac.uk
• Equipment, Facilities, Services/Capabilities
Uniquip: equipment.data.ac.uk
• Equipment, Facilities, Services/Capabilities• Description• Location (City)• Contact Info (email, phone, URL)
Uniquip: equipment.data.ac.uk
• Equipment, Facilities, Services/Capabilities• Description• Location (City)• Contact Info (email, phone, URL) • URI (or organisation + ID)
Linking Equipment
DataPaper
Impact
How Open Data can Support Big (research) Data
• Open Datasets as Big Data Taxonomy• Aggregated Open Data becomes Big Data• Encourages formation of standards
"The greatest crisis facing us is not Russia, not the Atom Bomb, not corruption in government, not encroaching hunger, nor the morals of the young. It is a crisis in the organization and accessibility of human knowledge. We own an enormous "encyclopedia" - which isn't even arranged alphabetically.
Our "file cards" are spilled on the floor, nor were they ever in order. The answers we want may be buried somewhere in the heap, but it might take a lifetime to locate two already known facts, place them side by side and derive a third fact, the one we urgently need.
Call it the crisis of the Librarian.We need a new "specialist" who is not a specialist,
but a synthesist. We need a new science to be a perfect secretary to all other sciences."
- Robert A. Heinlein, 1950.