6
THE WEB OBSERVATORY DATA GATHERING WITH EUGENE SIOW & XIN WANG 29 JANUARY 2016

Data Gathering with The Web Observatory

Embed Size (px)

Citation preview

Page 1: Data Gathering with The Web Observatory

THE WEB OBSERVATORY DATA GATHERING WITH

EUGENE SIOW & XIN WANG

29 JANUARY 2016

Page 2: Data Gathering with The Web Observatory

WHAT IS THE WEB OBSERVATORY?

SEARCH + ACCESS

Page 3: Data Gathering with The Web Observatory

DATA | WO | APP OPEN

PRIVATE

NoSQL

SQL STREAMS

LINKED DATA

JS

PYTHON NODE

Page 4: Data Gathering with The Web Observatory

GATHERING DATA WITH SCRAPING

DATA ON WEBPAGES DATA I CAN USE transform ( )

Page 5: Data Gathering with The Web Observatory

THE PROCESS OF SCRAPING

INVESTIGATE THE STRUCTURE OF THE PAGE

CHECK IF THERE IS AN API {APPLICATION PROGRAMMING INTERFACE}

USE CHROME’S INSPECTOR OR FIREBUG

EXTRACT, TRANSFORM, LOAD WHAT IS YOUR DESIRED END FORMAT?

Page 6: Data Gathering with The Web Observatory

HANDS-ON RESOURCES

codepen.io/xgfd/pen/wMyQWb

github.com/eugenesiow/datathon2016/wiki

DATA-DRIVEN APPS USING THE WO

DATA GATHERING

webobservatory.soton.ac.uk THE SOTON WEB OBSERVATORY

BACKGROUNDS FROM THE HUBBLE SPACE TELESCOPE