47
I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Embed Size (px)

Citation preview

Page 1: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

I’ve found the data; it’s free and open access. Now what?

Gilberto CâmaraNational Institute for Space Research (INPE)Brazil

Page 2: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Geospatial data catalogue

Source: [Bai and Di, 2011]

Page 3: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

The hard-wired map metaphor

Cantino planisphere (1502)

Page 4: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Map metaphors live in GIS

GeospatialDatabase

Desktop GIS Web service

Page 5: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Birds do it… bees do it… even educated fleas do it… Let’s do it…

Page 6: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Distribution Model Algorithm Distribution map

Tem

pera

ture

Precipitation

Environmental data

Ecological niche modelling

Page 7: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Speciesinfo

Speciesinfo Precipitation

Soil

Temperature

Environmental data

openModeller

Bioclim NeuralNetworks

GARP

Specimens

Modelling algorithmsopenopenModellerModeller

Page 8: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Natural disasters

Page 9: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Risk AnalysesRisk Analyses

Analysis

Page 10: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

On-line data feedOn-line data feed

ModelsSatellite/RadarDCP

Rain totalFixed time and irregular – alertPoint dataOne file per DCP

Grid 4kmTotal rain 1hTotal rain 24hCurrent (mm/h)Binary file

ETA 40, 20, 5 KmEnsemble 40 KmTotal rain 72h72 filesASCII grid file

Page 11: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Natural Disasters Monitoring and Alert System

Page 12: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Até 10%

10 - 20%

20 – 30%

30 – 40%

40 – 50%

50 – 60%

60 – 70%

70 – 80%

80 – 90%

90 – 100%

Amazonia (4.000.000 km2 = size of Europe)

Deforestation in Amazonia

Page 13: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Daily warnings of newly deforested large areas

Real-time Deforestation Monitoring

Page 14: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

166-112

116-113

116-112

30 Tb of data500.000 lines of code

150 man/years of software dev200 man/years of interpreters

How much it takes to survey Amazonia?

Page 15: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Data Access Hitting a Wall Current science practice based on data download

How do you download a petabyte?

Page 16: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Data Access Hitting a Wall Current science practice based on data download

How do you download a petabyte?You don’t! Move the software to the archive

Page 17: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Virtual Observatory

17

“If data is online, the internet is the world’s best telescope” (Jim Gray)

Page 18: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

How many clouds do we need?

Page 19: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

19

What happened here in the last 10 years?

source: INPE

< Corn > sugarcane ->

Page 20: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Are biofuels replacing food production in Brazil?

24% 26%30%

37% 41% 38%

26%

12%1% 1%

3% 3%3%

3%

7%17%

48%85%

98% 98%

1% 1%1%

1%

1%

1%

71% 70%65%

59%51%

44%

26%

3%1%

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

2000 2001 2002 2003 2004 2005 2006 2007 2008 2009

Área Agrícola Cana-de-açúcar Citrus Pastagem Vegetação Arbórea

Page 21: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Are biofuels replacing food production in Brazil?

24% 26%30%

37% 41% 38%

26%

12%1% 1%

3% 3%3%

3%

7%17%

48%85%

98% 98%

1% 1%1%

1%

1%

1%

71% 70%65%

59%51%

44%

26%

3%1%

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

2000 2001 2002 2003 2004 2005 2006 2007 2008 2009

Área Agrícola Cana-de-açúcar Citrus Pastagem Vegetação Arbórea

3 Tb of data behind this!

Page 22: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

How much processing should be in the cloud?

Standard API? WPS?

Page 23: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

23

Could this analysis be done in the cloud?

source: INPE

< Corn > sugarcane ->

Page 24: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Data chain in Earth System Sciencefonte: NASA

Page 25: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

source: USGS

Getting to the Data

Requires solving the spatial semantics problem

Tentative solutions catalogues, metadata, SDIs, ontologies, web services, semantic reference

systems, linked open-data, ....

Page 26: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Communicating location is easy

Deforestation hotspots in Amazonia

Page 27: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Weather

source: WMO

11,000 land stations (3000 automated)900 radiosondes, 3000 aircraft 6000 ships, 1300 buoys5 polar, 6 geostationary satellites

Communicating about data is feasible

Page 28: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Communicating concepts is hard

Image source: WMO

vulnerability? climate change? poverty?

Page 29: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

degradation

We’re bad at representing meaning

deforestation? degradation? disturbance?

Communicating concepts is hard

Page 30: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

When did the Aral Sea reach the tipping point?

Communicating change is very hard

Page 31: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil
Page 32: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Objects exist, events occur (mount Etna 2002 eruption)

Page 33: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

Observations allow us to get the measure of external reality

Page 34: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

WMO’s global observing system

Page 35: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

WMO GRIB: simple and cleanCode Parameter Units.052 Relative humidity % 053 Humidity mixing ratio kg/kg 054 Precipitable water kg/m2 055 Vapour pressure Pa 056 Saturation deficit Pa 057 Evaporation kg/m2 058 Cloud Ice kg/m2 059 Precipitation rate kg/m2/s 060 Thunderstorm probability % 061 Total precipitation kg/m2 076 Cloud water k g/m2 ..

Page 36: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

When did the large flood occur in Angra?

Page 37: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

When did the large flood occur in Angra? When precipitation was > 10mm/hour for 5 hours

Coverage set (hourly precipitation grid)

Cover change set (precipitation > 10

mm/hour)

Page 38: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

When did the large flood occur in Angra?

CoverageSet p1 (“Precipitation”).

CoverChangeSet s1 = extract (p1 > 10, time1, time2)

TimeSeries t1 = intersect (s1, geom (“Angra”)

Page 39: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

How many walruses reached Baffin island?

Page 40: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

How many walruses reached Baffin island? Those whose trajectories touched Baffin isld

moving objects

trajectories

Page 41: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

How many walruses reached Baffin island?

MovingObjectSet m1(“walruses”)

Trajectories t1= extract(m1,time1,time2)

Trajectories t2 = reach(t1, geom (“Baffin”))

Page 42: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

When was this area converted from food to biofuel production?

Coverage set (remote sensing

images)

Time Series (vegetation

index)

Page 43: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

When was this area converted from food to biofuel production? When the vegetation index peaked once a year.

Coverage set (remote sensing

images)

Time Series (vegetation

index)

Page 44: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

When was this area converted from food to biofuel production?

CoverageSet c1 (“Cerrado”).

TimeSeries ts1 = extract (c1, “VegIndex”)

for year = y1, yn do

time1 = year*52 + 1

time2 = time1 + 52

TimeSeries t2 = onepeak(ts1, time1, time2)

Time t1 = first (t2)

Page 45: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

A new kind of geospatial analysis engine?

Page 46: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

TerraLib: spatio-temporal database as a basis for innovation

Visualization (TerraView)

Spatio-temporalDatabase (TerraLib)

Modelling (TerraME)

Data Mining(GeoDMA)Statistics (aRT)

Page 47: I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil

We need a new generation of GI appliancesConnect data brokering, sources, analysis We need many clouds with remote processingDescribe observations, not eventsAllow users to process the data

Conclusions