29
AQ Science Problems Complex Needs: Multiple datasets Multiple Tools DIVERSE Community of People Underlying data infrastructure is being developed to accommodate data flow, still need for higher level information transfer

AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Embed Size (px)

Citation preview

Page 1: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

AQ Science Problems Complex

• Needs: – Multiple datasets– Multiple Tools – DIVERSE Community of People

• Underlying data infrastructure is being developed to accommodate data flow, still need for higher level information transfer

Page 2: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Web 1.0 Web 2.0

• web changing, making it easier to connect• Architecture of Participation - data/information (links, pictures,

videos, blogs) continuously popping up • Because it is on the internet data can be harvested and reused

(user-centric view) in ways not anticipated by the provider• Because it is a network can collaborate with people you don’t

even know

Page 3: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Harvested Resources

2000 Blogs on Florida Smoke

Smoke

Online Picture Sharing

215 Photos

Online Video Sharing 100 videos

Page 4: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Additional Value-added

• Low level of collaboration is sharing/harvesting resources• Next step is to add additional context, create a product

(analysis, report, proposal…)• Web as a platform provides open space to form a workgroup• Groups may be permanent (formal research groups) or ad hoc,

project specific groups.

Page 5: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Tools that enable these groups

• Allow for collecting and sharing artifacts – common procedures, tags

• Sharing a virtual workspace – Web is read/write now any page is editable with an edit button

• Opens the process up to allow unanticipated group members at any point in time to join and contribute

Page 6: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

DataFedwiki

• Known group – our research group that uses it• Provides a place to stash ideas during discussions, share with

members not present and remember/re-use later• Archives presentations, with additional metadata like notes

made when preparing the presentation or other pieces that also were included (KML, screencast, links etc)

Page 7: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

GEOSS AIP Workspace

• Ad hoc group formed around common interest in having input into the Air Quality Scenario CFP

• Collectively wrote the CFP • Captured key dialogue and discussion pertaining to the

scenario development… Key in creating a reusable scenario is having archive of process

Page 8: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Data Summit/ Community AQ Data System

• Invited group formed pre-Summit in order to profile the existing data systems

• Meeting activities were captured and workgroup/space evoloved

• Now Use wiki to capture group decisions, discussions, documents

Page 9: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

CF Naming Conventions

• Proposed names, discussed, modified and then submitted names

• Because it was open, we have pointed others to these pages as a process for creating standard names.

Page 10: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Sustainable Air Quality Class Workspace

• The class harvested resources, gathered data, and wrote a class report using the wiki;

• Their workspaces were different from the others described, because the output on each space was a final report. The history of the page, shows the evolution that each group took as they gathered data, wrote, and modified throughout the semester.

Page 11: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Benefits of the workspace

• Collect distributed content and provide context. – Currently, there is flickr for photos, del.icio.us for links provide URL

for metadata about photo or link – workspace provides URL for a project.

• Workspaces can be easily set up and evolve for the group needs

• Allow new users to easily be caught up to speed in as little or as much detail

• Provides rich archive of discussion, resources and process the group took as well as the outcome

• Because it is web-based unanticipated others may be able to use some part of the work.

Page 12: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Mashup of Tools through Interoperability Standards RSS, RDF, KML and XSL

E M Robinson, M Kieffer, S Kovacs, S R Falke, R B HusarWashington University, St. Louis

Fall AGU, Dec 13, 2007, San Fancisco

Page 13: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Information Overload

• Science problems are becoming increasingly complex • Information comes from both inside and outside the

organization (sometimes from people we don’t know)• Web is the platform

Page 14: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

The Information Interoperability Stack

Page 15: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Web 1.0 -> Web 2.0 Transformation

• The Web is being transformed: It is becoming more participatory

• Its content is increasingly generated and distributed by individuals

• See the explosive growth of wikies, picture-sharing, blogs, Facebook

• This architectural, technological and cultural change is Web 2.0

• Web 2.O is good for Earth Science community since it allows

– Better harvesting of current knowledge– Collaborative creation new knowledge.

Page 16: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Distributed Web Content

Distributed Web Content

Distributed Web Content

Workspace

Discussion

Knowledege is created – reports, proposals…

Page 17: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Connectivity of Humans

• Sharing specific artifacts – Pictures– Links– Videos– Blogs

• Shared workspace

Page 18: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Sharing Artifacts - May 07 Georgia/Florida Fires:

Community-Supplied Observations

Google and Technorati blog seaches yielded entries on GA Smoke.

. Smoke images, were also found searching Flickr and Google

Searching and pruning user-contributed Internet content yielded rich, but qualitative description of the May 07 Georgia Smoke Event.

Videos of smoke were found on YouTube

Visually pruned blogs, videos and images were bookmarked and tagged fore later analysis

Page 19: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Connectivity of Machines

• Web services • Service Orchestration • Mash-ups

Page 20: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Community Data Sharing - ‘DataSpaces’

Catalog - Find Dataset

Describe Dataset

Discuss Dataset

Earth Science Information Partners (ESIP)

• Semantic Wiki: Structured (RDF and Unstructured Content

• Open, Standard Matadata - RDF

• Ready for Export/Harvesting by Registries, Catalogs

Page 21: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Communal Event Analysis Southern California Fire Smoke

Approach (Blogs, Flickr, YOuTube):

Use app to perform coarse filtering Controlled tagging in del.icio.usRSS feed from del.icio.us

Page 22: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Wiki as a workspace

• Examples: Wikis, google groups, myspace, facebook ... • Collaboratively write and discuss

– Versioning/archive – open

• User-defined layout/Structure – Relational - tag-based, not hierarchical folders

• Organize different kinds of content - wiki can harness and integrate other services but it's up to user to assemble the parts in an innovative way - Situational applications – Small pieces loosely joined – Mash through RSS feeds or URL – upload docs – creating "records" on wiki pages

Page 23: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Wiki as a workspace

• Organize/collect distributed material and provide context for a topic

• Collaboratively create a document (report, proposal, presentation etc…)

• Discuss issues relating to topic• Act as an archive for the topic and group interactions

surrounding it. • The wiki platform is conducive for this because it allows

material distributed all over the web to be virtually 'mashed' together in one place through RSS feeds or iframe tags and elaborated on with relevant context and discussion. Because all of these elements are brought together in a workspace, the critical community knowledge is often captured as it is created.

Page 24: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Class?

• Used as class website with Syllabus/material • Answered questions about class on the wiki• Students organized and carried out their class project on the

wiki. • Established a set of key resources for next year’s class (Re-

usable)

Page 25: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Summary

• The participatory web architecture and technologies are here

• Standards are emerging for sharing previously unstructured data

• The entry resistance to any particular Web 2.0 tool is rather low

• However, many cultural, legal and other barriers remain

• The challenge is to learn how to apply these tools for Earth Science

Thank You!

Page 26: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Static versus Dynamic Research Projects

• Email only• Powerpoint slides• Pdf reports• Sharing/publication only of final

results

• Email + shared web workspaces

• Screencasts, WebEx• Multi-author content creation• Sharing/publication of workflow

(data to information)

Static Dynamic

Page 27: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

Data Sources

• Observations & Models– Surface– Satellite– Forecasts

• News• Blogs (Technorati)• Photos (Flickr) • Links (Del.icio.us)• Videos (YouTube)• What you collect and generate

Page 28: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

The Information Interoperability Stack

Page 29: AQ Science Problems Complex Needs: –Multiple datasets –Multiple Tools –DIVERSE Community of People Underlying data infrastructure is being developed to

OGC WCS Data Access Protocol

GEOSS Provides SOA for Coupling for Autonomous Nodes Facilitates Publishing, Finding and Accessing Data

Emerging Air Quality Data Flow Network