
2008-04-15 EGU Mtg Vienna Tagging


Harmonization and Integration of Semi-Structured Data through Wikis and Controlled Tagging

E.M. Robinson, R.B. Husar

Washington University, St. Louis, Missouri

Data System Profiles

Data System

Semantic Tags

Multiple Wiki Views

Tag metadata

Background

Community Data Sharing - ‘DataSpaces’

Catalog – Find Dataset

Dataset

Discuss

Reuse Meta

Multiple Views

DataSpaces

Register Metadata in DataSpaces

Communal Event Analysis: Southern California Fire Smoke

Given the high density and short response time of user-generated content about air pollution events, it is said that the Earth has now acquired a "skin" for detecting changes in the environment.

Acknowledgements: Stefan Falke, Ed Fialkowski, Mark Kieffer, Sandor Kovacs, Earth Science Information Partners wiki community

Wiki

• Wikis originally used just for collaborative writing
• Features:
– Editable by web users
– Tags
– Discussion pages
– Versioning
• Now they are dynamic workspaces, able to embed web objects from disparate sources
• Add additional context, facilitate collaborative analysis
• Allow two-way transfer of knowledge

Discuss

Edit

View

Collaborate

Controlled Tag-based Mediation

• Users can be mediators of web-based content by adding a unique controlled tag (or set of tags) in two ways:
– Use Del.icio.us to homogenize the heterogeneous objects
– Create wiki page as the web object. Add semantic tags.
• Create wiki page which harvests queries and adds context to create emergent, reusable knowledge
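The mediation step above can be sketched in a few lines of Python. This is only an illustration, not the poster's implementation: the `WebObject` record, the `make_event_tag` naming convention (date, region, topic, inferred from the tag 071022SoCalSmoke), and the example URLs are all assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class WebObject:
    """Hypothetical record for a tagged web resource (bookmark, photo, wiki page)."""
    url: str
    source: str
    tags: set = field(default_factory=set)


def make_event_tag(yymmdd: str, region: str, topic: str) -> str:
    """Build a controlled tag. The date+region+topic pattern is an assumed convention."""
    return f"{yymmdd}{region}{topic}"


def harvest(objects, controlled_tag):
    """Return the objects that carry the controlled tag, ignoring letter case."""
    wanted = controlled_tag.lower()
    return [o for o in objects if wanted in {t.lower() for t in o.tags}]


tag = make_event_tag("071022", "SoCal", "Smoke")

# Placeholder objects from different sources; only the shared tag links them.
objects = [
    WebObject("http://example.org/satellite-image", "bookmark", {tag, "satellite"}),
    WebObject("http://example.org/fire-photo", "photo", {"071022socalsmoke"}),
    WebObject("http://example.org/unrelated", "bookmark", {"ozone"}),
]

harvested = harvest(objects, tag)
for obj in harvested:
    print(obj.source, obj.url)
```

The point of the sketch is that the objects need no shared schema: a single agreed tag is enough to pull them together for later annotation.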

Controlled Tag-based Connectivity

Abstract: The architectural, technological and cultural transformation of the Internet, commonly referred to as Web 2.0, is good news for the Earth Science community since it offers new possibilities for sharing and harvesting community-provided content like photos, links and blogs as well as collaboratively creating new things.

One key feature of all of these new software tools is the end-user's ability to add tags, which adds value by extending the metadata of the particular object. The semantic uniformity of internet resources can be improved by controlled tagging, which applies a consistent namespace and consistent tag combinations to diverse objects.

We have used this tagging approach to gather autonomous internet resources pertaining to air quality events and data. Because they share a common set of controlled tags, these web objects (i.e. links and photos) were harvested in a wiki environment where additional context could be added.

Used the controlled tag “071022SoCalSmoke”

Combined RSS Feeds into an event wiki page
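As a rough sketch of what combining feeds could look like, the Python below merges two RSS 2.0 documents and renders the result as a wiki list. The feed contents and URLs are invented placeholders, and the output assumes MediaWiki-style markup (`== heading ==`, `[url label]`); the poster does not say how the event page was actually built.

```python
import xml.etree.ElementTree as ET

# Placeholder feeds standing in for tag-filtered RSS from two services.
FEED_A = """<rss version="2.0"><channel><title>Bookmarks</title>
<item><title>Smoke plume satellite image</title><link>http://example.org/a1</link></item>
<item><title>PM2.5 monitor map</title><link>http://example.org/a2</link></item>
</channel></rss>"""

FEED_B = """<rss version="2.0"><channel><title>Photos</title>
<item><title>Smoke over the coast</title><link>http://example.org/b1</link></item>
<item><title>PM2.5 monitor map</title><link>http://example.org/a2</link></item>
</channel></rss>"""


def parse_items(rss_text):
    """Extract (title, link) pairs from every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_text)
    items = []
    for item in root.iter("item"):
        title = item.findtext("title", default="").strip()
        link = item.findtext("link", default="").strip()
        items.append((title, link))
    return items


def combine_feeds(feeds):
    """Merge items from several feeds, keeping the first occurrence of each link."""
    seen = set()
    merged = []
    for feed in feeds:
        for title, link in parse_items(feed):
            if link not in seen:
                seen.add(link)
                merged.append((title, link))
    return merged


def to_wiki_list(heading, items):
    """Render items as a heading plus a bulleted list of external links."""
    lines = [f"== {heading} =="]
    lines += [f"* [{link} {title}]" for title, link in items]
    return "\n".join(lines)


page = to_wiki_list("071022SoCalSmoke", combine_feeds([FEED_A, FEED_B]))
print(page)
```

Deduplicating on the link keeps an object that was tagged in more than one service from appearing twice on the event page.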

Distributed metadata described with semantic tags

Metadata queries allowed multiple wiki views
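One way to picture how metadata queries yield multiple views is to treat each dataset's semantic tags as key/value pairs and group or filter on them. The sketch below is hypothetical: the dataset names, the tag keys (`parameter`, `platform`), and the helper functions are assumptions made for illustration, not the DataSpaces schema.

```python
from collections import defaultdict

# Hypothetical dataset records described by semantic (key/value) tags.
DATASETS = [
    {"name": "Surface PM2.5 monitors",
     "tags": {"parameter": "PM2.5", "platform": "surface"}},
    {"name": "Satellite aerosol optical depth",
     "tags": {"parameter": "AOD", "platform": "satellite"}},
    {"name": "Satellite fire detections",
     "tags": {"parameter": "fire", "platform": "satellite"}},
]


def query(records, **criteria):
    """Return the records whose tags match every key/value pair in criteria."""
    return [
        r for r in records
        if all(r["tags"].get(key) == value for key, value in criteria.items())
    ]


def view_by(records, facet):
    """Group dataset names by the value of one tag key, giving one 'view'."""
    groups = defaultdict(list)
    for r in records:
        groups[r["tags"].get(facet, "untagged")].append(r["name"])
    return dict(groups)


# The same metadata, presented two different ways.
print(view_by(DATASETS, "platform"))
print(view_by(DATASETS, "parameter"))
print([r["name"] for r in query(DATASETS, platform="satellite")])
```

Because each view is just a query over the same tagged records, adding a new view requires no change to the metadata itself.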

DataSpaces uses semantic and controlled tags

Semantic tags allow reuse