1
Challenge to Data-intensive science: cooperation of metadata database for upper atmospheric research and author ID KOYAMA, Y. 1 , K. KURAKAWA 2 , Y. SATO 3 , Y. TANAKA 3 , S. ABE 4 , T. HORI 5 , N. UMEMURA 5 , A. SHINBORI 6 , S. UeNo 6 , M. YAGI 7 , and A. YATAGAI 6 1 WDC for Geomag., Kyoto & Kyoto univ., 2 NII, 3 NIPR, 4 Kyushu univ., 5 Nagoya univ., 6 Kyoto univ., 7 Tohoku univ. Open Repositories 2013 July, 8, 2013 – July, 12, 2013 (PEI, Canada) • We defined Upper Atmospheric domain specific XML Schema which connects DOI & ORCID. ( https://gist.github.com/iugonet/e32526c240519e03c112 ) • In reuse of dataset & data, not only cooperation of DOI & ORCID, but also domain specific dainty- grained metadata is needed. A case study of Data-intensive Science in Upper Atmospheric Research Conclusion DataCite Metadata Properties ID Property 1 Identifier (with type attribute) 2 Creator (with name identifier attributes) 3 Title (with optional type attribute) 4 Publisher 5 Publication Year ID Property 6 Subject (with schema attribute) 7 Contributor (with type and name identifier attributes) 8 Date (with type attribute) 9 Language 10 ResourceType (with description attribute) 11 AlternateIdentifier (with type attribute) 12 RelatedIdentifier (with type and relation type attributes) 13 Size 14 Format 15 Version 16 Rights 17 Description (with type attribute) ID Property 0.1 LastMetadataUpdate 0.2 MetadataVersionNumber Table 1: DataCite Mandatory Properties Table 2: DataCite Optional Properties Table 3: DataCite Administrative Properties Characteristics of the upper atmospheric research 1.There are various phenomena over a wide area from the earth surface to the space 2.Many physical parameters need to be observed 3.Various variations including solar cycle, tidal cycle, gravity waves, etc. are overlapped •Sufficient for Data Publication & Citation purpose . •Insufficient for Scientific reuse purpose . •No detailed metadata (e.g., Geolocational properties), •No Role info. of the Contributor. •Coarse-grained metadata. (The min. unit is dataset ) Necessity for domain specific metadata for reuse purpose. Data-intensive Science DOI ORCID with Role (e.g., PI, Archive Spefialist) Metadata for Upper atmospheric research, DOI, & ORCID DataCite Japan Link Center (JaLC) •Sufficient for Scientific reuse purpose . •Detailed metadata (e.g., Geolocation Info., Calibration Info.), •Human Resource Info. with Role (e.g., PI, Archive Specialist) •Dainty-grained metadata. (The min. unit is data file ) This monain specific metadata helps reuse dataset & data. Scientist derive from their info. to “Derived and Recombined Data”. Tony Hey, Stewart Tansley, & Kristin Tolle (Eds.). (2009). The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research. Retrieved from http://research.microsoft.com/en-us/collab oration/fourthparadigm/default.aspx Science is changing because of the impact of information technology. Experimental, theoretical & computational science are all being affected by the data deluge, and a fourth, “data-intensive” science paradigm is emerging. Necessity of a Metadata Database “All Scientific Data Online” future increase Scientific information Velocity. Construction steps of the left pyramidal structure: 1.To construct Data Publication & Citation, 2.To construct reuse data framework, 3. - More. Some steps are needed. We considered the cooperation of metadata & identifiers which required for steps 1 & 2. 1 2 IUGONET provides a new research platform that enables Metadata for ground-based observation data to be shared. In addition, IUGONET developed analysis software to access and analyze data in an integrated fashion. IUGONET provides a new research platform that enables Metadata for ground-based observation data to be shared. In addition, IUGONET developed analysis software to access and analyze data in an integrated fashion. What is Metadata in this field? Metadata are defined as that describe certain information of our data (location, period, Instruments used, data location(URL)).

Or2013 poster

Embed Size (px)

Citation preview

Page 1: Or2013 poster

Challenge to Data-intensive science: cooperation of metadata database

for upper atmospheric research and author IDKOYAMA, Y.1, K. KURAKAWA2, Y. SATO3, Y. TANAKA3, S. ABE4, T. HORI5, N. UMEMURA5, A. SHINBORI6, S. UeNo6, M. YAGI7, and A. YATAGAI6

1WDC for Geomag., Kyoto & Kyoto univ., 2NII, 3NIPR, 4Kyushu univ., 5Nagoya univ., 6Kyoto univ., 7Tohoku univ.

Open Repositories 2013 July, 8, 2013 – July, 12, 2013 (PEI, Canada)

• We defined Upper Atmospheric domain specific XML Schema which connects DOI & ORCID. ( https://gist.github.com/iugonet/e32526c240519e03c112)

• In reuse of dataset & data, not only cooperation of DOI & ORCID, but also domain specific dainty-grained metadata is needed.

A case study of Data-intensive Science in Upper Atmospheric Research

Conclusion

DataCite Metadata Properties

ID Property

1 Identifier (with type attribute)

2 Creator (with name identifier attributes)

3 Title (with optional type attribute)

4 Publisher

5 Publication Year

ID Property

6 Subject (with schema attribute)

7 Contributor (with type and name identifier attributes)

8 Date (with type attribute)

9 Language

10 ResourceType (with description attribute)

11 AlternateIdentifier (with type attribute)

12 RelatedIdentifier (with type and relation type attributes)

13 Size

14 Format

15 Version

16 Rights

17 Description (with type attribute)

ID Property

0.1 LastMetadataUpdate

0.2 MetadataVersionNumber

Table 1: DataCite Mandatory Properties Table 2: DataCite Optional Properties

Table 3: DataCite Administrative Properties

Characteristics of the upper atmospheric research1.There are various phenomena over a wide area from the earth surface to the space2.Many physical parameters need to be observed3.Various variations including solar cycle, tidal cycle, gravity waves, etc. are overlapped

•Sufficient for Data Publication & Citation purpose.

•Insufficient for Scientific reuse purpose.

•No detailed metadata (e.g., Geolocational properties),

•No Role info. of the Contributor.

•Coarse-grained metadata. (The min. unit is dataset)↓

Necessity for domain specific metadata for reuse purpose.

Data-intensive Science

DOI

ORCID

with Role

(e.g., PI, Archive Spefialist)

Metadata for Upper atmospheric research, DOI, & ORCID

DataCiteJapan Link Center (JaLC)

•Sufficient for Scientific reuse purpose.

•Detailed metadata (e.g., Geolocation Info., Calibration Info.),

•Human Resource Info. with Role (e.g., PI, Archive Specialist)

•Dainty-grained metadata. (The min. unit is data file)↓

This monain specific metadata helps reuse dataset & data.Scientist derive from their info. to “Derived and Recombined Data”.

Tony Hey, Stewart Tansley, & Kristin Tolle (Eds.). (2009). The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research. Retrieved from http://research.microsoft.com/en-us/collaboration/fourthparadigm/default.aspx

Science is changing because of the impact of information technology. Experimental, theoretical & computational science are all being affected by the data deluge, and a fourth, “data-intensive” science paradigm is emerging.

Necessity of a Metadata Database

“All Scientific Data Online” futureincrease Scientific information Velocity.

Construction steps of the left pyramidal structure:1.To construct Data Publication & Citation,2.To construct reuse data framework,3. - More. Some steps are needed.

We considered the cooperation of metadata & identifiers which required for steps 1 & 2.

1

2

IUGONET provides a new research platform that enables Metadata for ground-based observation data to be shared.In addition, IUGONET developed analysis software to access and analyze data in an integrated fashion.

IUGONET provides a new research platform that enables Metadata for ground-based observation data to be shared.In addition, IUGONET developed analysis software to access and analyze data in an integrated fashion.

What is Metadata in this field?Metadata are defined as that describe certain information of our data (location, period, Instruments used, data location(URL)).