Upload
august-andrews
View
219
Download
0
Tags:
Embed Size (px)
Citation preview
<Insert Picture Here>
Riding the Wave: a Perspective for Today and the FutureAPA Conference, November 2011
Monica MarinucciEMEA Director for Research, Oracle
Monica MarinucciAPA Conference, November 2011
« Our vision is a scientific e-infrastructure that supports seemless access, use, re-use, and trust of data. »
Source: Riding the Wave Report, pag 4
Today
Future
Best Practices
National Libraries
MultimediaDelivery Method
Enterprise Class
Cost-effectiveness
Manageability
Tools for Adoption
open-based
Scientific Workflows
Collaboration
Semantic Data Management
Knowledge Discovery
Multidisciplinary
Governance
Data Security
Identity
Provenance
Preservation
Monica Marinucci
Researcher A
Astronomy
Climatology
Chemistry
History
Biology
Demography
Researcher B
APA Conference, November 2011
Monica Marinucci
Oracle Research Data Management Solution
• Reduce Time-to-Discovery by enabling researchers:
• to work collaboratively on extremely large data sets
• to run analytics and complex queries
more frequentlyData
Generation
Data Analysis
Data Storing
Preservation & Archiving
Data Aggregation
• Enable Trust & Security at Data and Application level
• Store & Retrieve Data across multiple different sources/format over time
• Develop open-standard based applications
Research Data
Lifecycle
APA Conference, November 2011
Monica Marinucci
“About 10 percent of the data is corporate structured data, and 90 percent is unstructured scientific data
such as satellite images,”
John de la LandeHigh-Performance Computing, Storage and Facilities Manager
Australian Bureau of Meteorology
APA Conference, November 2011
• Availability• Integrity• Authenticity• Reusability and Collaboration
• Security• Sustainability
• Trustworthiness
Data Requirements
Go Beyond Capacity
Monica Marinucci
Paving the way
Storage: Tiered Storage
MQFS
Metadata Tier
FSCAN
Infrastructure:Cost-Effectiveness
Green Data Centre
Scalable Research Services
Repositories:Interoperability
Semantic Layer
Collaboration
Secure IP
Knowledge Repository
Include ContentComplete data archive Search and Access
Ingest of different data types
Transform to new file types with access to original file types
Meeting “forever” retention and access requirements
Multi-channel Access
30% of people’s time is spent searching 80% of information is unstructured42% of transactions are still paper-based79% of companies have 2+ repositories 25% have 15+ repositories
Objects must be always retrievable (Access) Cost and complexity as systems scale (Economic Sustainability) Finding data through sophisticated metadata handling (Discovery) Data integrity must be assured (Trust) Seamless scaling must be provided (Scale & Extensibility)Source: Carl Grant, Chief Librarian, Ex Libris
Monica Marinucci
• Focus since 2007
• Collegial Collaboration to Upgrade the Community
• Community of Practice - “Just Do It!” • OAIS Architectures, Research Data Management and Digital
Curation, Third Party Solutions • Tiered Storage Architecture Best Practices and Trends
• New Directions
• Broader Technology Focus: Private and Community Clouds, Content Management for Digital Archiving, Semantic Data Management
• Intra-Institution/Enterprise Trends: Personal Archives, Federation, Security, Management of Scholarly Materials
Preservation & Archiving Special Interest Group (PASIG)
APA Conference, November 2011