GlobusWORLD 2012 Innovating Research Data Management Ian Foster – April 2012

GlobusWorld 2012 Foster Keynote


DESCRIPTION

My introductory talk at GlobusWorld 2012, in which I review the progress made with Globus Online during the last year, and introduce our new Globus Storage, Globus Collaborate, and Globus Integrate services. A great demo showed it all working.


Page 1: GlobusWorld 2012 Foster Keynote

GlobusWORLD 2012

Innovating Research Data Management

Ian Foster – April 2012

Page 2: GlobusWorld 2012 Foster Keynote

www.globustoolkit.org www.globusonline.org

The data deluge

Page 3: GlobusWorld 2012 Foster Keynote


Run a research project from a coffee shop?

Page 4: GlobusWorld 2012 Foster Keynote

Towards “research IT as a service”

• Dark Energy Survey
• Galaxy genomics
• LIGO observatory
• SBGrid structural biology consortium
• NCAR climate data applications
• Land use change; economics

Page 5: GlobusWorld 2012 Foster Keynote

Let’s rethink how we provide research IT

Accelerate discovery and innovation worldwide by providing research IT as a service

Leverage the cloud to
• provide millions of researchers with unprecedented access to powerful tools;
• enable a massive shortening of cycle times in time-consuming research processes; and
• reduce research IT costs dramatically via economies of scale

Page 6: GlobusWorld 2012 Foster Keynote


2011: Grid meets Cloud

Build the Grid

Components for building custom grid solutions

globustoolkit.org

Globus Toolkit

Use the Grid

Reliable file transfer Software-as-a-Service

globusonline.org

Globus Online

Page 7: GlobusWorld 2012 Foster Keynote

Globus Online – The first 18 months

• >4,500 registered users; adding ~300/month
• >3.5 PB moved; averaging ~50 TB/week

[Chart: Data Transferred (TB/week), Jan 2011 to Feb 2012; y-axis 0 to 200]

[Chart: Support Requests Received; x-axis labels Dec 2010, Jan 2011; y-axis 0 to 250]

Page 8: GlobusWorld 2012 Foster Keynote

Globus Online – The first 18 months

• 514 active endpoints (year-to-date), including 62 at Tier 1 research institutions
• >390,000,000 files moved
• >100,000 transfer requests
• 99.9% uptime in 2012
• 18 identity providers supported, using multiple protocols: X.509, MyProxy OAuth, OpenID
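As a rough sanity check on the usage figures reported for the first 18 months, the short Python sketch below derives a few ratios from them. The inputs are the slides' lower bounds (">") and approximations ("~"), decimal units (1 PB = 1000 TB) are an assumption, and the variable names are illustrative, so the results are order-of-magnitude estimates only.

```python
# Back-of-the-envelope ratios from the reported Globus Online figures.
# Inputs are lower bounds or approximations taken from the slides;
# decimal units (1 PB = 1000 TB = 1e15 bytes) are assumed.

BYTES_MOVED = 3.5e15          # ">3.5 PB moved"
FILES_MOVED = 390_000_000     # ">390,000,000 files moved"
TRANSFER_REQUESTS = 100_000   # ">100,000 transfer requests"
TB_PER_WEEK = 50              # "averaging ~50 TB/week"

# Average size of a transferred file, in megabytes.
mean_file_mb = BYTES_MOVED / FILES_MOVED / 1e6

# Average number of files moved per transfer request.
files_per_request = FILES_MOVED / TRANSFER_REQUESTS

# How many weeks the average rate would need to reach the total volume.
weeks_at_average_rate = (BYTES_MOVED / 1e12) / TB_PER_WEEK

print(f"mean file size    ~ {mean_file_mb:.1f} MB")
print(f"files per request ~ {files_per_request:.0f}")
print(f"weeks at 50 TB/wk ~ {weeks_at_average_rate:.0f}")
```

The roughly 70 weeks implied by the average rate is in line with an 18-month (about 78-week) period, given that both inputs are rounded.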

Page 9: GlobusWorld 2012 Foster Keynote


Our user communities

Page 10: GlobusWorld 2012 Foster Keynote


Page 11: GlobusWorld 2012 Foster Keynote

2012: Move. Store. Collaborate.

globus online

[Diagram labels: Globus Integrate; Globus Connect Multi User; Globus Connect; Globus Nexus; Globus Transfer; Globus Storage; Globus Collaborate; Globus Toolkit]

Page 12: GlobusWorld 2012 Foster Keynote


Towards “research IT as a service”

Research Data Management-as-a-Service

Globus Integrate

Globus Transfer

Globus Storage

Globus Collaborate

Globus Catalog

…SaaS

…PaaS

Page 13: GlobusWorld 2012 Foster Keynote

Globus Storage: For when you want to …

• Place your data where you want
• Access it from anywhere via different protocols
• Update it, version it, and take snapshots
• Share versions with whom you want
• Synchronize among locations

[Diagram labels: Commercial storage service provider; National research center; Campus computing center; Globus Storage volume; Globus Transfer, HTTP/REST, Desktop sync]

Page 14: GlobusWorld 2012 Foster Keynote

Globus Collaborate: For when you want to …

Join with a few or many people to:
• Share documents
• Track tasks
• Send email
• Share data
• Do whatever

With:
• Common groups
• Delegated management

Page 15: GlobusWorld 2012 Foster Keynote

Globus Integrate: For when you want to …

Write programs that access/manage user identities, profiles, groups, resources, and data via REST APIs and command line programs

[Diagram labels: Globus Integrate; Globus Transfer; Globus Storage; Globus Collaborate; Globus Connect Multi User; Globus Connect; Globus Nexus; Globus Toolkit]
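To make "programs that access/manage ... data via REST APIs" a little more concrete, here is a minimal Python sketch of what a client might assemble before submitting a transfer request over REST. The field names, endpoint names, and paths are hypothetical and are not taken from the talk or from the actual Transfer API schema; the sketch only builds and serializes a JSON document and makes no network call.

```python
import json


def build_transfer_request(source_endpoint, destination_endpoint, items, label=""):
    """Assemble a JSON-serializable transfer request.

    The schema here is hypothetical and for illustration only.
    `items` is a list of (source_path, destination_path) pairs; a source
    path ending in "/" is treated as a directory to copy recursively.
    """
    if not items:
        raise ValueError("a transfer request needs at least one item")
    return {
        "label": label,
        "source_endpoint": source_endpoint,
        "destination_endpoint": destination_endpoint,
        "items": [
            {
                "source_path": src,
                "destination_path": dst,
                "recursive": src.endswith("/"),
            }
            for src, dst in items
        ],
    }


# Hypothetical endpoint names and paths.
request = build_transfer_request(
    "lab#storage",
    "campus#cluster",
    [("/data/run42/", "/scratch/run42/"), ("/data/notes.txt", "/scratch/notes.txt")],
    label="example",
)
body = json.dumps(request, indent=2)
print(body)
```

A real client would send a document like `body` to the service with an authenticated HTTP request; that step is omitted here because its details depend on the actual API.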

Page 16: GlobusWorld 2012 Foster Keynote

Earth System Grid – Portal integration

Outsource data transfer to Globus
• Data download to user machine from search
• Data transfer to another server by user
• Replication of data between sites by administrator

No ESGF client software needed

Page 17: GlobusWorld 2012 Foster Keynote


Page 18: GlobusWorld 2012 Foster Keynote


Let’s take a look at what’s coming…

Diffusion Tensor Imaging (DTI)

Traumatic Brain Injury (TBI)

Page 19: GlobusWorld 2012 Foster Keynote

Let’s take a look at what’s coming…

[Diagram labels: UChicago Object Store; Cornell Red Cloud; SDSC Cloud; Amazon S3; PADS Compute Cluster; “TBI” volume; Kyle; Bryce]

• Globus Storage: Create volume and share with TBI group
• Globus Transfer: Copy TBI data to compute cluster
• Globus Transfer: Move DTI results to shared volume
• Globus Nexus: Add Bryce to TBI collaboration
• Globus Collaborate: Publish DTI data to TBI web site
• Globus Connect: Move MRI files to TBI shared volume
• Globus Connect: Move DTI results to Bryce’s laptop
• Globus Storage: Create snapshot to share with group

DTI Group: Kyle

DTI Group: Kyle, Bryce

Page 20: GlobusWorld 2012 Foster Keynote

Globus Toolkit update

GT Releases 2011-2012
• 5/4/11: GT 5.1.0
• 5/18/11: GT 5.0.4
• 8/2/11: GT 5.1.1
• 11/16/11: GT 5.1.3 (5.2 beta)
• 12/15/11: GT 5.2.0
• 3/8/12: GT 5.0.5 (likely last 5.0.x release)
• 4/5/12: GT 5.2.1rc1 (5.2.1 final planned for April)

Highlights
• Native packaging
• GridFTP: DCSC support
• New GRAM5 version; in use on OSG

Page 21: GlobusWorld 2012 Foster Keynote

Globus Toolkit update – GridFTP

• GT 5.2.0
  – Support for DCSC command
    • Allows client to specify credentials used to secure data channel connection
    • Utilized by Globus Transfer for seamless data movement across multiple security domains
  – Server administrators may now restrict client access to a set of paths
• Coming in 5.2.1
  – MLSC: stream directory listings over control channel
    • Enables simple (control channel-only) implementation of Globus Storage FTP server
    • Can help directory listings for GridFTP servers behind restrictive firewalls
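The path restriction mentioned for GT 5.2.0 can be pictured as a prefix check on normalized paths. The sketch below is a generic illustration in Python, not GridFTP's implementation or configuration syntax; the function name and the example paths are hypothetical.

```python
import posixpath


def is_path_allowed(requested, allowed_roots):
    """Return True if `requested` lies inside one of `allowed_roots`.

    Paths are normalized first so that '..' segments cannot be used to
    escape an allowed directory. Symbolic links are not resolved here.
    """
    path = posixpath.normpath(posixpath.join("/", requested))
    for root in allowed_roots:
        root = posixpath.normpath(posixpath.join("/", root))
        # Match the root itself or anything strictly below it; the "/"
        # suffix prevents "/data/public" from matching "/data/publicity".
        if root == "/" or path == root or path.startswith(root + "/"):
            return True
    return False


# Hypothetical set of directories a server administrator might expose.
allowed = ["/data/public", "/home/alice"]
print(is_path_allowed("/data/public/run1/file.dat", allowed))
print(is_path_allowed("/data/public/../private/key", allowed))
print(is_path_allowed("/data/publicity", allowed))
```

Comparing against `root + "/"` rather than the bare prefix is the key design choice: a plain string-prefix test would wrongly admit sibling directories whose names merely start with an allowed root.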

Page 22: GlobusWorld 2012 Foster Keynote

Delivery plans

Globus Integrate
• Transfer API available
• APIs for Storage, Collaborate planned after app release

Globus Transfer
• Generally available
• Service and Web UI enhancements continue

Globus Storage
• Early release
• Generally available in Fall 2012

Globus Collaborate
• Initial projects at UChicago
• Early release sometime before year-end 2012

Globus Connect Multi User (GA)

Globus Connect (GA)

Globus Nexus (Alpha)

Globus Toolkit (GA V5.2)

Page 23: GlobusWorld 2012 Foster Keynote

Delivery plans

• Globus Storage
  – Current: Early access – contact us if interested
  – Open beta planned in late Spring; GA in late Summer
  – V1.0: UChicago and Amazon S3 object stores
• Globus Collaborate
  – Current: Pre-release with two UChicago/CI groups
  – First release planned around year-end

Page 24: GlobusWorld 2012 Foster Keynote

Premium offerings

Motivated by sustainability: resource cost recovery, user support, future development

• Globus Transfer plans (resource providers)
  – www.globusonline.org/premium-plans/
• Globus Storage premium plans
  – Plan variables include: size, object store type, durability, support level, personalization
• Globus Collaborate premium plans
  – Plan variables include: branding capabilities, support level, integration (e.g. identity)


Page 26: GlobusWorld 2012 Foster Keynote

The Globus team

Bryce Allen, Rachana Ananthakrishnan, Lisa Childers, Martin Feller, Tom Howe, Raj Kettimuthu, Jack Kordas, Lukasz Lacinski, Lee Liming, JP Navarro, Karl Pickett, Andrew Zich, Peter Zich

Page 27: GlobusWorld 2012 Foster Keynote


Thank you!