41
Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Embed Size (px)

Citation preview

Page 1: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Overview of TeraGrid Resources and Usage

Selim KalayciFlorida International University

07/14/2009

Note: Slides are compiled from various TeraGrid Documentations

Page 2: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

What is the TeraGrid?

Page 3: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

TGUP (TeraGrid User Portal)

Page 4: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Accessing TeraGrid User Portal

Page 5: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Accessing TeraGrid User Portal

Page 6: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Portal Overview

Page 7: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Portal: MyTeraGrid

Page 8: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Portal: Resources

Page 9: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Portal: Documentation

Page 10: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Accessing Resources

Page 11: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Web-based SSO via Portal

Page 12: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Web-based SSO via Portal

Page 13: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

SSO (Non-Portal) from a TeraGrid Resource

Page 14: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Example

- SSH to tg-login.ncsa.teragrid.org or another resource that you have SSH access.

- grid-proxy-info

- myproxy-logon -l username

- grid-proxy-info

- gsissh tg-login.purdue.teragrid.org

Page 15: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

TeraGrid Resources

• http://www.teragrid.org/userinfo/hardware/– Sorted by site– Sorted by machine type

• http://portal.teragrid.org– My TeraGrid -> Accounts– Resources

• http://www.ncsa.uiuc.edu/UserInfo/Resources/– URLs here contain detailed user documentation

Page 16: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Moving data to/from TeraGrid systems

• Sftp clients from your office to TeraGrid– Command line sftp on Linux– GUI sftp clients

• GSI-SSHTerm sftp button• http://portal.teragrid.org

– Resources -> File Manager [beta]

• High speed gridftp between TeraGrid systems– globus-url-copy– Uberftp

Page 17: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

File Transfers: Small ( <100 MB) Files

Page 18: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Large ( >100 MB) File Transfers: globus-url-copy

• The globus-url-copy client program is a GridFTP client for transferring files from the command line.

• Usage:– globus-url-copy <source_url> <destination_url>

where <source> or <destination> is of the format:if local file, file:<full path> if remote file, gsiftp://<hostname>/<full

path>

Page 19: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Example – Two Party Transfer

• Logon to NCSA Abe Cluster.• Create a large file on NCSA Abe Cluster:

– dd bs=100MB count=1 if=/dev/zero of=testfile

• Copy this file to Purdue Steele Cluster:– globus-url-copy –vb file:///u/ac/username/testfile

gsiftp://tg-steele.purdue.teragrid.org:2811/autohome/u108/username/

Page 20: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Example – Third Party Transfer

• Logon to NCSA Abe Cluster• Copy the testfile at Purdue Steele Cluster to NCAR

Frost Cluster– globus-url-copy -vb gsiftp://tg-

steele.purdue.teragrid.org:2811/autohome/u108/username/testfile gsiftp://gridftp.frost.ncar.teragrid.org:2811//home/username/

• GridFTP server addresses for each site are listed at:– http://www.teragrid.org/userinfo/data/transfer_locati

on.php#deployment

Page 21: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Optimized Data Transfer with globus-url-copy

• Using large TCP windows– globus-url-copy –vb -tcp-bs 1048576 file:///u/ac/username/testfile

gsiftp://tg-steele.purdue.teragrid.org:2811/autohome/u108/username/

• Using large memory buffers– globus-url-copy –vb -bs 1048576 file:///u/ac/username/testfile

gsiftp://tg-steele.purdue.teragrid.org:2811/autohome/u108/username/

• Using multiple parallel streams– globus-url-copy –vb –p 4 file:///u/ac/username/testfile

gsiftp://tg-steele.purdue.teragrid.org:2811/autohome/u108/username/

Page 22: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Large ( >100 MB) File Transfers: UberFTP

• UberFTP is an interactive GridFTP file transfer client.• Opens a session with a remote host, within which files

may be transfered and directories and files may be manipulated.

• Requires GSI authentication.• Hands-On:

– Login to NCSA Abe– uberftp– open tg-steele.rcac.purdue.edu– parallel 2– tcpbuf 8388608– ls/lls/put/get/…

Page 23: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Imaginations unbound

Data movement tips• To move a collection of small files, make an archive

and move it instead of moving the files individually– tar– zip

• For high bandwidth links and moderate file or archive size, do not compress, it's usually faster to just move the data [compression is a time waster]

• For low bandwidth links, compression is usually a time saver– tar z or j options for compression– zip– ssh -C, sftp -C

Page 24: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Permanent Storage at NCSA

• The larger TeraGrid sites provide persistent high-capacity storage

• Details vary by site, consult local site documentation for specifics

• Refer to:– http://www.teragrid.org/userinfo/data/storage.phpfor detailed information about different quotas,

policies, and tools (such as SRB, HPSS) at each site.

Page 25: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Managing Your Environment: Softenv

Page 26: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Softenv

Page 27: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Managing Your Environment: Modules

* Try at tg-steele.rcac.purdue.edu

Page 28: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Softenv and Modules: Which do I use?

Rule of thumb: go with the default on a given machine

– When you login for the first time issue ‘softenv’ and ‘module list’ commands

– In general, only one should be active by default: go with that one

– If you have questions or run into any problems contact [email protected]

Page 29: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

29

Grid Job Management using Globus

• Common WS interface to schedulers– Unix, Condor, LSF, PBS, SGE, …

• More generally: interface for process execution management– Lay down execution environment – Stage data– Monitor & manage lifecycle– Kill it, clean up

Page 30: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

30

Grid Job Management Goals

Provide a service to securely:• Create an environment for a job• Stage files to/from environment• Cause execution of job process(es)

– Via various local resource managers• Monitor execution• Signal important state changes to client• Enable client access to output files

– Streaming access during execution

Page 31: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

31

GRAM• GRAM: Globus Resource Allocation and

Management• GRAM is a Globus Toolkit component

– For Grid job management• GRAM is a unifying remote interface to Resource

Managers– Yet preserves local site security/control

• Remote credential management• File staging via RFT and GridFTP

Page 32: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

32

A Simple Example• First, login to queenbee.loni-lsu.teragrid.org• Command example:% globusrun-ws -submit -c /bin/date

Submitting job...Done.Job ID: uuid:002a6ab8-6036-11d9-bae6-0002a5ad41e5Termination time: 01/07/2005 22:55 GMTCurrent job state: ActiveCurrent job state: CleanUpCurrent job state: DoneDestroying job...Done.

• A successful submission will create a new ManagedJob resource with its own unique EPR for messaging

• Use –o option to create the EPR file% globusrun-ws -submit –o job.epr -c /bin/date

Page 33: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

33

A Simple Example(2)• To see the output, use –s (stream) option% globusrun-ws -submit –s -c /bin/date

Termination time: 06/14/2007 18:07 GMTCurrent job state: ActiveCurrent job state: CleanUp-HoldWed Jun 13 14:07:54 EDT 2007Current job state: CleanUpCurrent job state: DoneDestroying job...Done.Cleaning up any delegated credentials...Done.

• If you want to send the output to a file, use –so option% globusrun-ws -submit –s –so job.out -c /bin/date

…% cat job.out

Wed Jun 13 14:07:54 EDT 2007

Page 34: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

34

A Simple Example(3)

• Submitting your job to different schedulers– Fork% globusrun-ws -submit -Ft Fork -s -c /bin/date

(Actually, the default is Fork. So, you can skip it in this case.)

– SGE% globusrun-ws -submit -Ft PBS-s -c /bin/date

• Submitting to a remote site% globusrun-ws -submit -F tg-login.frost.ncar.teragrid.org -c /bin/date

Page 35: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

35

Batch Job Submissions% globusrun-ws -submit -batch -o job_epr -c /bin/sleep

50Submitting job...Done.Job ID: uuid:f9544174-60c5-11d9-97e3-0002a5ad41e5Termination time: 01/08/2005 16:05 GMT

% globusrun-ws -status -j job_eprCurrent job state: Active

% globusrun-ws -status -j job_eprCurrent job state: Done

% globusrun-ws -kill -j job_eprRequesting original job description...Done.Destroying job...Done.

Page 36: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

36

Resource Specification Language (RSL)

• RSL is the language used by the clients to submit a job.

• All job submission parameters are described in RSL, including the executable file and arguments.

• You can specify the type and capabilities of resources to execute your job.

• You can also coordinate Stage-in and Stage-out operations through RSL.

Page 37: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

37

Submitting a job through RSL

• Command:% globusrun-ws -submit -f touch.xml

• Contents of touch.xml file:<job> <executable>/bin/touch</executable> <argument>touched_it</argument></job>

Page 38: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Security - Basics

Page 39: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

How to get Help

• First, try searching the Knowledge Base or other Documentation

• If that doesn’t help, submit a ticket– Send an email to [email protected]– Use the TeraGrid User Portal ‘Consulting’

tab• Can also call TeraGrid Help Desk 24/7:

1-866-907-2383

Page 40: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

Submitting a Ticket

Page 41: Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

More Info

• TeraGrid Resource User Guides– http:www.teragrid.org/userinfo/hardware/

resources.php• File Transfers and Data Management on TeraGrid

– http:www.teragrid.org/userinfo/data• More Training

– https://portal.teragrid.org/gridsphere/gridsphere?cid=onlinetraining