
02 Datastage Overview


Slide 1

Session 2: DataStage Overview

Slide 2

Objectives

• Enterprise Data Integration
• Describe DataStage
• History of DataStage
• Identify the server and client components of DataStage
• Describe DataStage projects
• Describe DataStage jobs
• Identify the steps for designing a DataStage job

Slide 3

Enterprise Data Integration

Slide 4

DataStage

With DataStage you can:
• Design jobs that extract, integrate, aggregate, and transform data and load it into a target
• Create, manage, and reuse metadata
• Validate, run, monitor, and schedule jobs
• Manage your development environment
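DataStage jobs are designed graphically rather than written as code, but the extract, transform, aggregate, and load flow listed on this slide can be sketched in a few lines of Python. This is only an illustration of the concept, not DataStage itself: the sample data, the function names, and the use of a plain dictionary as the "target" are all assumptions made for the example.

```python
import csv
import io
from collections import defaultdict

# Hypothetical source data; a real job would read from a database or file.
SOURCE_CSV = """region,amount
north,10.5
south,4.0
north,2.5
"""


def extract(text):
    """Extract: read source rows as dictionaries keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: upper-case the region and convert amount to a float."""
    return [
        {"region": row["region"].strip().upper(), "amount": float(row["amount"])}
        for row in rows
    ]


def aggregate(rows):
    """Aggregate: sum the amount per region."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


def load(totals, target):
    """Load: write the aggregated totals into the target (a dict stand-in)."""
    target.update(totals)
    return target


# Chain the stages in the same order a job would link them.
warehouse = load(aggregate(transform(extract(SOURCE_CSV))), {})
print(warehouse)
```

Each function plays the role of one stage in a job, and the chained call at the end corresponds to the links between stages.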

Slide 5

History of DataStage

DataStage was launched in 1997 by a company called VMark.

VMark was later taken over by Ardent, which in turn was taken over by Informix.

Informix was acquired by IBM, and Ascential was spun off as a separate company.

Ascential acquired Torrent Systems, a developer of parallel-processing infrastructure software for building highly scalable data warehouses, for $46 million.

The current release is DataStage 7.5 from Ascential Software, which includes parallel processing capabilities in addition to its original server processing.

Ascential Software has since been acquired by IBM.

Slide 6

DataStage Application Components

[Diagram: DataStage application components]
• Server platform (Microsoft® Windows NT or UNIX): Server, Repository
• Client platform (Microsoft® Windows 95): Designer, Director, Repository Manager, Administrator
• Source data: Oracle, Sybase, Informix, UniVerse, applications
• Target data: Oracle, SQL Server, Red Brick, Sybase, Informix, UniVerse
• Processing: Extract, Cleanse, Transform, Integrate

Slide 7

DataStage Administrator

Most DataStage configuration tasks are carried out using the DataStage Administrator, a client program provided with DataStage.

Changing license details
DataStage project administration:
• Add new DataStage projects
• Delete projects
• Move projects
Adding environment variables

Slide 8

DataStage Administrator

Setting up DataStage users

Cleaning up project files

Purging job log files

Setting the timeout interval on the server computer

Tracing server activity

Adding entries to the Tools menu

Setting job parameter defaults

Issuing DataStage engine commands from the Administrator client

Slide 9

DataStage Director

• Validate jobs
• Run jobs
• Monitor jobs
• Schedule jobs
• Gather statistics

Slide 10

DataStage Designer

Slide 11

DataStage Manager

• Store metadata
• Reuse metadata
• Define routines

Slide 12

Development in DataStage

Define project properties: Administrator
Open project
Design jobs: Designer
• Import metadata: Manager
• Define extractions, data flows, integrations
• Define transformations, constraints, aggregations
• Define loads
Compile and debug jobs: Designer
Run and monitor jobs: Director

Slide 13

DataStage Projects

Created during installation

Associated with a directory

Attach the users to the projects and assign roles

Self-contained

Multiple users can be working at the same time