13
Building ETL Systems with SAS Data Integration Studio November 14, 2007

Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

  • Upload
    vubao

  • View
    249

  • Download
    7

Embed Size (px)

Citation preview

Page 1: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

Building ETL Systemswith

SAS Data Integration Studio

November 14, 2007

Page 2: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

2

Three Building Blocks of DIS

Define target data sets to store your transformed data using Target Designer

Write custom transformations using Transformation Generator

Build job modules using Process Designer

Page 3: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

3

DIS – Target Designer

Target table creationCopy existing data set structureDesign new data set structure

Cube designer wizard

Page 4: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

4

DIS – Transformation Generator

Base SAS code

Macros

Define input & output parameters

Page 5: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

5

DIS – Transformation Generator

Page 6: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

6

DIS – Process Designer

Visual representation of processes

Resemble design diagrams

Execution capability

Deploy job for scheduling

Page 7: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

7

DIS – Process Designer

Page 8: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

8

Important Useful Features

Check in / Check out

Use of other SAS tools to write Base SAS code

Source Designer

Update metadata

Page 9: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

9

Challenges

Diagram size in Process Designer

Runtime space used by pre-defined transformations

Partial Check in capability

SPDS ClustersWhen cluster members are generated from different processes

Page 10: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

10

Environment Migration

ExportFrom FoundationBy object type

ImportFrom FoundationCleanup first

Page 11: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

11

Execution & Scheduling

Job execution from DIS

Deployment for scheduling

SAS – Management Console – Schedule Manager

Page 12: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

12

Conclusion

The SAS Data Integration Studio tool has the capabilities to construct full ETL systems.Three steps to build ETL systems using DIS:

Build target data setsWrite transformationsDevelop processes

Important extra features:Check In / Check OutMigration tools (Export/Import)Job execution and deployment

Some limitations:Size of processesUsing predefined transformationsPartial Check In

Page 13: Building ETL Systems with SAS Data Integration · PDF fileBuilding ETL Systems with SAS Data Integration Studio ... Check in / Check out ... The SAS Data Integration Studio tool has

13

Questions & Answers