14
What is Data Validation Option What can be tested

Data validation option

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Data validation option

• What is Data Validation Option

What can be tested

Page 2: Data validation option

Data Validation Option is a ETL Testing tool which comes along with power center and enables to test and validate data

Data Validation Option reads table definitions from PowerCenter metadata repositories, and checks the data for inconsistencies

What is Data Validation Option

What can be tested Verify that moved or transformed data is complete and accurate and has not been changed because of errors in the movement or transformation process

Will not check the correctness of transformations or mappings designed in Informatica

Page 3: Data validation option

A typical workflow for data validation consists of multiple tasks-

Data Validation Workflow

Data Validation Option reads one or more PowerCenter metadata repositories.

We define the validation rules(creation of tests) in Data Validation Option

We run the rules(tests) to ensure the data conforms to the validation rules. When we do this, Data Validation Option performs the following tasks:

1. Creates and executes all tests through PowerCenter.2. Loads results into the Data Validation Option results database and displays them in the Data Validation Option Client.

We examine the results and identify data inconsistencies in the ETL process

You repeat this process for new records.

Page 4: Data validation option

The Data Validation Option Client contains multiple areas that allow us to perform different tasks.

Data Validation Option Client Layout Overview

• allow us to perform different tasks.

Page 5: Data validation option

Repositories…

Add

Modify

Delete

Refresh

Exporting Repository Metadata

Page 6: Data validation option

A table pair is the basis for all tests that compare one table to another. We can select a relational

table, flat file, lookup view, or SQL view as one or both tables in a table pair.

Table Pairs

Add

Modify

Delete

Page 7: Data validation option

We can create\run the following types of tests on table pairs:

Aggregate -• Includes COUNT, COUNT_DISTINCT, COUNT_ROWS, MIN, MAX, AVG, and SUM.•

Set -• Includes AinB, BinA, and AeqB.•

Value -• Includes VALUE and OUTER_VALUE.

Tests for Table Pairs

Note: When we run tests, the target folder must be closed in the Designer and Workflow Manager. If the target folder is open, Data Validation Option cannot write to the folder, and the tests return an error

Page 8: Data validation option

single-table pair will be used to run tests on a single table. We can enforce valid values, aggregates, formats, and uniqueness

To run single-table constraints, we must create a single table. You can select a relational table, flat file, lookup view, or SQL view as a single table

Single Table Constraints Pairs

Page 9: Data validation option

SQL views facilitate the use of more complex functionality for single tables and table pairs.

An SQL view allows to use several tables and several calculations in a query to produce a set of fields that you can use as a table in a single table or table pair.

This functionality is similar to the SQL override in PowerCenter or a view in a relational database. You can use any valid SQL statement to create an SQL view.

SQL Views

Page 10: Data validation option

Data Validation Option lookup views allows to test the validity of the lookup logic in transformation layer

One disadvantage of the SQL view is that it does not allow the use of flat files or heterogeneous database tables

Join two heterogeneous sources with a lookup view

Lookup Views

Page 11: Data validation option

You can import and export the following metadata:• Table Pairs• Single Tables• PowerCenter Sources• SQL views• Lookup views• Join views

Import and export of Tests

Data Validation Option allows you to export and import test metadata.

Metadata import and export allows users to share tests and allows rapid generation of tests through scripting.

Page 12: Data validation option

Data Validation Option stores all test definitions and test results in the Data Validation Option repository. We can run reports to display test definitions and results.

Types of reports:

Reports Generation

Summary of Testing Activities

Table Pair Summary

Detailed Test Results

Page 13: Data validation option

Invoke Data Validation Option capabilities at the command line.

Run tests through Command prompt

Running tests at the command line allows you to schedule test execution. It also allows you to embed a specific test as part of the ETL workflow or as part of another process. For example, you can create an ETL process that moves data from source to staging, runs validation, and then moves data into the target or an error table based on the validation results.

Send Emails Data Validation Option sends an email when the tests are complete.

You can provide multiple email addresses separated by commas.

The email specifies whether the test has passed or failed and provides a link to the test results.

Page 14: Data validation option

END