48
Data Quality Management for Informatica (12.0.0) August 2009 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

1200 Dqinfa Ug En

  • Upload
    gill408

  • View
    72

  • Download
    0

Embed Size (px)

DESCRIPTION

TEst

Citation preview

Page 1: 1200 Dqinfa Ug En

SAP BusinessObjects Data Quality Management, version for Informatica PowerCenterUser’s Guide

Data Quality Management for Informatica (12.0.0)

August 2009

Page 2: 1200 Dqinfa Ug En

Notices

Copyright information © 2009 SAP, AG. All rights reserved. SAP Business Objects owns the following U.S. patents, which may cover products that are offered and licensed by Business Objects: 5,295,243; 5,339,390; 5,555,403; 5,590,250; 5,619,632; 5,632,009; 5,857,205; 5,880,742; 5,883,635; 6,085,202; 6,108,698; 6,247,008; 6,289,352; 6,300,957; 6,377,259; 6,490,593; 6,578,027; 6,581,068; 6,628,312; 6,654,761; 6,768,986; 6,772,409; 6,831,668; 6,882,998; 6,892,189; 6,901,555; 7,089,238; 7,107,266; 7,139,766; 7,178,099; 7,181,435; 7,181,440; 7,194,465; 7,222,130; 7,299,419; 7,320,122 and 7,356,779. Business Objects and its logos, BusinessObjects, Business Objects Crystal Vision, Business Process On Demand, BusinessQuery, Cartesis, Crystal Analysis, Crystal Applications, Crystal Decisions, Crystal Enterprise, Crystal Insider, Crystal Reports, Crystal Vision, Desktop Intelligence, Inxight and its logos, LinguistX, Star Tree, Table Lens, ThingFinder, Timewall, Let There Be Light, Metify, NSite, Rapid Marts, RapidMarts, the Spectrum Design, Web Intelligence, Workmail and Xcelsius are trademarks or registered trademarks in the United States and/or other countries of Business Objects and/or affiliated companies. SAP is the trademark or registered trademark of SAP AG in Germany and in several other countries. All other names mentioned herein may be trademarks of their respective owners.

2 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 3: 1200 Dqinfa Ug En

Contents

Preface .............................................................................................................5About this guide.........................................................................................5Assumed audience and the content of this guide.......................................5Access the latest documentation................................................................5Datatype support ........................................................................................6Product naming conventions......................................................................6

Chapter 1:Overview ........................................................................................................ 7What is Data Quality Management for Informatica? .......................................8

Data quality functions................................................................................8Name cleansing..........................................................................................8Address cleansing ......................................................................................8Matching function......................................................................................8Flexible configuration................................................................................8

Chapter 2:Installation ..................................................................................................... 9System requirements ......................................................................................10

Requirements for Data Quality Management for Informatica.................10Installation......................................................................................................11

About installation directories...................................................................11Install Data Quality Management for Informatica (Windows and UNIX) ....12

The steps ..................................................................................................12Windows 2003 installation ......................................................................12

Step 1: Install Data Services...........................................................................13Data Services installation notes ...............................................................13Install sample dataflows to Data Services ...............................................13DPV, LACSLink, and URAC transforms................................................14

Step 2: Install the Data Quality Management for Informatica plug-in and samples ...........................................................................................................15

Install server plug-in to your Informatica Server(s) ................................15Install client plug-in to your Informatica client(s) and copy the Data Services workflow samples .....................................................................15The Data Services sample workflows......................................................16

Step 3: Register the DQM_Plugin.xml file ....................................................17Register to the repository.........................................................................17

Step 4: Verify the Data Quality Management for Informatica toolbar ..........18Step 5: Import and set up the sample mappings\workflows...........................19

Import the sample mappings....................................................................19Set up the sample mappings ....................................................................20

Step 6: Specify the Java class path.................................................................23Step 7: Verify the installation.........................................................................25Install Data Quality Management for Informatica on UNIX .........................26

Installation directories .............................................................................26Permissions ..............................................................................................26

Contents 3

Page 4: 1200 Dqinfa Ug En

Add to environment variables ................................................................. 26Install C++ runtime upgrade (Linux and HP-UX) .................................. 27

If you also use a stand-alone SAP BusinessObjects (Firstlogic) product...... 28What causes the conflict? ........................................................................ 28Different user profiles ............................................................................. 28Scripts on UNIX...................................................................................... 28Batch files on Windows .......................................................................... 29

Chapter 3:Add and use Data Quality Management for Informatica transformations 31Data Quality Management for Informatica transformations.......................... 32

Reusable versus nonreusable transformations......................................... 32Create a new transformation ................................................................... 32Transformations provided with this product ........................................... 33

Chapter 4:Troubleshoot Data Quality Management for Informatica ..................... 39Set the tracing level........................................................................................ 40Logging.......................................................................................................... 41Connection information ................................................................................. 42Data Quality Management for Informatica transformation/mapping issues . 43

The sample mappings are failing in the Workflow Manager.................. 43Data isn’t being processed when sent through a Data Quality Management for Informatica transformation ................................................................ 43Process terminating on signal/exception ................................................. 43

Sample active batch match workflow (wf_ABM) fails ................................. 44

Glossary ........................................................................................................ 45

Index.............................................................................................................. 47

4 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 5: 1200 Dqinfa Ug En

Preface

About this guide This guide provides a general overview of SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter as well as specific information for installing and integrating this product into your Informatica environment. Some sections in this guide may not apply to your version of Data Quality Management for Informatica, depending on which product options you purchased.

Assumed audience and the content of this guide

This guide assumes that the people responsible for integrating Data Quality Management for Informatica are experienced users of the various Informatica applications including the PowerCenter Repository Manager, Designer, and Workflow Manager. It does not explain basic operation of the Informatica applications. For questions about the Informatica applications, consult the Informatica documentation or Informatica technical support.

This guide also does not explain the operation of the SAP BusinessObjects Data Services. For specific questions about Data Services, consult the Data Services documentation.

Access the latest documentation

In addition to the documentation installed on your computer during installation of this product, you can access all the supported SAP Business Objects documentation from the SAP Business Objects Customer Support site.

1. Go to http://help.sap.com.

2. Select the Business Objects tab.

3. In the SAP Business Objects Overview section, select All Products.

4. On the “Business Objects Product Guides” screen, navigate to the document that you want to view.

You can view the PDFs online or save them to your computer.

5

Page 6: 1200 Dqinfa Ug En

Conventions This document follows these conventions:

About versions

Informatica versions This product is engineered for Informatica PowerCenter 8.6.1. It is not compatible with previous versions of PowerCenter.

SAP BusinessObjects technology versions

This product uses SAP BusinessObjects Data Services XI Release 3.2 (12.2.0.1) - Data Quality Management Premium license, to power its data quality processes.

This product is not compatible any version prior to 12.2.0.1.

Datatype support This product supports only the string datatype.

Product naming conventions

For brevity, SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter is often in this guide called Data Quality Management for Informatica.

SAP BusinessObjects Data Services XI Release 3.2 (12.2.0.1) is often in this guide called Data Services.

Convention Description

Bold Highlights file names, paths, and text that you should type exactly as shown. For example, “Type cd\dirs.”

Italics Both emphasizes text and indicates text for which you should substi-tute your own data or values. For example, “Type a name for your file, and the .txt extension (testfile.txt).”

> in menu commands

Indicates commands that you choose from menus in the following for-mat: Menu Name > Command Name. For example, “Choose File > New.”

!Alerts you to important information and potential problems.

Points out special cases that you should know about.

Draws your attention to tips that may be useful to you.

6 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 7: 1200 Dqinfa Ug En

Chapter 1: Overview

Before you use SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter, you should understand some of the concepts behind data quality and address cleansing. A basic familiarity with these concepts will help you to understand what Data Quality Management for Informatica does and why your organization benefits from data quality.

Chapter 1: Overview 7

Page 8: 1200 Dqinfa Ug En

What is Data Quality Management for Informatica?

Data Quality Management for Informatica is designed to ensure information quality within Informatica PowerCenter installations. With the capabilities provided by Data Services, it can provide batch verification and correction of name and address data, and can include sophisticated processes to identify matching records.

Data quality functions Data Quality Management for Informatica brings name and address integrity to your Informatica environment through preset transformations that your users can easily integrate into standard Informatica extract, transform, and load (ETL) mapping.

Name cleansing The name cleansing capability of this product uses extensive name dictionaries to find name and firm (business name) information and, if necessary, add or fix that information from your source data. For example, based on the name dictionary information, you can add to your source data other name-related components, such as gender, titles, and so on.

Address cleansing Data Quality Management for Informatica performs standardization and correction of U.S. and international addresses. You can purchase different international engines for Data Services. The engines provide a different level of address-checking for certain countries. For example, if you own the Canadian engine, Canadian addresses are verified down to the street address level. With the international engine, some countries’ addresses are only verified to the city level. International address cleansing may not be available in your version of Data Quality Management for Informatica, depending on which product options you purchased.

Matching function The matching function of Data Quality Management for Informatica brings sophisticated record-matching technology to your Informatica environment. Use this matching capability in concert with address cleansing and name cleansing to improve the quality of your data by identifying matching or duplicate records.

Flexible configuration You can integrate the data cleansing and matching functions with relatively minor configuration. In addition, your Informatica users can further fine tune their results of Data Quality Management for Informatica’s operations by making changes to the Data Services jobs.

8 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 9: 1200 Dqinfa Ug En

Chapter 2: Installation

Before you can integrate SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter into your application, you must install Data Services and install the Data Quality Management for Informatica plug-in to your Informatica server and client machines.

Be sure to follow the instructions appropriate for your operating system.

Chapter 2: Installation 9

Page 10: 1200 Dqinfa Ug En

System requirements

This section explains the requirements for installing the Data Quality Management for Informatica plug-in to operate with your Informatica PowerCenter environment.

Requirements for Data Quality Management for Informatica

Installation of the Data Quality Management for Informatica plug-in requires that you have the following software and hardware specifications before installing Data Quality Management for Informatica.

Informatica versions This version of Data Quality Management for Informatica is engineered for PowerCenter 8.6.1. It is not compatible with previous versions of Informatica.

Operating systems Here is a list of the operating systems that are supported in this release.

Win 2003 Server (32-bit + 64-bit), Win XP (32-bit + 64-bit), Windows 2008 Server, and Windows Vista (SP1)

AIX 5.2 (64-bit), AIX 5.3 (64-bit), and AIX 6.1 (64-bit)

HP Itanium 11i V2 11.23 (64-bit) and HP Itanium V3 11.31 (64-bit)

Red Hat Linux 4.0 (32-bit and 64-bit) and Red Hat Linux 5.0 (64-bit)

Solaris 9 (64-bit) and Solaris 10 (64-bit)

SuSE Linux 9.0 (64-bit) and SuSE Linux 10.0 (64-bit)

The installation routine you run is specifically designed for your operating system.

System requirements for installing Data Services can be found in the Data Services documentation

10 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 11: 1200 Dqinfa Ug En

Installation

Before you can integrate Data Quality Management for Informatica into your application, you must first install the Data Services. After you’ve installed Data Services, you can then install the Data Quality Management for Informatica plug-in to your Informatica server and client machines.

Follow the instructions appropriate for your operating system:

“Install Data Quality Management for Informatica (Windows and UNIX)” on page 12

“Install Data Quality Management for Informatica on UNIX” on page 26

About installation directories

Use these two variables when you enter paths for the Informatica server and the client: $PMRootDir and Client_Install_Dir.

$PMRootDir In these installation steps, procedures may refer to the Informatica server variable $PMRootDir. This User’s Guide uses $PMRootDir as an abbreviation for the installation directory (home) of your PowerCenter Server program.

For example, assume that your PowerCenter Server is installed in informatica_server. When the instructions refer to “Enter $PMRootDir\BusinessObjects\,” you enter informatica_server\Business Objects (Windows) or /informatica_server/businessobjects (UNIX).

If you are entering a path within Informatica or the Data Services interface, you can simply enter the $PMRootDir variable rather than the whole path.

Client_Install_Dir Client_Install_Dir is an abbreviation we use to refer to the installation directory (home) of your PowerCenter client. The client is only installed on Windows.

Chapter 2: Installation 11

Page 12: 1200 Dqinfa Ug En

Install Data Quality Management for Informatica (Windows and UNIX)

These installation instructions are applicable to Windows and UNIX. However, there are additional steps to perform if you are a UNIX user. Therefore, UNIX users should read “Install Data Quality Management for Informatica on UNIX” on page 26 before proceeding with the steps in this section.

The steps There are several steps involved in installing and setting up Data Quality Management for Informatica in your Informatica environment:

“Step 1: Install Data Services” on page 13

“Step 2: Install the Data Quality Management for Informatica plug-in and samples” on page 15

“Add to environment variables” on page 26 (UNIX only)

“For the user who launches the PowerCenter Client” on page 26

“Step 3: Register the DQM_Plugin.xml file” on page 17

“Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18

“Step 5: Import and set up the sample mappings\workflows” on page 19

“Step 6: Specify the Java class path” on page 23

“Step 7: Verify the installation” on page 25

Windows 2003 installation

For Windows 2003, you must install both Informatica PowerCenter and the Data Quality Management for Informatica plug-in with either the same user account or with accounts that have the same permissions. In addition, the PowerCenter Server service must be started using an account that has permission to all files in both the PowerCenter installation and the Data Quality Management for Informatica plug-in installation. This ensures that all users can write to the files and folders necessary to run Data Quality Management for Informatica.

Though this section applies to both Windows and UNIX installations, the instructions favor Windows terminology. For example, path names contain the backward slash (dir\file), rather than the forward slash (dir/file) and show the GUI used for a Windows installation. UNIX users should substitute appropriate file names and UNIX conventions when following these instructions.

!You must install and set up Data Services before you install the Data Quality Management for Informatica plug-in.

12 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 13: 1200 Dqinfa Ug En

Step 1: Install Data Services

You must install the Data Services following the instructions in the Data Services documentation. This section contains information and additional procedures specific to installing Data Services to use with this product.

Data Services installation notes

You can find the instructions for installing, configuring, and running the Data Services portion of this product in the Data Services documentation. However, using Data Services as a component of this product presents some special circumstances not covered in the Data Services documentation.

This section describes some additional issues unique to installing Data Services as part of Data Quality Management for Informatica.

Install Data Services first You must install Data Services before installing the Data Quality Management for Informatica plug-in. After you have installed and configured Data Services, you can proceed to “Step 2: Install the Data Quality Management for Informatica plug-in and samples” on page 15.

Install Data Services on a different machine than Informatica

We recommend you install Data Services on a different machine than the one that contains your installation of Informatica PowerCenter. Information about the system requirements for the machine on which you install Data Services are included with the Data Services installation instructions.

If you install Data Services on a different machine than the one that contains your installation of Informatica PowerCenter, you must run the installation routine for the client plug-in on the machine that contains Data Services, in addition to the machines that contain your Informatica PowerCenter clients.

Data Services previously installed

If you are already a Data Services customer (prior to the purchase and installation of this product) and therefore have already installed a version of Data Services, you must upgrade to SAP BusinessObjects Data Services XI Release 3.2 (12.2.0). See the Data Services documentation for information about upgrading Data Services.

Key information about your Data Services installation

When you install Data Services, make note of the following items about your Data Services configuration:

Access server host

Access server port

Web service URL

This information is required during each installation of the client plug-in.

Install sample dataflows to Data Services

The installation package for the client plug-in contains sample Data Services dataflows. To use these sample dataflows, you must run the client plug-in installation routine on your Data Services machine, even if there is no Informatica PowerCenter client on the machine.

The installation routine detects your Data Services configuration and automatically imports the sample dataflows file.

Chapter 2: Installation 13

Page 14: 1200 Dqinfa Ug En

DPV, LACSLink, and URAC transforms

When you create a new Data Services USA Regulatory Address Cleanse (URAC) transform, the default configuration of the transform is set with the assumption that your installation of Data Quality Management for Informatica uses the Delivery Point Validation (DPV) and Locatable Address Conversion System Link (LACSLink) features. If you did not purchase one or both of these features, you must change the default settings of each URAC transform that you create.

The required change affects three options. This table shows the option, the default value of the option, and the value you should use if you are not using DPV or LACSLink.

Option Group Name/Option Default Value Value if not using DPV or LACSLink

ASSIGNMENT_OPTION/ENABLE_DPV YES or ChangeMe NO

ASSIGNMENT_OPTION/ENABLE_LACSLINK YES or ChangeMe NO

NON_CERTIFIED_OPTION/DISABLE_CERTIFICATION YES NO

14 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 15: 1200 Dqinfa Ug En

Step 2: Install the Data Quality Management for Informatica plug-in and samples

The next installation step is to install the Data Quality Management for Informatica plug-in to your Informatica Server and client machine(s), and to install the sample workflows to your Data Services configuration.

Install server plug-in to your Informatica Server(s)

You must install the server plug-in on each instance of the Informatica Servers you intend to use with this product.

To install the server plug-in to your Informatica Server:

1. From the installation package, run the file Server_setup.exe. The Welcome screen appears.

2. Click Next. The License Agreement window appears.

3. Accept the license agreement and click Next. The Ready to Install window appears.

4. Click Install to install the plug-in.

5. When the installation is complete, you are prompted to restart your computer. You don’t have to restart it now, but you must restart your computer before you can run Data Quality Management for Informatica.

Install client plug-in to your Informatica client(s) and copy the Data Services workflow samples

You must install the client plug-in to each of your Informatica client systems. When run on the machine that contains your Data Services installation, this installation routine also imports the sample Data Services workflows to the configuration.

On the machine containing an Informatica client or your Data Services installation:

1. From the installation package, run setup.exe. The Welcome screen appears.

2. Click Next. The License Agreement window appears.

3. Accept the license agreement and click Next. The PowerCenter Location window appears.

UNIX install: These instructions are for Windows installations of the server plug-in. To install the plug-in on UNIX, run the executable and follow the instructions provided. Also, if you are performing a UNIX install, proceed to “Add to environment variables” on page 26 and “For the user who launches the PowerCenter Client” on page 26 before continuing with “Step 3: Register the DQM_Plugin.xml file” on page 17.

For the purposes of using this product, an Informatica client is defined as any system that runs the Informatica PowerCenter Designer.

If the installation routine can not detect an installation of the Informatica PowerCenter client on this machine, it then skips this section and proceeds to the part of this procedure that installs the Data Services dataflows.

Chapter 2: Installation 15

Page 16: 1200 Dqinfa Ug En

4. Choose the default or click Change to browse to a destination folder where you want this product’s files to be installed, then click Next. The first Data Services Connection screen appears.

5. Enter the Data Services Access Server host name and port number in the appropriate fields, then click Next. The next Data Services Connection screen appears.

6. Enter the Data Services Web Services URL. The default value uses the Data Services Access Server IP address you gave in the previous step. Click Next. The Data Services Repository Connection Parameters screen appears.

7. In the Repository Connection Parameters screen, enter the connection details. Click Next. The RDBMS Selection screen appears.

8. In the RDBMS Selection window, choose the database type you intend to use for the creation of staging tables, and click Next.

The window that follows this step varies, depending on which database type you chose. Each database type requires different connection data. In the Connection Parameters window that appears for your database type, enter the database connection information and click Next. The Ready to Install window appears.

9. In the Ready to Install the Program window, click Install. The windows update for each steps in the installation routine.

10. Click Install to install the plug-in or the workflows.

11. When the installation is complete, you are prompted to restart your computer. You must restart your computer before you can run this product.

The Data Services sample workflows

The following table describes the Data Services workflows for this product.

Workflow Description

Job_Realtime_DQ_Informatica_datacleanse

This real-time job is used for the name cleansing. It standardizes name fields such as First Name, Middle Name, Last Name, and Job title.

Job_Realtime_DQ_Informatica_addresscleanse

This real-time job is used for the address cleansing. It cleanses address fields such as Address, City, Post Code, State, and Country of different countries. All country engines are enabled in this job.

Job_Realtime_DQ_Informatica_transaction_match

This job is used for the transactional matching. It takes the first record as a master record and the sub-sequent records as candidate records. Before doing the matching, it performs name and address cleans-ing. In the output, it gives the cleansed record and the match score of candidate records with the master record.

Job_Batch_DQ_Informatica_batch_match

This batch job is used for the batch matching. It cre-ates the different match groups and according to those match groups it creates the master record and candidate record. It outputs the record in cleansed format. It also provides the group number and the match score for every subordinate record.

16 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 17: 1200 Dqinfa Ug En

Step 3: Register the DQM_Plugin.xml file

Before you can use Data Quality Management for Informatica’s custom transformation, you must register it as a plug-in in the repository, as you would with any plug-in for Informatica PowerCenter.

Register to the repository

To register the plug-in to the repository on Informatica:

1. In the Informatica PowerCenter Administration Console, place the repository in “exclusive” mode. See the Informatica documentation for details on this step.

2. In the plug-ins tab for this node, click Register Repository Server plug-in. The Choose file window appears.

3. Browse to the location of the DQM_Plugin.xml plug-in. Select the plug-in, and click Open. The Console displays a screen to register the plug-in.

4. Enter the Repository Username and Password. Check the box for Update existing plug-in registration if you have previously registered this plug-in. Click OK.

5. The Console indicates that it is importing the plug-in. Upon completion, place the repository back into normal mode.

For more information about registering plug-ins, see your Informatica documentation.

Chapter 2: Installation 17

Page 18: 1200 Dqinfa Ug En

Step 4: Verify the Data Quality Management for Informatica toolbar

After you install the client software, the Data Quality Management for Informatica toolbar should appear in your PowerCenter Designer application.

There are four toolbar items for Data Quality Management for Informatica:

Active data cleanse transform

Passive data cleanse transform

Active transactional match transform

Active batch match transform

If the toolbar does not appear, perform the following steps:

1. In the PowerCenter Designer, open the appropriate repository.

2. Select Tools > Customize. The Customize window opens.

3. Select the Transformations and the Advanced Transformations options.

4. Click Ok.

18 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 19: 1200 Dqinfa Ug En

Step 5: Import and set up the sample mappings\workflows

The sample mappings\workflows are installed as an XML object. This object includes sources, targets, and transformations. You must import and set up the sample mappings\workflows for use with Data Quality Management for Informatica.

Import the sample mappings

To import the sample mappings\workflows, follow the normal steps (listed below) for importing an object.

1. In PowerCenter Repository Manager, choose Folder > Create. The Create Folder window opens.

2. In the Name box, enter DQMForInformaticaSamples.

3. Set the appropriate owners, groups, and permissions for the folder and click OK.

4. Choose Repository > Import Objects. The Import Wizard window opens.

5. Click Browse to locate and select the DQM_FOR_INFORMATICA_SAMPLE_WORKFLOWS.XML file and click OK. By default, the sample mappings are installed in c:\Program Files\Business Objects\Data Quality Mgmt for Informatica\SampleWorkflows.

6. Click Next. The wizard’s Select Objects to Import window opens.

Chapter 2: Installation 19

Page 20: 1200 Dqinfa Ug En

7. Click Add All to import all sample sources, targets, transformations, mappings, sessions, and workflows. Click Next. The wizard’s Resolve Conflicts in Folder window opens.

8. Resolve any conflicts, as necessary. When finished, click Close.

9. Click Import.

Set up the sample mappings

You set up the sample mappings by configuring the DQM Properties in the Edit Transformations window. This product provides five sample mappings:

m_ADC

m_PDC

m_ATM

m_ABM_SRC_TO_STG

m_ABM_PROCESS

You must repeat this procedure for each sample mapping.

To set up the sample mappings:

1. In the Mapping Designer, open one of the sample mappings (such as m_ADC).

2. Right-click Edit.

The Edit Transformations window opens.

3. Click the DQM Properties tab.

Tip: The first time that you open the Edit Transformation window, select the corner of the window and drag it so that you can view all of the tabs.

20 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 21: 1200 Dqinfa Ug En

4.

5. Set the properties as necessary. See the table in the following section to understand which properties are required or optional.

6. Click Verify ports to check that the fields in the specified Data Services dataflow correspond to the input and output ports on the Informatica transform. Data Services must be running for the verification to be successful.

7. Click Apply and then OK.

8. Repeat this process for each of the sample mappings.

Option descriptions This table describes each of the properties available for the Data Quality Management for Informatica custom transformations. Descriptions also appear in the Description box in the Edit Transformation window.

A description of any high-lighted or selected option appears in the Description box.

You must provide information for the active fields.

Option Description

Real time service Name of the Data Services real-time service with which this transformation runs. You must ensure that this service is pub-lished as a Web Service.

Break group key field Port to be used as the break group key during match opera-tions.

Number of threads The number of threads used to send and receive data with Data Services

Polling interval The number of seconds to wait before each fetch of the status of the batch job.

Number of records in a bundle

The number of records sent and received with Data Services per call.

Chapter 2: Installation 21

Page 22: 1200 Dqinfa Ug En

Processing timeout Number of seconds to wait for batch match to complete before the process times out.

Maximum records in a break group

The maximum number of records sent and received with Data Services per break group.

Connection... Produces a window that allows you to enter the connection information used to connect to Data Services. For more infor-mation, see “Connection information” on page 42.

Logging... Produces a window that allows you to enable and configure this product’s logging for events affecting the Informatica cli-ent server. For more information, see “Logging” on page 41.

Verify ports Click to check that the input and output port schema within the Informatica transformation match the input and output field schema of the Data Services real-time service. This pro-cess compares the name and the length of each input and out-put port of Informatica transformation and Data Services job.

Option Description

22 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 23: 1200 Dqinfa Ug En

Step 6: Specify the Java class path

This product uses several Java Archive (JAR) files. Within your Informatica Server(s), you must set the Java class path for these JAR files in the Informatica Workflow Manager. For Windows, the installer copies these JAR files to $PMRootDir\Business Objects. For UNIX, the installer copies these JAR files to $PMRootDir/businessobjects.

To specify the Java class path:

1. From the Informatica Server Task Developer workspace, double-click the session. The Edit Task window opens.

2. Click the Properties tab.

3. In the Java Classpath attribute, enter the following values:

For Informatica Server on Windows:

$PMRootDir\Business Objects\DQM_Link.jar;$PMRootDir\Business Objects\UniversalConnector.jar;$PMRootDir\Business Objects\axiom-api-1.2.5.jar;$PMRootDir\Business Objects\axiom-impl-1.2.5.jar;$PMRootDir\Business Objects\axis2-kernel-1.3.jar;$PMRootDir\Business Objects\commons-codec-1.3.jar;$PMRootDir\Business Objects\commons-httpclient-3.0.1.jar;$PMRootDir\Business Objects\commons-logging-1.1.jar;$PMRootDir\Business Objects\rtsClient.jar;$PMRootDir\Business Objects\stax-api-1.0.1.jar;$PMRootDir\Business Objects\UCDataPlayer.jar;$PMRootDir\Business Objects\wsdl4j-1.6.2.jar;$PMRootDir\Business Objects\wstx-asl-3.2.1.jar;$PMRootDir\Business Objects\xalan-2.7.0.jar;$PMRootDir\Business Objects\XmlSchema-1.3.2.jar

Chapter 2: Installation 23

Page 24: 1200 Dqinfa Ug En

For Informatica Server on Unix:

$PMRootDir/businessobjects/DQM_Link.jar;$PMRootDir/businessobjects/UniversalConnector.jar;$PMRootDir/businessobjects/axiom-api-1.2.5.jar;$PMRootDir/businessobjects/axiom-impl-1.2.5.jar;$PMRootDir/businessobjects/axis2-kernel-1.3.jar;$PMRootDir/businessobjects/commons-codec-1.3.jar;$PMRootDir/businessobjects/commons-httpclient-3.0.1.jar;$PMRootDir/businessobjects/commons-logging-1.1.jar;$PMRootDir/businessobjects/rtsClient.jar;$PMRootDir/businessobjects/stax-api-1.0.1.jar;$PMRootDir/businessobjects/UCDataPlayer.jar;$PMRootDir/businessobjects/wsdl4j-1.6.2.jar;$PMRootDir/businessobjects/wstx-asl-3.2.1.jar;$PMRootDir/businessobjects/xalan-2.7.0.jar;$PMRootDir/businessobjects/XmlSchema-1.3.2.jar

Most PDF viewers offer the capability to copy text.

24 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 25: 1200 Dqinfa Ug En

Step 7: Verify the installation

After installing and configuring the software, you should set up and run the sample workflows provided with this product to verify that the installation process was successfully completed. First you must shutdown and restart your computer if you haven’t already done so during the installation procedure.

This product provides the following sample workflows:

wf_ADC

wf_PDC

wf_ATM

wf_ABM

To verify the installation by running the sample mappings:

1. In PowerCenter Workflow Manager, connect your server to your repository.

2. Run one of the workflows (for example wf_ADC) using your PowerCenter Workflow Manager. If the session completes successfully, Data Quality Management for Informatica was properly installed for that module.

If the session does not properly complete, verify that you have followed all of the installation steps.

3. Repeat this process for each sample workflow.

Chapter 2: Installation 25

Page 26: 1200 Dqinfa Ug En

Install Data Quality Management for Informatica on UNIX

The installation procedures for UNIX are similar to the procedures for Windows. For installation steps that apply to both Windows and UNIX, see “Install Data Quality Management for Informatica (Windows and UNIX)” on page 12. Read this section for any additional installation steps, or for information that relates directly to your UNIX installation.

Installation directories

See “About installation directories” on page 11 to learn about directory variables that you can use when entering the default location for Informatica Server and PowerCenter.

Permissions The owner of your Informatica Server process should perform the installation if possible. Otherwise, must go back and set appropriate permissions to the Data Quality Management for Informatica files and directories after the installation.

After you perform “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18, you must add to your environment variables and install the Linux C++ runtime upgrade.

Add to environment variables

Add or edit the following environment variables in your .login or .profile file (in the Bourne shell), or in the .login or .cshrc file (in the C shell).

For the user who launches the PowerCenter Client

In the PATH environment variable, the user who launches the PowerCenter client must add the path <INFA_HOME>/java/jre/bin/client.

!Important: Before running the Informatica Server process (PMServer), you should change your default permissions for your Informatica user to rw-rw-r--. When you run the server process, certain files are overwritten and the server process owner becomes the owner of those files. Using rw-rw-r-- permissions ensures that people in the group will still have write access to these files.

Note: If you also use a stand-alone (batch) Business Objects or Firstlogic application on the same machine, you may encounter conflicts in some situations. To avoid this conflict, we recommend that you set up different user profiles: one for Data Quality Management for Informatica and one for the stand-alone application.

26 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 27: 1200 Dqinfa Ug En

Install C++ runtime upgrade (Linux and HP-UX)

If you use Red Hat Linux, you must install the GNU Standard C++ Library, version 3.2.3 (for libstdc++.so.5 support). We have found a technical issue requiring this new C++ runtime support. To learn more, go to http://www.redhat.com/apps/download and search for the keyword “libstdc++.” Be sure to review Red Hat’s specific instructions in the System Environment Libraries section related to dependencies. If you need further assistance, contact Red Hat.

If you use HP-UX, you must install the HP-UX aC++ runtime environment version A.03.26 or later. To learn more and download this runtime environment, go to http://docs.hp.com/en/5187-2783/ch15s03.html.

Continue with “Step 5: Import and set up the sample mappings\workflows” on page 19 to complete your installation.

Chapter 2: Installation 27

Page 28: 1200 Dqinfa Ug En

If you also use a stand-alone SAP BusinessObjects (Firstlogic) product

Some users of Data Quality Management for Informatica also use a stand-alone (batch) SAP BusinessObjects (Firstlogic) application. If Data Quality Management for Informatica and the stand-alone application are on the same machine, you may encounter conflicts in some situations.

What causes the conflict?

Your PATH and PW_PATH environment variables point to locations of your Business Objects products. If you try to run Data Quality Management for Informatica but your environment variable points to the stand-alone application location first, Data Quality Management for Informatica will try to use the software installed for the stand-alone application, and vice versa.

Different user profiles To avoid this conflict, Business Objects recommends that you set up different user profiles: one for Data Quality Management for Informatica and one for the stand-alone application. The environment variables in Data Quality Management for Informatica user profile should point to the location of the Data Quality Management for Informatica software. For UNIX, this location is specified in “Add to environment variables” on page 26. For Windows, the environment variables are automatically set up during installation.

The environment variables in the stand-alone application user profile should point to the location of the stand-alone application.

Scripts on UNIX If you run Data Quality Management for Informatica on a UNIX system, you can run Data Quality Management for Informatica and the stand-alone application with the same user profile. To do this, your login script must initially set the environment variables as appropriate for Data Quality Management for Informatica. You must also create a script (or enter the appropriate command) that resets your environment variables to the settings required by the stand-alone application.

1. Login with the profile set up for Data Quality Management for Informatica.

2. Start the Informatica Server process.

3. Run the script (or enter the appropriate commands) to set your environment variables for your stand-alone application. For example:

setenv PATH /postware/acesetenv PW_PATH $PATH

4. Run your stand-alone application, as needed.

5. Repeat these steps each time you log on or when you want to restart the Informatica Server process.

28 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 29: 1200 Dqinfa Ug En

Batch files on Windows

On Windows, you can use batch files to run the stand-alone application and Data Quality Management for Informatica from the same system. Follow these steps:

1. Create a batch file that sets the proper environment variables for your stand-alone application and launches the stand-alone application. The following example shows a batch file for launching an ACE job file.

2. Create additional batch files for each stand-alone application you want to run.

When you use this procedure, the application you run from the batch file uses the environment you set up in the batch file. All other applications you run use the normal environment variables set up for the user who is logged in.

A command-prompt window remains open until you close the application that you launched from the batch file.

You may want to create a shortcut to the batch files and place the shortcut on your desktop or in your Start menu. You can then double-click the shortcut to run the batch file and launch the stand-alone application.

@echo offset path=C:\pw\adm;C:\pw\ace;%path%set pw_path=C:\pw\adm;C:\pw\ace;%pw_path%pwace C:\pw\acejob.ace

These two commands insert the appropriate paths to the ACE-related environment variables before the other paths.

This line launches the ACE job-file program.

Chapter 2: Installation 29

Page 30: 1200 Dqinfa Ug En

30 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 31: 1200 Dqinfa Ug En

Chapter 3: Add and use Data Quality Management for Informatica transformations

After you’ve installed the SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter software, there are still other steps necessary before you can use and run Informatica processes that use this product.

Read this chapter to get an idea of which type of transformation, active or passive, is right for your task. Then refer to the appropriate sections of this chapter for information about setting up and using a transformation.

Chapter 3: Add and use Data Quality Management for Informatica transformations 31

Page 32: 1200 Dqinfa Ug En

Data Quality Management for Informatica transformations

Before incorporating Data Quality Management for Informatica transformations into your Informatica mappings, you must decide which type of transformation to use.

This section explains the reusability of transformation, and shows how to create transformations based on the four transformation types provided with this product.

Reusable versus nonreusable transformations

You can reuse a transformation if you create it in the Transformation Developer by using an instance of that transformation in mappings.

If you make changes in the main transformation, the changes are inherited by each instance of that transformation. You cannot make any changes within an instance of the transformation.

When you create a transformation within a mapping using the Mapping Developer, it is not reusable in other mappings, but it is modifiable.

You can make nonreusable transformations reusable by selecting the Reusable check box in the MetaData Extensions tab of the Transformation Developer. When you do this, Data Quality Management for Informatica places a reusable version of the transformation in the Transformations folder and makes your original transformation a non-modifiable instance.

If you want to fine-tune an instance of Data Quality Management for Informatica’s transformations for the mapping process into which it is incorporated, you should create the transformation within the mapping.

Create a new transformation

You can create a transformation to use with this product that is blank, or one based on a real-time service or a batch job. When you create a transformation, you are provided the Create transformation window to identify your Data Services connection and choose the type of transformation.

Option descriptions This table provides brief descriptions for each of the options on the Create transformation windows.

Option Description

Access server host The name of your Data Services host machine. Data Quality Management for Informatica automatically completes this field based on your Data Services setup.

Access server port The number of your Data Quality Access Server’s port. Data Quality Management for Informatica automatically completes this field based on your Data Quality Server setup.

Web service URL The encoded URL for the Data Services Web Service pointing to the WSDL.

Real time service The name of the Data Services Real-time service with which this transformation runs. You must ensure that this service is also published as a Web Service.

32 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 33: 1200 Dqinfa Ug En

Transformations provided with this product

This product provides the following transformations:

Passive data cleanse transformation

Active data cleanse transformation

Active transformational match transformation

Active batch match transformation

Descriptions of these transformations and the procedure for creating them are in the following sections.

Passive data cleanse transformation

For flexibility in mappings and ease of setup, Data Quality Management for Informatica’s passive data cleanse transformation is the best choice. Consider using this type of transformation if you are not using the match capability of Data Services and if you are new to Data Quality Management for Informatica.

This transformation outputs one row for every input row.

To create a passive data cleanse transformation:

1. Open the Informatica PowerCenter Designer and the appropriate repository.

2. Open the folder in which you want to work.

3. Choose Tools > Transformation Developer.

4. On the Data Quality Management for Informatica toolbar, click the DQM passive data cleanse transformation icon.

If the toolbar is not displayed, see “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for Windows) or “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for UNIX).

5. Click the area of the Transformation Developer workspace or the mapping designer where you want to place the transformation. The Create Transformation window opens.

6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and real time service. (Make sure that this real time service is published as a Web Service within Data Services.)

7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output column on the Data Services job in order, datatype, and length.)

or

Create transform based on real time service or Create transform based on batch job

Click to create a new transformation based on the sample workflows provided with this product.

Create blank transfor-mation

Click to create a new transformation that does not have ports or settings.

Option Description

Chapter 3: Add and use Data Quality Management for Informatica transformations 33

Page 34: 1200 Dqinfa Ug En

To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_PDC sample mapping that ships with this product.

Active data cleanse transformation

The active data cleanse transform provides the best performance in data cleanse operations. Consider using this type of transformation if you are not using the match capability of Data Services, but are able to configure a transformation more complicated than the passive data cleanse transformation.

To create an active data cleanse transformation:

1. Open the Informatica PowerCenter Designer and the appropriate repository.

2. Open the folder in which you want to work.

3. Choose Tools > Transformation Developer.

4. On the Data Quality Management for Informatica toolbar, click the DQM active data cleanse transformation icon.

If the toolbar is not displayed, see “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for Windows) or “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for UNIX).

5. Click the area of the Transformation Developer workspace or the mapping designer where you want to place the transformation. The Create Transformation window opens.

6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and real time service. (Make sure that this real time service is published as a Web Service within Data Services.)

7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output column on the Data Services job in order, datatype, and length.)

or

To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_ADC sample mapping that ships with this product.

Active transformational match transformation

The active transformational match transform provides real-time match capability.

To create an active transformational match transformation:

34 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 35: 1200 Dqinfa Ug En

1. Open the Informatica PowerCenter Designer and the appropriate repository.

2. Open the folder in which you want to work.

3. Choose Tools > Transformation Developer.

4. On the Data Quality Management for Informatica toolbar, click the DQM active transformational match transformation icon.

If the toolbar is not displayed, see “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for Windows) or “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for UNIX).

5. Click the area of the Transformation Developer workspace or the mapping designer where you want to place the transformation. The Create Transformation window opens.

6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and real time service. (Make sure that this real time service is published as a Web Service within Data Services.)

7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output column on the Data Services job in order, datatype, and length.)

or

To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_ATM sample mapping that ships with this product.

To use this transformation, create an Informatica sorter transformation to use just prior to this transformation in the mapping. The port for which you have specified the primary sorting order into sorter transformation leads to the break group key field.

Specify the name of this transformation’s port (mapped with sorter's primary sorting order port) in the DQM Properties’ Break group key field.

Active batch match transformation

The active batch match transform provides batch match capability.

To create an active batch match transformation:

1. Open the Informatica PowerCenter Designer and the appropriate repository.

2. Open the folder in which you want to work.

3. Choose Tools > Transformation Developer.

4. On the Data Quality Management for Informatica toolbar, click the DQM active batch match transformation icon.

Chapter 3: Add and use Data Quality Management for Informatica transformations 35

Page 36: 1200 Dqinfa Ug En

If the toolbar is not displayed, see “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for Windows) or “Step 4: Verify the Data Quality Management for Informatica toolbar” on page 18 (for UNIX).

5. Click the area of the Transformation Developer workspace or the mapping designer where you want to place the transformation. The Create Transformation window opens.

6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and batch job name. (Make sure that this batch job is published as a Web Service within Data Services.)

7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output column on the Data Services job.You can create them with any name. For simplicity, use I_Input as input port and O_Output as output port. The datatype for both ports must be String.)

or

To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_ABM sample mapping that ships with this product.

This transformation enables this product to execute the associated batch job. Make sure that the RDBMS table used in Data Services batch job can be accessed from Informatica. In Informatica, both the input and output tables of the Data Services batch job are treated as staging tables.

The following is the ETL design for using the active batch match transformation.

1. Create a mapping in Informatica that loads all source data to the Input Stage table.This input stage table is the same RDBMS table used in the Data Services batch job as the input table. Refer to the m_ABM_SRC_TO_STG sample mapping shipped with this product for detailed information.

2. Create a second mapping that has a dummy source file. This dummy source file should have only one record of one column, with a value such as “Start”. (This value is independent, so you can specify any other value.)

3. In this second mapping, you should use the active batch match transformation immediately after the dummy source. After the active batch match transformation, you can use an Informatica router transformation that leads to a SQL transformation in case of success. In this SQL transformation, access the output staging table. This output staging table is the same RDBMS table used in the Data Services batch job as the output table. Once all the matched records are returned to Informatica, you can send them directly to the target or the next stage for further processing.

For more details, refer to the m_ABM_PROCESS sample mapping provided with this product.

36 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 37: 1200 Dqinfa Ug En

If you want to execute the active batch match workflow (wf_ABM), then make these changes:

1. Open the wf_ABM workflow.

2. Go to the Task Developer and edit the s_ABM_SRC_TO_STG session.

3. Go to Mapping > Targets and specify the Relational Connection. This setting should be same as in the Data Services configuration.

For example, if you are using Oracle as the Data Services configuration and DQM_Win32 as the user, then this relational connection should be mapped with this information. Make sure that you have created this staging table for this Configuration. You can execute the batch job Job_DQ_Informatica_CreateStageTables imported from this product. This batch job creates both the input and output staging tables for that configuration.

4. Click OK and save.

5. Edit the s_ABM_PROCESS session. Click on Mapping > Transformations >TR_ABM_STG_ TO_TGT_SQL and specify the Relational Connection. This connection should be the same as in the Data Services Configuration.

Chapter 3: Add and use Data Quality Management for Informatica transformations 37

Page 38: 1200 Dqinfa Ug En

38 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 39: 1200 Dqinfa Ug En

Chapter 4: Troubleshoot Data Quality Management for Informatica

If you can’t get something to work properly within SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter, this section may help you correct the problem. This section deals mostly with issues within Data Quality Management for Informatica itself. At times, issues may actually be caused by your Informatica applications. Be sure to consult the Informatica PowerCenter documentation as well.

Chapter 4: Troubleshoot Data Quality Management for Informatica 39

Page 40: 1200 Dqinfa Ug En

Set the tracing level

In addition to setting the DQM properties, you may also want to use the Edit Transformations window to control the level of information recorded in the Informatica log files when you run a mapping. You control this through the Tracing Level setting on the Properties tab.

1. From the Transformation Developer workspace, double-click the transformation and click the Properties tab of the Edit Transformation window.

2. In the Tracing Level property, set the appropriate level.

In addition to the normal logging produced by any of the other settings, the Tracing Level Verbose Data creates two files, <Transformation Name>_Reader.log and <Transformation Name>_Writer.log, in the Informatica Server’s bin directory.

40 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 41: 1200 Dqinfa Ug En

Logging

The DQM Properties tab of the Edit Transformations screen gives you access to the logging capabilities of this product. When you click the logging button, the following screen appears.

In this screen you can configure the following settings:

Option Description

Enable logging Controls whether logging is enabled.

Client component log directory

Specifies the location of the client log files.

Server component log directory

Specifies the location of the server log files.

Enable playback Controls use of data playback files, a more detailed version of logging. This setting should remain unchecked unless used in an engagement with Customer Support.

Chapter 4: Troubleshoot Data Quality Management for Informatica 41

Page 42: 1200 Dqinfa Ug En

Connection information

The DQM Properties tab of the Edit Transformations screen gives you access to the logging capabilities of this product. When you click the logging button, the following screen appears.

In this screen you can configure the following settings:

Option Description

Access server host The name of your Data Services host machine.

Access server port The number of your Data Quality Access Server’s port.

Web service URL The encoded URL for the Data Services Web Service pointing to the WSDL.

Time out The number of seconds to try to connect to the Data Services Access server before failure.

Retry attempts The number of times to retry connection to the Data Services Access server after failure.

Delay between attempts The number of seconds in between retry attempts.

42 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 43: 1200 Dqinfa Ug En

Data Quality Management for Informatica transformation/mapping issues

The sample mappings are failing in the Workflow Manager

If you try to run the sample mappings through the Workflow Manager but they fail to complete, there are a few things to check.

Did you install in the default location?

If you installed the Data Quality Management for Informatica plug-in in a directory other than the default, you will have to edit some of the session properties and transformation properties. Edit the following items so that they point to the location where you installed the Data Quality Management for Informatica plug-in:

Location of the source in the Source Options.

Location of the target in the Target Options.

Follow these steps to get to these properties:

1. In the Workflow Manager, Double-click on the workflow in the Workflow Designer tool. The Edit Tasks window opens.

2. Select the Mapping tab.

3. Open the Source or Target folders to edit the locations.

Do the Data Quality Management for Informatica plug-in’s files and directories have appropriate permissions?

Double-check to be sure that you have properly set the Data Quality Management for Informatica plug-in files’ permissions for the person who is running the session. The directory where the sample job files are located needs write access for the session owner, for example.

Is the Java class path set correctly?

Setting the Java class path is a new step of installation with this version of Data Quality Management for Informatica. To ensure that you have set the Java class path correctly, see “Step 6: Specify the Java class path” on page 23.

Data isn’t being processed when sent through a Data Quality Management for Informatica transformation

If your mappings are running, but data isn’t being cleansed or consolidated, Data Quality Management for Informatica may not be properly installed.

If you have other transformations that are working properly, you know that Data Quality Management for Informatica is installed properly. Try running one of the mappings in the sample mapping folder if you’re unsure. See Appendix A: “Sample mappings” on page 44 for more information about running a mapping. If the samples don’t work, verify that you’ve performed all of the installation procedures correctly as documented in Chapter 2: “Installation” on page 9.

Process terminating on signal/exception

You might receive this error if you improperly edit some of the properties in a custom transformation. To fix this error, delete the custom transformation from your mapping and add a new custom transformation. Do not modify the Procedure Name property or the ProcIndex properties from their initial values.

Chapter 4: Troubleshoot Data Quality Management for Informatica 43

Page 44: 1200 Dqinfa Ug En

Sample active batch match workflow (wf_ABM) fails

If the active batch match workflow (wf_ABM) fails upon exectution, follow these steps:

1. Open the wf_ABM workflow.

2. Go to the Task Developer and edit the s_ABM_SRC_TO_STG session.

3. Go to Mapping > Targets and specify the Relational Connection. This setting should be same as in the Data Services configuration.

For example, if you are using Oracle as the Data Services configuration and DQM_Win32 as the user, then this relational connection should be mapped with this information. Make sure that you have created this staging table for this Configuration. You can execute the batch job Job_DQ_Informatica_CreateStageTables imported from this product. This batch job will create both the input and output staging tables for that configuration.

4. Click OK and save.

5. Edit s_ABM_PROCESS session.

6. Click on Mapping > Transformations >TR_ABM_STG_ TO_TGT_SQL and specify the Relational Connection. This connection should be same as in the Data Services Configuration.

44 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 45: 1200 Dqinfa Ug En

Glossary

active transformationA transformation in which Informatica PowerCenter can export records in succession without waiting for a return record.

custom transformationAn Informatica PowerCenter mapping element, which receives every row of data through input ports, performs some operation on that data set, and returns a set of output rows through output ports. These are implemented with two types of transformations, active and passive.

Data ServicesData Services is the Business Objects technology that performs data quality operations on the back end of Data Quality Management for Informatica.

dictionariesFiles that contain a lexicon of words and phrases that BusinessObjects Data Quality XI for Informatica uses to identify, parse, and standardize address data. These files have a .dct extension.

directoriesFiles of national address data compiled by the USPS and used by Data Quality to match, assign, standardize, and verify addresses. These files have a .dir extension.

discrete fieldsInput or output data that has separate fields for each piece of information, such as addresses and names.

metadata extensionsKey/value pairs defined within a transformation. These provide flexibility, allowing you to store different data types of information including a blob of information.

multiline fieldsInput or output data that has certain address and name data combined in one field.

passive transformationA transformation that outputs one row for every input row.

transformationA repository object that generates, modifies, or passes data. BusinessObjects Data Quality XI for Informatica provides both active and passive custom transformations for Informatica PowerCenter.

Glossary 45

Page 46: 1200 Dqinfa Ug En

46 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide

Page 47: 1200 Dqinfa Ug En

Index

Aactive batch match transformation, 35active data cleanse transformation, 34active transformation, 45active transformational matchtransformation, 34address cleansing

international, 8audience, 5

BBusiness Objects (Firstlogic) software

stand-alone products, 28

CC++ runtime, 27Create Transformation window, 33, 34, 35, 36custom transformations

definition, 45overview, 32

Ddata cleansing, 8Data Quality Management for Iformatica

using with Business Objects (Firstlogic) stand-alone prod-ucts, 28

Data Quality Management for Informaticainstalling, 9transformations, 32troubleshoot, 39

Data Quality Management for Informatica transformation tool-bar, 33Data Quality Management for Informatica transformations, 32Data Services

installation, 13dataflow

Data Servicessample, 13

datatypessupported, 6

defaultinstallation location, 43

Delivery Point Validation, See DPVdictionaries, 45directories

definition, 45discrete fields

definition, 45DPV, 14DQM Properties tab, 20

EETL, 8

extract, transform, and load, See ETL

Ffailed sample mapping, 43

Iinstallation

Data Services, 13UNIX, 26

international address cleansingtransformation, 8

LLACSLink, 14log file

tracing level, 40

Mmappings

troubleshoot, 43matching function, 8metadata extensions, 45multiline fields, 45

Nname cleansing, 8

transformations, 8non-reusable vs reusable

transformations, 32

Ooverview, 7

Data Quality XI for Informatica, 8

Ppassive data cleanse transformations, 33passive transformation, 45Preface, 5Properties tab, 40

Rreusable vs non-reusable

transformations, 32

Ssample mappings

failed, 43stand-alone Business Objects (Firstlogic) products, 28string datatype, 6

Index 47

Page 48: 1200 Dqinfa Ug En

Ttoolbar, 33tracing level, 40

property, 40transformation, 45

active batch match, 35active data cleanse, 34active transformational match, 34create, 32creating and using, 31name cleansing, 8overview, 32passive data cleanse, 33reusable vs non-reusable, 32toolbar, 33troubleshoot, 43

transformsURAC, 14

troubleshoot, 39

UUNIX

setting up user profiles, 28user profiles, 28

Vversions, 6

WWindows

setting up user profiles, 29

48 SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User’s Guide