Informatica Data Virtualization: The “Foundation” for AGILITY & PRODUCTIVITY. Kerry Holton, Informatica Senior Sales Engineer

  • Slide 1
  • Informatica Data Virtualization: The Foundation for AGILITY & PRODUCTIVITY. Kerry Holton, Informatica Senior Sales Engineer
  • Slide 2
  • Informatica Corporation Confidential Do Not Distribute. Let's Win Something!!! Take some good notes! A copy of Lean Integration. Tell me which box is the ONLY thing that data virtualization built on data federation does, and why. Answer questions along the way.
  • Slide 3
  • To Learn More. Sign-Up: Expert Roundtables, Data Virtualization Corner, http://vip.informatica.com/?elqPURLPage=8668. JOIN & DISCUSS: 2000+ Strong Data Virtualization & Data Services Architecture Group. Informatica.com > Products > PowerCenter > Data Virtualization Edition. Informatica.com > Products > Data Virtualization.
  • Slide 4
  • Agenda: 2012, The Year of BI Agility; Data Virtualization Overview, Problem & Need; Key Use Cases; Customer Examples; Data Virtualization in Action; Why Informatica?; Next Steps & Q&A.
  • Slide 5
  • ICC Director (VP of IM) to Dave Lyle (VP Product Strategy), end of Q3, 2009: "I'm writing you a million dollar check, but you're not solving my big problem. My big problem isn't getting the data into the data warehouse. My big problem is getting the data out!"
  • Slide 6
  • 2012: Our world will be turned upside down. BI will be the top priority for the CIO in 2012! "Demands by users of business intelligence (BI) applications to 'just get it done' are turning typical BI relationships, such as business/IT alignment and the roles that traditional and next-generation BI technologies play, upside down. As business users demand more control over BI applications, IT is losing its once-exclusive control over BI platforms, tools, and applications." Boris Evelson, Forrester Research, Blog: Top 10 BI Predictions for 2012. Have any of you had this discussion? Need for a new BI infrastructure; replacing spreadsheets; faster data access & reporting. Business-focused BI: $100M Qtr. in 2011; 10k+ customers. Diagram labels: Business, BI, IT.
  • Slide 7
  • How Long Does it Take to Deliver New Critical Data or Reports to the Business?
  • Slide 8
  • The Business Can't Wait 3-6 Months For a Single View of All Enterprise Data. Diagram labels: Applications, Partner Data, SWIFT, NACHA, HIPAA, Unstructured, Databases, Social, Warehouses, NoSQL, Cloud Computing; SOA, ESB/EAI, ETL, EII, Hand Coding; Business Intelligence.
  • Slide 9
  • Overview
  • Slide 10
  • HealthNow's Data Integration Challenges. 16 types of data sources. Different price info in each LOB. To add 1 product attribute to an existing report, IT estimated 1700 hours. So what did the business do? 30,000 data marts were created by shadow IT teams. NO REUSE. Diagram labels: Business, IT; Product Config Mgmt (MS SQL Server); Facets [Benefits, Products] (Sybase ASE); Data Warehouse (DB2); 30,000 Data Marts (MS Access); BI (Cognos); Portal (WebSphere).
  • Slide 11
  • The Fundamental Problem(s). It takes too long to explain requirements. It takes months to change a DW / add new critical data. It takes many iterations to get the right data / reports. Changes can break existing integrations & impact apps. Typical Data Integration Process: 1. Design; 2. Change; 3. Integrate; 4. Unit Test; 5. Validate; 6. Deploy. Business is involved too late. As-Is Value Stream Map (LOT OF WAIT & WASTE).
  • Slide 12
  • Trying to Solve it in BI Layer Just Won't Scale. Why? No reuse. No common data access layer. No easy way to handle change. No data quality & no data consistency. Diagram labels: Applications, Unstructured Data, Spread Marts, DATA MART, EDW.
  • Slide 13
  • What is Needed to Solve these Problems? Think Virtual Machines for DATA! COMMON ACCESS LAYER ACROSS MANY DATA SOURCES. DATA ABSTRACTION & REUSE OF SKILLS/LOGIC. FAST, DIRECT ACCESS TO DATA THE BUSINESS TRUSTS. SUPPORT ALL USE CASES: BI / DW, MDM, SOA. Diagram labels: Data Consumers (Portal, BI, Composite Apps); Data Abstraction: Logical View of All Underlying Data; Logical Data Objects (PRODUCT, CUSTOMER, ORDER); Enterprise Data Sources.
  • Slide 14
  • How is the Market Trying to Address the Problems? Data Virtualization (Built-On Data Federation): addresses specific use cases; no data movement / no copies / only federation; code heavy / not model-based / no reuse; not tools for business self-service; SQL/XQuery-only transformations; no data profiling / no data quality. It's like ONE step forward & TWO steps backward: time GAINED by federation is nullified by time SPENT on more processing. Diagram labels: DW; BI; Virtual View (Access, Merge, Deliver); Limited or Data Source Profiling Only; SQL/XQuery Only Transformations & No Data Quality; Cannot Easily Move to Persistent Store or Reuse.
  • Slide 15
  • What Are the Top 3 Key Capabilities for a Project that Needs Data Virtualization? If performance is a given. Source: Informatica Data Virtualization Experts Forum, 2011. Dataset: 600.
  • Slide 16
  • Are We Talking About TWO Separate Tools?
  • Slide 17
  • What Does the Ideal Solution Look Like? Diagram: Business and IT (Business Manager; Analyst, Steward; Developer, Architect) share Common Metadata around a Virtual Table, with seven numbered capabilities: 1. MODEL (Customer: Name, Address, Category, Orders); 2. ACCESS & MERGE (CRM, Accounts); 3. PROFILE IN RT; 4. TRANSFORM IN RT (Advanced Transformations, Data Quality, Data Masking); 5. REUSE INSTANTLY (Batch, Web Services; Query Engine, WS Server); 6. MOVE OR FEDERATE (Accounts, Call Center, DW); 7. SCALE & PERFORM (Optimizations & Caching).
  • Slide 18
  • How Does Informatica Deliver the Ideal Solution? Data Virtualization = (Data Integration + Data Federation) in ONE Tool. Single environment for both data integration and data federation. No data movement / no copies, but easily reuse virtual views for batch. Early & iterative business (analyst) involvement, self-service. Pre-built library of rich ETL-like advanced data transformations. Integrated real-time, on-the-fly data profiling & data quality. Diagram labels: DW; BI; Virtual View (Access, Merge, Deliver); Prototype First; Move to DW or Instantly Reuse as SQL / WS; Advanced Transformations & Data Quality; Analyze & Profile Data & Logic Anytime; Early Business Involvement.
  • Slide 19
  • How Does It Work? NEW REQUEST: change / add an attribute; join new data not in DW; create a new report. EXISTING QUERY (retrieve historical customer data): SELECT * FROM customer_table. NEW QUERY: SELECT * FROM customer_table INNER JOIN support_table ON customer_table.customer_num = support_table.customer_id WHERE customer_name = 'ACME'. Flow: new query for report needing data not in DW; query is processed by virtualization layer; results retrieved in real-time without data movement; data quality rules applied on-the-fly against data; trusted blend of historical and operational data delivered; on-boarding new data does not break integrations; virtual view can be physically materialized later into DW; complement data architecture with virtualization. INSTANT REUSE. NEW DATA & REPORTS THAT BUSINESS NEEDS & TRUSTS, DELIVERED IN DAYS vs. MONTHS. Diagram labels: DM, WEB, Cust DW, ODS, DW, txt; PRODUCT, INVOICE, CUSTOMER, SUPPORT.
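The flow on this slide (a new query joins warehouse data with operational data that is not in the warehouse, without copying either) can be sketched roughly as follows. This is an illustration only, not Informatica's engine: two attached in-memory SQLite databases stand in for the separate sources, the schema names `dw` and `ops`, the view name `customer_support`, and the sample rows are all assumptions, and only the table and column names come from the slide's query.

```python
import sqlite3

# One connection plays the role of the virtualization layer; the two
# attached in-memory databases stand in for separate physical sources.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS dw")   # hypothetical warehouse
conn.execute("ATTACH DATABASE ':memory:' AS ops")  # hypothetical operational system

conn.execute("CREATE TABLE dw.customer_table (customer_num INTEGER, customer_name TEXT)")
conn.execute("CREATE TABLE ops.support_table (customer_id INTEGER, ticket TEXT)")
conn.executemany(
    "INSERT INTO dw.customer_table VALUES (?, ?)",
    [(1, "ACME"), (2, "Globex")],
)
conn.executemany(
    "INSERT INTO ops.support_table VALUES (?, ?)",
    [(1, "T-100"), (1, "T-101"), (2, "T-200")],
)

# The "virtual view": a TEMP view may span attached databases in SQLite.
# No rows are copied; the join runs when the view is queried.
conn.execute(
    """
    CREATE TEMP VIEW customer_support AS
    SELECT c.customer_num, c.customer_name, s.ticket
    FROM dw.customer_table AS c
    INNER JOIN ops.support_table AS s
        ON c.customer_num = s.customer_id
    """
)

# The new report query only sees the view, not the underlying sources.
rows = conn.execute(
    "SELECT customer_name, ticket FROM customer_support "
    "WHERE customer_name = ? ORDER BY ticket",
    ("ACME",),
).fetchall()
print(rows)
```

Because consumers query the view rather than the tables, a source can be added or moved behind it without changing the report query, which is the "does not break integrations" point on the slide.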
  • Slide 20
  • Informatica Data Virtualization at HealthNow. Common Data Model. Fast, Direct Data Delivery: 1 week (vs. 3 months). Instant Reuse: DW, BI, SOA & MDM (SQL, Web Services, Batch). Shared Repository. Diagram labels: Business, IT; Virtual Table (PRODUCT, ORDER, MEMBER, CLAIM); Product Config Mgmt (MS SQL Server); Facets [Benefits, Products] (Sybase ASE); Data Warehouse (DB2); 30,000 Data Marts (MS Access); BI (Cognos); Portal (WebSphere); NO REUSE.
  • Slide 21
  • What Does Informatica's Data Virtualization Solution Look Like? PowerCenter Data Virtualization Edition: new PowerCenter Edition for AGILITY & PRODUCTIVITY. Combines: data integration (PowerCenter SE); data virtualization (IDS Full Use); data profiling (IDE Full Use); business-IT collaboration (Analyst). Packaged for simplicity and attractively priced. Reuses existing skills and resources. Diagram labels: Data Federation (Data Services); Developer Tool; Analyst Tool; Data Profiling; ETL (PC Standard Edition); Partitioning; 2 Adapters (PWX for Relational); NEW.
  • Slide 22
  • What Use Cases Are Supported? 1. DW/Business Intelligence (BI): prototype DW & accelerate new data & reports from months to days. 2. MDM: deliver a complete view of master & transactional data in real-time. 3. SOA: deliver the missing data services layer to SOA & applications. Diagram labels: Business, IT; Change Request, Deploy to Production, Months, Weeks/Days; INCOMPLETE VIEW OF CUSTOMER, MDM HUB, TRANSACTIONAL SYSTEMS, DATA WAREHOUSE, Virtual View, COMPLETE VIEW OF CUSTOMER; Applications, Data Sources, Registry, ESB, BPM, Biz. Services, Data Abstraction.
  • Slide 23
  • What are the Benefits of Informatica's Solution? Provide fast, direct access to critical new data & reports in days vs. months. Enable rapid iterations to results with instant Biz-IT collaboration. Deliver flexibility, ensure reuse & insulate applications from changes. COMPLETE, CURRENT & TRUSTED View of All Data, On-Demand.
  • Slide 24
  • Customer Examples
  • Slide 25
  • BI, MDM, SOA: HealthNow NY Improves Risk & Pricing Analysis With Data Services. The Challenge: 16 enterprise databases and over 30,000 Access databases; took 1700 man hours to add a new product to portfolio; business had to go to 5 different sources for all information related to paid claims; continued data growth with over 30,000 claims processed per day; data proliferation leading to HIPAA compliance concerns. The Solution: logical data models and data services to represent their core data entities (MEMBER, CLAIMS, PROVIDER, ENCOUNTER, LAB RESULTS); Rate Letter project for determination of policy rates and discounts went live in May 2010; over 400 logical data objects and 2 web services being used by around 125 end users. The Benefits: speed of data delivery (implemented first project in around 40 man hours, which would have taken an order of magnitude more in the past); complete view of the truth (business users now access plan rate information from a single service); better governance (centrally managed virtual views, as opposed to one-off data marts, are improving governance of data). Diagram labels: BI (Cognos); Portal (WebSphere); SQL, Web Service; IDS Virtual Table; Product Config Mgmt (MS SQL Server); Facets [Benefits, Products] (Sybase ASE); Data Warehouse (DB2); Data Marts (MS Access).
  • Slide 26
  • BI, SOA: Large Latin American Bank Improves Governance. The Challenge: lack of visibility for proper supervision and regulation of the national financial system; real-time analysis and joining of data (Adabas, DB2, SQL Server, files); persistent data replication even for one-time use; huge data volumes (online 6 TB, DW 14 TB); different reporting tools requesting different data combinations across heterogeneous data sources. The Solution: logical data models to represent core business entities (e.g. CUSTOMER); mainframe virtualization (join data from Adabas, DW DB2, apps., 3rd party); logical data models and Web services to deliver flexibility and agility to respond to changing business needs; creation of logical data objects and physical materialization of virtual views to the familiar PowerCenter environment. The Benefits: speed of data delivery (implemented first project in around 60 man hours and delivered a new virtual view in < 1 hour); better risk/fraud governance (across more than 6000 financial institutions) and compliance with BASEL I, BASEL II and SOX; complete single view of the truth (business users can now access consistent customer and plan rate data); centralized management and administration of logical data objects. Diagram labels: Microsoft Reporting Services; Customized Applications; SQL, Web Service; Data Virtualization Virtual Table; Financial Institutions (Flat Files and Messages); Credit Analysis, Applications, AML (SQL Server); Data Warehouse (DB2 LUW); Transactions Tables (Mainframe Adabas, DB2).
  • Slide 27
  • BI, MDM: VW Delivers a Complete View of Critical Data On-Demand. Challenge: CUSTOMER data in > 30 systems, MDM hub, transaction systems, DW; have 80% of data but missing critical 20% of transactions (WARRANTY, SERVICE); no authoritative source of CUSTOMER, PRODUCT data, conflicting relationships; no complete view of CUSTOMER data on-demand is affecting service; without a complete view of data, can't meet goal to sell 3x more cars by 2018. Solution: create a common data model for VW owners, prospects, & partners; federate data in real-time from > 30 systems & transactional systems; provide easy-to-use, browser-based tools for business & IT to collaborate; apply reusable DQ rules on-the-fly to CUSTOMER, PRODUCT data; instantly reuse data services for SQL or Web services. Completed DI, DQ, & data services production pilot in
  • Slide 28
  • Data Virtualization in Action
  • Slide 29
  • The Keystone: Business Owns the Data While IT Retains Control. Role-based tools for Analysts (Web) & IT developers (Eclipse). Common metadata lets Analysts & IT collaborate in RT. Empower business analysts to: define entities & directly access & merge data to create virtual views; rapidly profile data sources & logic without more processing; quickly find data & rules via business glossary; collaborate, test, validate & share results. Cuts the wait & the waste in the process. Diagram labels: BI Report; Portal; SQL or Web Service; VIRTUAL TABLE; Common Metadata; Analyst Tool (Web Browser); Developer Tool (Eclipse); Data Warehouse; Batch ETL.
  • Slide 30
  • The 7 Steps to AGILITY & PRODUCTIVITY. Diagram: Business and IT (Business Manager; Analyst, Steward; Developer, Architect) share Common Metadata around a Virtual Table, with seven numbered steps: 1. MODEL (Customer: Name, Address, Category, Orders); 2. ACCESS & MERGE (CRM, Accounts); 3. PROFILE IN RT; 4. TRANSFORM IN RT (Advanced Transformations, Data Quality, Data Masking); 5. REUSE INSTANTLY (Batch, Web Services; Query Engine, WS Server); 6. MOVE OR FEDERATE (Accounts, Call Center, DW); 7. SCALE & PERFORM (Optimizations & Caching).
  • Slide 31
  • 1. Model. Represent underlying data as business entities (CUSTOMER). Provide a common logical view or abstraction of all data. Import logical model from 200+ modeling tools (ERWIN). Use visual and metadata based mapping language. Instantly reuse logical data object for all applications. Diagram labels: Unstructured Data, Applications, Spread Marts, EDW, Data marts; Common Data Access Layer; Logical Data Object (PRODUCT, INVOICE, CUSTOMER, ORDER).
  • Slide 32
  • 2. Access and Merge. Turn many data sources into ONE with Data Virtualization. Diagram labels: Application, Partner Data, SWIFT, NACHA, HIPAA, Cloud Computing, Unstructured, Database, Social, Warehouses, NoSQL; Analytical Data, Interactional Data, Transactional Data, Archived Data, Master Data; PRODUCT, INVOICE, CUSTOMER, SUPPORT.
  • Slide 33
  • 3. Profile in RT. Rich set of integrated profiling capabilities to find data anomalies and to discover keys and hidden relationships: Column & Rule Profiling; Midstream or Comparative Profiling; Join & Overlap Analysis; Primary Key / Foreign Key Profiling; Dependency Profiling.
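Two of the profiling types named on this slide, column profiling and primary key / foreign key (overlap) analysis, can be illustrated with a small sketch. It is a simplified illustration and not the product's profiler; the function names `profile_column` and `foreign_key_overlap` and the sample rows are hypothetical.

```python
from collections import Counter

# Hypothetical sample rows; a real profiler would read these from the sources.
customers = [
    {"customer_id": 1, "name": "ACME", "state": "NY"},
    {"customer_id": 2, "name": "Globex", "state": None},
    {"customer_id": 3, "name": "Initech", "state": "ny"},
]
orders = [
    {"order_id": 10, "customer_id": 1},
    {"order_id": 11, "customer_id": 1},
    {"order_id": 12, "customer_id": 4},  # orphan: no such customer
]


def profile_column(rows, column):
    """Column profile: row count, null count, distinct count, uniqueness."""
    values = [row[column] for row in rows]
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)
    return {
        "rows": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(counts),
        # Unique and never null: a candidate primary key.
        "is_unique": len(counts) == len(values),
    }


def foreign_key_overlap(child_rows, child_col, parent_rows, parent_col):
    """Overlap analysis: how many child keys find a matching parent key."""
    parent_keys = {row[parent_col] for row in parent_rows}
    child_keys = [row[child_col] for row in child_rows]
    matched = sum(1 for key in child_keys if key in parent_keys)
    orphans = sorted({key for key in child_keys if key not in parent_keys})
    ratio = matched / len(child_keys) if child_keys else 0.0
    return {"match_ratio": ratio, "orphans": orphans}


id_profile = profile_column(customers, "customer_id")
state_profile = profile_column(customers, "state")
overlap = foreign_key_overlap(orders, "customer_id", customers, "customer_id")
print(id_profile, state_profile, overlap)
```

Here `customer_id` profiles as a candidate key, `state` shows a null and two spellings of the same value (an anomaly a quality rule could fix), and the overlap check surfaces the orphaned order.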
  • Slide 34
  • 4. Transform in RT. Metadata-driven, codeless, graphical environment. Rich, pre-built library of advanced transformations. Integrated Data Quality transformations. Define policies to mask sensitive data in real time.
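The idea of applying quality rules and masking policies as rows are read, rather than in a separate batch step, can be sketched as below. The slide describes a codeless, graphical environment, so this hand-written version is only a conceptual stand-in; the rule and policy names, column names, and masking formats are all assumptions.

```python
import re


def mask_ssn(value):
    """Masking rule: keep only the last four digits of an SSN-like string."""
    digits = re.sub(r"\D", "", value)
    return "***-**-" + digits[-4:]


def mask_email(value):
    """Masking rule: keep the first character of the local part and the domain."""
    local, _, domain = value.partition("@")
    return local[:1] + "***@" + domain


def standardize_name(value):
    """Quality rule: trim, collapse inner whitespace, upper-case."""
    return " ".join(value.split()).upper()


# Hypothetical policies: column name -> rule to apply.
QUALITY_RULES = {"customer_name": standardize_name}
MASKING_POLICY = {"ssn": mask_ssn, "email": mask_email}


def transform_row(row):
    """Apply quality rules first, then masking, to a single row."""
    out = {}
    for column, value in row.items():
        if value is not None and column in QUALITY_RULES:
            value = QUALITY_RULES[column](value)
        if value is not None and column in MASKING_POLICY:
            value = MASKING_POLICY[column](value)
        out[column] = value
    return out


def query(rows):
    """Yield transformed rows on the fly; the source rows are never modified."""
    for row in rows:
        yield transform_row(row)


source_rows = [
    {"customer_name": "  acme   corp ", "ssn": "123-45-6789", "email": "jane@example.com"},
]
result = list(query(source_rows))
print(result)
```

Keeping the rules in lookup tables separate from the row loop mirrors the slide's point that policies are defined once and applied wherever the data is read.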
  • Slide 35
  • 5. Reuse Instantly. Instantly reuse LDOs for any mode/protocol (SQL, WS). Single-click deployment to batch. Execution & optimization separate from design-time. No re-development & re-building of LDOs.
  • Slide 36
  • 6. Move or Federate. Data Integration (Extract; Advanced Transform & Quality; Load): majority of use cases; physical data movement; bulk/batch, near real-time, real-time; advanced transformations; built-in data quality. Data Federation (Virtual View: Access, Merge, Deliver): specific use cases; no data movement / no copies; real-time federation; SQL/XQuery-only transformations; no data quality / business validation. Single-click deployment to PowerCenter (batch). Diagram labels: BI, DW.
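The choice between federating (query the virtual view live) and moving (physically materialize the same view) can be sketched with one view definition used both ways. SQLite stands in for the sources and the target, and the table, view, and function names are hypothetical; the sketch illustrates the concept and is not the PowerCenter deployment mechanism.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (id INTEGER, name TEXT)")
conn.execute("CREATE TABLE calls (account_id INTEGER, minutes INTEGER)")
conn.execute("INSERT INTO accounts VALUES (1, 'ACME')")
conn.executemany("INSERT INTO calls VALUES (?, ?)", [(1, 10), (1, 20)])

# One definition of the logic, written once as a view.
conn.execute(
    """
    CREATE VIEW account_calls AS
    SELECT a.id, a.name, SUM(c.minutes) AS total_minutes
    FROM accounts AS a
    JOIN calls AS c ON c.account_id = a.id
    GROUP BY a.id, a.name
    """
)


def federate(connection):
    """Federate: run the view against live data, no copy is kept."""
    return connection.execute(
        "SELECT id, name, total_minutes FROM account_calls ORDER BY id"
    ).fetchall()


def materialize(connection):
    """Move: persist the view's current result as a physical table."""
    connection.execute("DROP TABLE IF EXISTS dw_account_calls")
    connection.execute(
        "CREATE TABLE dw_account_calls AS "
        "SELECT id, name, total_minutes FROM account_calls"
    )


live_before = federate(conn)
materialize(conn)
conn.execute("INSERT INTO calls VALUES (1, 15)")  # new operational data arrives
live_after = federate(conn)
snapshot = conn.execute(
    "SELECT id, name, total_minutes FROM dw_account_calls ORDER BY id"
).fetchall()
print(live_before, live_after, snapshot)
```

The federated result picks up the new call immediately, while the materialized table stays a snapshot until it is reloaded; both come from the same view definition, so the logic is not rebuilt when switching modes.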
  • Slide 37
  • 7. Scale & Perform. Leverage the proven, high-performance Informatica engine. Optimized SQL query engine & graphical query plan. High-performance Web services server. Rich set of optimizations & caching mechanisms: rule based, cost based, push down, early projection, early selection, semi-join, virtual table & result set caching. Fine grained access control, WS-Security & pass-through security: database, schema, table, column, row-level (v9.5) security.
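Three of the optimizations listed here, early selection, early projection, and result set caching, can be illustrated with a toy query layer. This is a sketch under stated assumptions, not a description of the Informatica engine: the `CachingQueryLayer` class, the `accounts` table, and its rows are hypothetical, and the cache has no expiry or invalidation.

```python
import sqlite3

# A stand-in data source.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE accounts (id INTEGER, name TEXT, status TEXT, notes TEXT)")
source.executemany(
    "INSERT INTO accounts VALUES (?, ?, ?, ?)",
    [
        (1, "ACME", "ACTIVE", "long free-text notes"),
        (2, "Globex", "INACTIVE", "long free-text notes"),
        (3, "Initech", "ACTIVE", "long free-text notes"),
    ],
)


class CachingQueryLayer:
    """Toy result set cache keyed by SQL text and parameters."""

    def __init__(self, connection):
        self.connection = connection
        self.cache = {}
        self.source_hits = 0  # how many times the source was actually queried

    def run(self, sql, params=()):
        key = (sql, tuple(params))
        if key not in self.cache:
            self.source_hits += 1
            self.cache[key] = self.connection.execute(sql, params).fetchall()
        return self.cache[key]


layer = CachingQueryLayer(source)

# Early selection and projection: the filter and the column list are pushed
# down to the source, so only the needed rows and columns leave it.
pushed_down = "SELECT id, name FROM accounts WHERE status = ? ORDER BY id"

first = layer.run(pushed_down, ("ACTIVE",))
second = layer.run(pushed_down, ("ACTIVE",))  # served from the cache
print(first, layer.source_hits)
```

The unoptimized alternative would be `SELECT *` followed by filtering in the virtualization layer, which moves every row and the wide `notes` column across the wire before discarding most of them.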
  • Slide 38
  • Data Virtualization Built On Data Federation Does 1 Box. Which 1? Diagram: Business and IT (Business Manager; Analyst, Steward; Developer, Architect) share Common Metadata around a Virtual Table, with seven numbered boxes: 1. MODEL (Customer: Name, Address, Category, Orders); 2. ACCESS & MERGE (CRM, Accounts); 3. PROFILE IN RT; 4. TRANSFORM IN RT (Advanced Transformations, Data Quality, Data Masking); 5. REUSE INSTANTLY (Batch, Web Services; Query Engine, WS Server); 6. MOVE OR FEDERATE (Accounts, Call Center, DW); 7. SCALE & PERFORM (Optimizations & Caching).
  • Slide 39
  • Do it Right, Avoid Costly Mistakes! Enabling Rapid Development: model & metadata-driven environment (Sustain & Maintain) vs. 1000s of lines of code for business rules, SQL, Web services (Maintenance Nightmare); affects TIME, COST. Analyzing & Profiling: profile data AND logic anywhere (Get it Right 1st Time) vs. only source profiling, need extra processing (Many Iterations & Mistakes); affects TIME, COST, RISK. Integrating with Quality: leverage pre-built logic including quality (Bake-in Quality) vs. hand-coding can't do advanced transforms: SQL, XQuery, simple cleansing, Web service (Limited Rules, No Data Quality); affects TIME, COST, RISK. Leveraging Investments: naturally extend your infrastructure (Re-purpose Logic & Skills) vs. re-work, re-deploy & re-train every time (Re-invent the Wheel); affects TIME, COST. Scaling with Flexibility: virtualize or physically materialize in 1 tool (Prototype First & Then Scale) vs. non-integrated technologies, EII optimizations (Overburden); affects TIME, COST, RISK. Diagram labels: Virtual Table; Data Virtualization; EII.
  • Slide 40
  • Data Virtualization in Action
  • Slide 41
  • Scenario: Big Company. ISSUES: call center talk times increasing = scattered data + many screens; time wasted in correcting inconsistent & inaccurate customer data; agents can't easily & quickly identify what products are owned. IMPACT: can't easily identify top customers to improve up-sell/cross-sell; low customer satisfaction & growing customer attrition; high marketing costs without targeted campaigns.
  • Slide 42
  • Demo: Big Company. Business needs a new report NOW vs. months! Quickly merge data from multiple systems & cleanse. Analysts know the data, want some self-service. Join CUSTOMER (Oracle CRM) & ORDER (file). Get ORDER TOTAL for ACTIVE customers. Integrate missing data, do data cleansing on-the-fly, validate. Analyst: defines business entity, profiles, defines rules & hands over to IT. IT Architect / Developer: enriches the business entity & publishes for BI tool, portal or batch.
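The demo scenario (join CUSTOMER from a CRM database with ORDER from a file, cleanse on the fly, and report the order total for ACTIVE customers) can be approximated in a few lines. This is a hand-coded stand-in for what the demo does with graphical tools: an in-memory SQLite table plays the Oracle CRM, an in-memory CSV string plays the order file, and the column names and sample values are assumptions.

```python
import csv
import io
import sqlite3
from collections import defaultdict
from decimal import Decimal

# Stand-in for the CRM source (the slide names Oracle CRM).
crm = sqlite3.connect(":memory:")
crm.execute("CREATE TABLE customer (customer_id INTEGER, name TEXT, status TEXT)")
crm.executemany(
    "INSERT INTO customer VALUES (?, ?, ?)",
    [(1, " acme ", "ACTIVE"), (2, "Globex", "INACTIVE"), (3, "Initech", "ACTIVE")],
)

# Stand-in for the ORDER flat file.
ORDER_FILE = "customer_id,amount\n1,100.50\n1,49.50\n2,75.00\n3,20.00\n"


def order_totals_for_active_customers(crm_conn, order_file_text):
    """Merge the file with the CRM table and total orders per ACTIVE customer."""
    # Aggregate the file side first, keyed by customer id.
    totals = defaultdict(Decimal)
    for record in csv.DictReader(io.StringIO(order_file_text)):
        totals[int(record["customer_id"])] += Decimal(record["amount"])

    report = []
    active = crm_conn.execute(
        "SELECT customer_id, name FROM customer "
        "WHERE status = 'ACTIVE' ORDER BY customer_id"
    )
    for customer_id, name in active:
        clean_name = name.strip().upper()  # simple on-the-fly cleansing rule
        report.append((customer_id, clean_name, totals.get(customer_id, Decimal("0"))))
    return report


report = order_totals_for_active_customers(crm, ORDER_FILE)
print(report)
```

The INACTIVE customer's order is excluded by the filter on the CRM side, and the padded, lower-case name is standardized as it is read, without changing either source.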
  • Slide 43
  • Why Informatica?
  • Slide 44
  • Why Informatica? Power of The Platform: THE BEST OF DATA INTEGRATION (SOPHISTICATION) and THE BEST OF DATA VIRTUALIZATION (AGILITY). ONLY INFORMATICA COMBINES INTO ONE SOLUTION THAT REUSES SKILLS. Gartner Magic Quadrant for Data Integration Tools, 2011: "The ability to switch seamlessly and transparently between delivery modes (bulk / batch vs. granular real-time vs. federation) with minimal rework will be key for IT organizations seeking to develop a successful data integration strategy." Ted Friedman, VP Distinguished Analyst, Gartner. The Forrester Wave: Data Virtualization, Q1 2012: "With v9, Informatica advanced its capabilities with on-the-fly data quality and profiling, a model-driven approach to provisioning data services, performance enhancements, cloud integration, common metadata, and role-specific tools." Diagram labels: 2009-10.
  • Slide 45
  • Only Informatica Provides ONE Solution for Data Integration and Federation. Single environment for both data integration and data federation. No data movement / no copies, but can easily reuse virtual views for batch. Early & iterative business (analyst) involvement, efficient collaboration. Pre-built library of rich ETL-like advanced data transformations. Integrated real-time, on-the-fly data profiling & data quality. Diagram labels: DW; BI; Virtual View (Access, Transform, Deliver); Prototype First; Move to DW or Instantly Reuse as SQL/WS; Advanced Transformations & Data Quality; Analyze & Profile Data & Logic Anytime; Early Business Involvement.
  • Slide 46
  • Next Steps & Q&A
  • Slide 47
  • Have the Conversation with the Business! New data & reports take too long. YOU can now do it in DAYS! 1. Identify a Critical Project in Your Company. 2. Involve the Business Early & Often. 3. Bake-In Quality & Support Advanced Logic. 4. Demonstrate Business Value Early. 5. Self-Service + Data Virtualization = ROI. Diagram labels: Business, IT.
  • Slide 48
  • Next Steps & Q&A. Sign-Up: Expert Roundtables, Data Virtualization Corner, http://vip.informatica.com/?elqPURLPage=8668. JOIN & DISCUSS: 2000+ Strong Data Virtualization & Data Services Architecture Group. Informatica.com > Products > PowerCenter > Data Virtualization Edition. Informatica.com > Products > Data Virtualization.