32
Module 1: Introduction to Data Warehousing and OLAP

Module 1: Introduction to Data Warehousing and OLAP

Embed Size (px)

Citation preview

Page 1: Module 1: Introduction to Data Warehousing and OLAP

Module 1: Introduction to Data Warehousing

and OLAP

Page 2: Module 1: Introduction to Data Warehousing and OLAP

Overview

Introducing Data Warehousing

Defining OLAP Solutions

Understanding Data Warehouse Design

Understanding OLAP Models

Applying OLAP Cubes

Page 3: Module 1: Introduction to Data Warehousing and OLAP

Introducing Data Warehousing

Raw Data vs. Business Information

OLTP Source Systems

Data Warehouse Characteristics

Data Warehouse System Components

Page 4: Module 1: Introduction to Data Warehousing and OLAP

Raw Data vs. Business Information

Capturing Raw Data

Gathering data recorded in everyday operations

Deriving Business Information

Deriving meaningful information from raw data

Turning Data into Information

Implementing a decision support system

Page 5: Module 1: Introduction to Data Warehousing and OLAP

OLTP Source Systems

OLTP System Characteristics

Processes real-time transactions of a business

Contains data structures optimized for entries and edits

Provides limited decision support capabilities

OLTP Examples

Order tracking

Customer service

Point-of-sales

Service-based sales Banking functions

Page 6: Module 1: Introduction to Data Warehousing and OLAP

Data Warehouse Characteristics

Provides Data for Business Analysis Processes

Integrates Data from Heterogeneous Source Systems

Combines Validated Source Data

Organizes Data into Non-Volatile, Subject-Specific Groups

Stores Data in Structures that Are Optimized for Extraction and Querying

Page 7: Module 1: Introduction to Data Warehousing and OLAP

Data Warehouse System Components

Data Warehouse

Data Access

User Data AccessData

Sources

Data Input

StagingArea

Data Marts

Page 8: Module 1: Introduction to Data Warehousing and OLAP

Defining OLAP Solutions

OLAP Databases

Common OLAP Applications

Relational Data Marts and OLAP Cubes

OLAP in SQL Server 2000

Page 9: Module 1: Introduction to Data Warehousing and OLAP

OLAP Databases

Optimized Schema for Fast User Queries

Robust Calculation Engine for Numeric Analysis

Conceptual, Intuitive Data Model

Multidimensional View of Data

Drill down and drill up

Pivot views of data

Page 10: Module 1: Introduction to Data Warehousing and OLAP

Common OLAP Applications

Executive Information Systems

Performance measures Exception reporting

Sales/Marketing Applications

Booking/billing Product analysis Customer analysis

Financial Applications

Reporting Planning Analysis

Operations Applications

Manufacturing Customer service Product cost

Page 11: Module 1: Introduction to Data Warehousing and OLAP

Relational Data Marts and OLAP Cubes

Relational Relational Data MartData Mart

Relational Relational Data MartData Mart OLAP CubeOLAP CubeOLAP CubeOLAP Cube

Data StorageData Storage Relational Data Structure

Relational Data Structure

N-dimensional Data structure

N-dimensional Data structure

Data ContentData Content Detailed and Summarized Data

Detailed and Summarized Data Summarized DataSummarized Data

Data SourcesData Sources Relational and Non-relational Sources

Relational and Non-relational Sources

Relational and Non-relational Sources

Relational and Non-relational Sources

Data RetrievalData Retrieval Fast Performance for Data Extract Queries

Fast Performance for Data Extract Queries

Faster Performance for Data Extract Queries

Faster Performance for Data Extract Queries

Page 12: Module 1: Introduction to Data Warehousing and OLAP

OLAP in SQL Server 2000

Microsoft Is One of Several OLAP Vendors

Analysis Services Is Bundled with Microsoft SQL Server 2000

Analysis Services Include

OLAP engine

Data mining technology

Page 13: Module 1: Introduction to Data Warehousing and OLAP

Understanding Data Warehouse Design

The Star Schema

Fact Table Components

Dimension Table Characteristics

The Snowflake Schema

Page 14: Module 1: Introduction to Data Warehousing and OLAP

The Star Schema

Fact TableDimension Table

Employee_DimEmployee_DimEmployee_DimEmployee_DimEmployeeKeyEmployeeKey

EmployeeID...EmployeeID...

Time_DimTime_DimTime_DimTime_DimTimeKeyTimeKey

TheDate...TheDate...

Product_DimProduct_DimProduct_DimProduct_DimProductKeyProductKey

ProductID...ProductID...

Customer_DimCustomer_DimCustomer_DimCustomer_DimCustomerKeyCustomerKey

CustomerID...CustomerID...

Shipper_DimShipper_DimShipper_DimShipper_DimShipperKeyShipperKey

ShipperID...ShipperID...

Sales_FactSales_FactTimeKeyEmployeeKeyProductKeyCustomerKeyShipperKey

TimeKeyEmployeeKeyProductKeyCustomerKeyShipperKey

Sales AmountUnit Sales ...Sales AmountUnit Sales ...

Page 15: Module 1: Introduction to Data Warehousing and OLAP

Fact Table Components

DimensionTables

DimensionTables

customer_dimcustomer_dimcustomer_dimcustomer_dim

201 ALFI Alfreds201 ALFI Alfreds

product_dimproduct_dimproduct_dimproduct_dim 25 123 Chai 25 123 Chai

sales_fact Table

MeasuresMeasures

customer_keycustomer_key product_keyproduct_key time_keytime_key quantity_salesquantity_sales amount_salesamount_sales

Foreign KeysForeign Keys

201 25 134 400 10,789

The grain of the sales_fact table is defined by the lowest level of detail stored in each dimension

134 1/1/2000134 1/1/2000

time_dimtime_dimtime_dimtime_dim

Page 16: Module 1: Introduction to Data Warehousing and OLAP

Dimension Table Characteristics

Describes Business Entities

Contains Attributes That Provide Context to Numeric Data

Presents Data Organized into Hierarchies

Page 17: Module 1: Introduction to Data Warehousing and OLAP

The Snowflake Schema

Defines Hierarchies by Using Multiple Dimension Tables

Is More Normalized than a Single Table Dimension

Is Supported within Analysis Services

Page 18: Module 1: Introduction to Data Warehousing and OLAP

Understanding OLAP Models

OLAP Database Components

OLAP Dimensions vs. Relational Dimensions

Dimension Fundamentals

Dimension Family Relationships

Cube Measures

Relational Data Sources

Page 19: Module 1: Introduction to Data Warehousing and OLAP

OLAP Database Components

Numeric Measures

Dimensions

Cubes

Page 20: Module 1: Introduction to Data Warehousing and OLAP

REGIONWest CA OREast MA NY

REGIONWest CA OREast MA NY

REGIONWestEast

STATEREGIONCA WestOR WestMA EastNY East

REGIONWestEast

STATEREGIONCA WestOR WestMA EastNY East

OLAP Dimensions vs. Relational Dimensions

OLAP Relational

Page 21: Module 1: Introduction to Data Warehousing and OLAP

Dimension Fundamentals

Page 22: Module 1: Introduction to Data Warehousing and OLAP

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

Dimension Family Relationships

USA is the parent of North West and South West

USA is the parent of North West and South West

North West and South West are children of USA

North West and South West are children of USA

North West and South West are siblings

North West and South West are siblings

North West and California are descendants of USA

North West and California are descendants of USA

North West and USA are ancestors of Washington

North West and USA are ancestors of Washington

Oregon and California are cousins Oregon and California are cousins All are dimension members All are dimension members

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

USANorth West Oregon WashingtonSouth West California

Page 23: Module 1: Introduction to Data Warehousing and OLAP

Cube Measures

Are the Numeric Values of Principle Interest

Correspond to Fact Table Facts

Intersect All Dimensions at All Levels

Are Aggregated at All Levels of Detail

Form a Dimension

Page 24: Module 1: Introduction to Data Warehousing and OLAP

Relational Data Sources

Star and Snowflake Schemas

Are required to build a cube with Analysis Services

Fact Table

Contains measures

Contains keys that join to dimension tables

Dimension Tables

Must exist in same database as fact table

Contain primary keys that identify each member

Page 25: Module 1: Introduction to Data Warehousing and OLAP

Applying OLAP Cubes

Defining a Cube

Querying a Cube

Defining a Cube Slice

Working with Dimensions and Hierarchies

Visualizing Cube Dimensions

Connecting to an OLAP Cube

Page 26: Module 1: Introduction to Data Warehousing and OLAP

Defining a Cube

Q4Q1 Q2 Q3Time Dimension Pro

ducts

Dim

ensio

n

Detroit

Denver

Chicago

Mar

ket D

imen

sion

Apples

CherriesGrapes

Atlanta

Melons

Page 27: Module 1: Introduction to Data Warehousing and OLAP

Q4Q1 Q2 Q3Time Dimension

Produ

cts D

imen

sionDallas

Denver

Chicago

Mar

kets

Dim

ensi

on

Apples

CherriesGrapes

AtlantaSales Fact

Melons

Querying a Cube

Page 28: Module 1: Introduction to Data Warehousing and OLAP

ApplesQ4Q1 Q2 Q3

Time Dimension

Produ

cts D

imen

sion

Detroit

Denver

Chicago

Atlanta

Mar

kets

Dim

ensi

on

MelonsCherries

Grapes

Defining a Cube Slice

Page 29: Module 1: Introduction to Data Warehousing and OLAP

Working with Dimensions and Hierarchies

Dimensions Allow You to

Slice Dice

Hierarchies Allow You to

Drill Down Drill Up

Page 30: Module 1: Introduction to Data Warehousing and OLAP

Visualizing Cube Dimensions

Page 31: Module 1: Introduction to Data Warehousing and OLAP

Connecting to an OLAP Cube

Page 32: Module 1: Introduction to Data Warehousing and OLAP

Review

Introducing Data Warehousing

Defining OLAP Solutions

Understanding Data Warehouse Design

Understanding OLAP Models

Applying OLAP Cubes