Teradata Contents

Uploaded by: jayanth


I  Teradata Overview

· Teradata Basics
  - List and describe the major components of the Teradata architecture (Parsing Engine (PE), Access Module Processors (AMPs), BYNET)
  - Describe how the components interact to manage incoming and outgoing data
  - List 5 types of Teradata database objects

· Teradata RDBMS Architecture
  - Describe the purpose of the PE and the AMP
  - Describe the overall RDBMS parallel architecture

· Teradata System Architecture
  - Characteristics of components: VPROCs (PE and AMP), BYNET
  - SMP and MPP systems
  - Teradata Clique

· Creating Teradata Database
  - Distinguish between a Teradata Database and a Teradata User
  - Define Perm Space and explain how it is used
  - Define Spool Space and its use
  - Visualize the hierarchy of objects in a Teradata system

· Storing and Accessing Data
  - Explain the purpose of the Primary Index
  - Distinguish between Primary Index and Primary Key

· Primary Index Mechanics
  - Role of Hashing Algorithm
  - RowID

· Secondary Index

· Data Protection
  - Concept of FALLBACK tables, Cliques
  - Describe the Recovery, Transient and Permanent Journals and their function

· Partitioned Primary Index

II Physical Implementation (Teradata Advanced)

· Hashing
  - Data distribution form and method
  - Describe Hashing
  - Describe Primary Index hash mapping

· File System Writes
  - Describe File System Write Access
  - Fragmentation
  - Impact of Row Inserts

· Access and Constraints
  - Analyze Optimizer Access scenarios
  - Identify column-level attributes and constraints
  - Identify table-level attributes and constraints

· Statistics
  - How the Optimizer acquires statistics
  - Teradata Statistics Wizard

· Explain
  - Explain Feature in Teradata
  - Explain Terminology

· Visual Explain Utility

III Teradata SQL

· Data Types
· DDL, DML, DCL statements
· Creation of Tables with SET/MULTISET options
· Temporary, Volatile, Global & Derived Tables
· CASE and SET operators
· DATE TIME Functions
· Aggregate and String Functions
· Ordered Analytical (OLAP) Functions
· Numeric Formatting Options
· Date and Time Formatting Options
· MERGE INTO Statement
· Materialized Views
· Sampling Data

IV Teradata Utilities

· BTEQ
  - BTEQ Export
  - BTEQ Import

· FastLoad
  - Two Phases of FastLoad
  - Prepare a FastLoad Script

· FastExport
  - FastExport Capabilities
  - Prepare a FastExport Script

· MultiLoad
  - Capabilities of MultiLoad
  - Five phases of MultiLoad and the main function of each
  - Create a MultiLoad script

· TPump
  - Capabilities and limitations of TPump
  - Describe TPump commands and parameters
  - Create a TPump Script

TERADATA COMPONENTS & ARCHITECTURE
1. The Teradata parallel architecture
2. A logical view of the Teradata architecture
3. The Parsing Engine (PE)
4. The Access Module Processors (AMPs)
5. The BYNET

DATA DISTRIBUTION IN TERADATA
1. Rows and columns
2. The Primary Index
3. Two types of Primary Index (Unique Primary Index & Non-Unique Primary Index)
4. Data distribution using the Unique Primary Index value / Non-Unique Primary Index value
5. The Secondary Index
6. Data distribution using the Secondary Index value
7. Partitioned Primary Index

TERADATA SPACE MANAGEMENT
1. Types of Spaces
2. PERM, SPOOL and TEMP space
3. How spool space is calculated

DATA PROTECTION
1. Transaction concept & Transient Journal
2. Fallback protection / clusters
3. Redundant Array of Independent Disks (RAID)
4. Cliques
5. Permanent Journal
6. Table create with Fallback and Permanent Journaling
7. Locks and their compatibility

THE ACTIVE DATA WAREHOUSE
1. OLTP environments
2. The DSS environment
3. Data marts and their types

TERADATA TOOLS - SQL ASSISTANT
1. TDQM
2. Index wizard
3. Archive recovery
4. Teradata Analyst Suite

CREATING TABLES
1. CREATE / Populate Table
2. SET tables / MULTISET tables
3. Quick way to copy tables
4. CREATE table options
5. Defining constraints at the column level / table level
6. Partitioning with CASE_N
7. Partitioning with RANGE_N
8. NO CASE, NO RANGE, or UNKNOWN
9. Data types / Attributes / Constraints
10. PPI and Identity columns / Secondary index definitions
11. COLLECT STATISTICS
12. Hash functions

TEMPORARY TABLES
1. Derived tables
2. Multiple columns in a derived table
3. Volatile table
4. Global temporary tables

VIEWS
1. View basics
2. CREATE / REPLACE / DROP
3. Placing aggregates inside a view
4. Using “locking for access” in views
5. UPDATE tables through views
6. Restricting UPDATE rows with check option

TERADATA UTILITY AND MECHANISM WITH EXAMPLE
1. BTEQ
2. FastExport
3. FastLoad
4. MultiLoad
5. TPump
6. INMOD Processing
7. OUTMOD Processing

SQL BASICS
1. Rows and columns
2. The SELECT command
3. The WHERE clause
4. The ORDER BY clause
5. Sorting by multiple columns
6. Teradata SQL Punctuation

SELECTING ROWS
1. Working with NULL
2. The DISTINCT command with multiple columns
3. The AND / OR / NOT operators
4. Order of precedence for (), NOT, AND, or OR
5. Using an IN list instead of OR
6. The BETWEEN operator
7. The LIKE operator and percent wildcard
8. SQL that causes a full table scan (FTS)

HELP, SHOW, AND EXPLAIN
1. The HELP command
2. The SHOW command
3. The EXPLAIN command

AGGREGATES
1. Aggregates
2. Aggregate example with NULL values / the GROUP BY statement
3. Aggregates and the HAVING statement
4. WHERE, GROUP BY and HAVING together

SUBQUERIES
1. The IN statement (for review)
2. Normal Subqueries (using multiple tables)
3. Correlated Subquery
4. NOT EXISTS vs. NOT IN

JOINS
1. Primary key / foreign key relationships
2. LEFT OUTER / RIGHT OUTER / FULL OUTER joins
3. Join types vs. join strategies
4. Big table small table join strategy
5. Nested join
6. Hash join
7. Exclusion join
8. Product joins
9. Cartesian product join
10. Cross join
11. Self join

ALIASING, TITLE, CAST, AND FORMAT
1. Title function in BTEQ adds functionality
2. Title function used with DISTINCT
3. Teradata data types
4. CAST function
5. Derived columns / Using an ALIAS on a column
6. Formatting a column
7. Date, time, and timestamp FORMAT

INTERROGATING DATA
1. SUBSTRING
2. SUBSTR
3. Concatenation of character strings
4. Using SUBSTRING and concatenation together
5. CHARACTER vs. VARCHAR
6. The TRIM function
7. CHARACTERS command
8. Output results for multiple commands mixed
9. The POSITION function
10. The INDEX function
11. SUBSTRING and POSITION together
12. COALESCE with literals
13. ZEROIFNULL
14. NULLIFZERO
15. NULLIF command
16. The CASE command (valued CASE)
17. The CASE command (searched CASE)
18. Nested CASE statement

SET OPERATORS
1. INTERSECT
2. UNION
3. EXCEPT or MINUS

MACROS
1. Macro basics
2. How to create / execute a macro
3. Parametric macro
4. Drop macro

DATES AND TIMES
1. DATE Functions
2. ADD_MONTHS / EXTRACT
3. The system calendar
4. INTERVAL processing for arithmetic and conversion
5. TIMESTAMP
6. CURRENT_TIMESTAMP

WITH AND WITH BY FOR TOTALS AND SUBTOTALS
1. The WITH statement
2. The WITH BY statement
3. Combining WITH BY and WITH

SAMPLING
1. Random sampling – number of rows sample
2. Random sampling – percentage of the table sample
3. SAMPLE WITH REPLACEMENT / REPLACEMENT and RANDOMIZED ALLOCATION together
4. SAMPLE with conditional test

RANK AND QUANTILE
1. RANK
2. QUALIFY RANK is like a HAVING statement
3. QUALIFY RANK with a GROUP BY
4. QUANTILE function
5.

OLAP
1. Cumulative Sum (CSUM)
2. ANSI SUM OVER with PARTITION BY for grouping
3. Moving Sum (MSUM)
4. Moving Average (MAVG)
5. MAVG with ANSI AVG OVER
6. Moving Difference (MDIFF)

STORED PROCEDURES
1. CREATE PROCEDURE
2. Nesting BEGIN and END statements
3. Passing parameters to a stored procedure
4. The WHILE and END WHILE

TRANSACTION PROCESSING
1. Transactions defined
2. Locking
3. Multi-statement operations

MISCELLANEOUS
1. Single row MERGE INTO command
2. Compression
3. Teradata and ANSI mode
4. SQRT function
5. INSERT/SELECT on two exact tables
6. Triggers

Teradata Warehouse and Competitiveness
  - What is Teradata RDBMS
  - Teradata 12 vs. other versions
  - Teradata History
  - Teradata in the Enterprise
  - Teradata Users & Scalability
  - Teradata competitiveness with other RDBMS (Oracle, SQL Server, etc.)
  - Teradata Warehouse
  - Discussion on interview and certification questions

Teradata Architecture and Components
  - PDE (Parallel Database Extensions)
  - Vprocs (Virtual Processors)
  - AMP (Access Module Processor)
  - Boardless BYNET
  - TDP (Teradata Director Program)
  - CLI (Call-Level Interface)
  - TPA (Trusted Parallel Application)
  - Going in depth by explaining the process of SQL statement execution

Various Architectures
  - Node Architecture (Shared Nothing)
  - SMP Architecture
  - MPP Architecture
  - Parallelism Architecture
  - Benefits and Types

Data Recovery and Protection
  - Object Locks
  - Various locks for simultaneous access
  - RAID 1
  - RAID 5
  - Disk Arrays
  - Fallback
  - Clique
  - AMP Clustering
  - Journals
  - Recovery Journal
  - Transient Journal
  - Permanent Journal
  - Before Journal
  - After Journal
  - Answering various recovery questions

Teradata Storage and Retrieval Architectures
  - Request Processing
  - Syntaxer
  - Resolver
  - Security Module
  - Optimizer
  - Step Generator
  - GNC Apply
  - Dispatcher
  - Primary Index
  - Hash algorithm
  - Row Hash
  - DSW
  - Hash Map
  - Hash Bucket
  - Reaching the Vdisk
  - Teradata Indexes
  - Primary Index
  - Unique
  - Non-unique
  - Partitioned
  - Secondary Index
  - Unique
  - Non-unique
  - Hash, Join, Value Ordered
  - Skewness
  - Secondary Index Subtable
  - Accessing records via Primary Index
  - Accessing records via Secondary Index
  - Keys vs. Indexes
  - Full real-time scenarios with explanation

Teradata SQL Reference
  - Fundamentals
  - Data Types and Literals
  - Data Definition Statements
  - Data Manipulation Statements
  - Explaining with proper examples

Teradata Functions and Operators
  - String Functions
  - Format Function
  - Cast Functions
  - Group & Aggregate Functions
  - WITH & WITH BY clauses
  - Practices of this section

Teradata Transactions
  - Implicit Transaction
  - Explicit Transaction

Performance Tuning and Explain Utility
  - Explain Usage
  - Collecting Statistics
  - Tuning SQL Performance
  - Usage of PMON
  - Explaining various SQL statements

Joins and Unions
  - Inner Join
  - Left Outer Join
  - Right Outer Join
  - Full Outer Join
  - Detailed explanation

Join Strategies
  - Product Join
  - Merge Join
  - Hash Join
  - Nested Join
  - Questions of this section

Teradata Basic Commands
  - HELP
  - SHOW
  - EXPLAIN
  - COLLECT STATISTICS

Teradata Objects
  - Tables
  - SET table
  - MULTISET table
  - Volatile tables
  - Global Temporary tables
  - Derived tables
  - Views
  - Macros
  - Stored Procedures
  - Triggers

Teradata Spaces
  - PERM space
  - SPOOL space
  - TEMPORARY space
  - Teradata User and managing
  - Practical Examples

Teradata Transaction Modes
  - BTET
  - ANSI
  - Interactive
  - Batch

Load and Unload Utilities & Tools
  - Teradata SQL Assistant (Queryman)
  - Teradata Performance Monitor

Teradata BTEQ
  - Batch scripts with samples
  - Branching and looping
  - Importing data through scripts
  - Exporting data through scripts
  - Error handling
  - Explanation with proper debugging

Teradata FastLoad
  - Various phases of FastLoad
  - Advantages and process
  - Limitations of FastLoad
  - Sample scripts
  - Real-time issues and resolving them

Teradata MultiLoad
  - Various phases of MultiLoad
  - Limitations of MultiLoad
  - Working with multiple tables
  - Applying various operations
  - Sample scripts
  - Real-time issues and solving them

Teradata Parallel Data Pump
  - Limitations of TPump
  - Overview of TPump
  - Sample scripts

Teradata FastExport
  - Exporting data
  - Passing parameters
  - OUTMODs
  - Sample scripts
  - Utility vs. parallelism
  - Introduction to answering ad hoc queries and FAQs
  - Covering DBA concepts
  - Concentrating on performance tuning

Informatica 8.6

Data Warehousing Basic Concepts
  - Data warehouse definitions
  - Who needs DWH?
  - Why DWH is needed
  - Characteristics of data warehousing
  - OLTP vs. OLAP databases

DWH Architecture & Implementation
  - Different approaches
  - Ralph Kimball methodology
  - Inmon methodology
  - Data marts

Logical Warehouse Modeling & Schema Design
  - Definition of star schema
  - Data marts: reasons for construction
  - Design of dimensions and facts
  - Snowflake star schema
  - Slowly changing dimensions
  - Data marts containing multiple facts
  - Galaxy schema
  - Common design complexities
  - Conformed dimensions

Maintaining the Data Warehouse
  - Reasons for partitioning
  - Types of partitioning
  - Partitioned data vs. non-partitioned data

Physical Design in the Data Warehouse
  - Moving from logical design to physical design
  - Physical design
  - Physical design structures
  - Tables and partitioned tables
  - Views
  - Integrity constraints
  - Materialized views

Overview of Extraction, Transformation and Loading
  - Overview of different ETL tools

Extraction in DWH
  - Overview of extraction in DWH
  - Understanding extraction methods in DWH
  - Logical extraction methods
  - Physical extraction methods
  - Data warehousing extraction examples

Loading and Transformation
  - Overview of loading and transforming
  - Transformation flow

Loading Mechanisms
  - External tables
  - Export and import

Transformation Mechanism
  - Transformation using SQL
  - Transformation using PL/SQL
  - Transformation using table functions

Loading and Transformation Scenarios
  - Parallel load scenario
  - Key lookup scenario
  - Exception handling scenario

Informatica 8.6: Informatica PowerCenter

Basic Concepts of Informatica
  - Overview of basic definitions
  - Informatica products, architecture
  - Tools and their roles
  - Understanding Informatica server configuration

Informatica Admin Console (overview)
  - Viewing the Informatica repository
  - Creation of repository
  - Backup of repository
  - Restoring repository
  - Copying of repository

Informatica Repository Manager
  - Creating users and groups
  - Managing users
  - Assigning privileges
  - Creation of folders

Informatica Designer
  - Basic concepts
  - Using the Designer
  - Working with sources
  - Working with flat files, targets
  - Mappings, transformations, mapplets
  - Working with the debugger
  - Creating dimensions and schema
  - Implementing type 1, type 2, type 3 mappings
  - Loading data into fact tables

Working with Transformations
  - Aggregator
  - XML
  - Expression
  - Filter
  - Joiner
  - Lookup
  - Rank, Router
  - Sequence Generator
  - Sorter
  - Stored Procedure
  - Union Transformation
  - Update Strategy
  - Connected vs. Unconnected

Creating Workflows
  - Understanding server architecture
  - Working with workflows
  - Working with tasks: assignment, command, decision, control, event wait, timer, event raise, email, session
  - Working with worklets
  - Log files
  - Monitoring workflows
  - Parameter files