29

Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Embed Size (px)

Citation preview

Page 1: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305
Page 2: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Taking Your Application Design to the Next Level with Data Mining

Gavin Russell-RockliffBI Technical SpecialistMicrosoftBIN305

Page 3: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Please Raise Your Hand If You’ve Ever…

<… Put a Party Reference Here… >Attended a Statistics Lecture ??Got a Statistics Degree ??Used SQL Server Data Mining ??

Page 4: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Agenda

Data Mining – What is it?Data Mining – How do we do it?Demonstrations

VisualisationReportingETLApplication

Q&A

Page 5: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Data Mining – What Is It?

According to EncartaNoun“Search for Hidden Information”“The locating of previously unknown patterns and relationships within data”

Server-Driven DiscoveryUses a combination of statistics, probability analysis and database technologies

Page 6: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

DM Enables Predictive Analysis

Predictive Analysis

Presentation Exploration Discovery

Passive

Interactive

Proactive

Business Insight

Canned reporting

Ad-hoc reporting

OLAP

Data mining

Role of Software

Page 7: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Business Scenarios

Forecasting sales

Churn Analysis

Detecting fraud or invalid data

Targeting promotions

Cross-selling

Determine Business Drivers

Page 8: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Our End-to-End BI Offering

END USER TOOLS AND PERFORMANCE MANAGEMENT APPLICATIONS

BI PLATFORM (RDBMS, ETL, OLAP, Reporting)

DELIVERY

Mainframe/ Departmental Systems

The Big Picture

SQL Server Reporting Services

SQL Server Analysis Services

SQL Server DBMS

SQL Server Integration Services

Page 9: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Our End-to-End BI Offering

END USER TOOLS AND PERFORMANCE MANAGEMENT APPLICATIONS

BI PLATFORM (RDBMS, ETL, OLAP, Reporting)

DELIVERY

The Big Picture

SQL Server Analysis Services

Page 10: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

SQL Server™ 2008 Data MiningKey Drivers

Keep Development SimpleRetain Full Suite of AlgortihmsManage Large VolumesAllow for Integration

Page 11: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

SQL Server™ 2008 AlgorithmsMicrosoft Naïve Bayes

Quick and approachable algorithmUsed for classification

Microsoft Decision TreesPopular data mining techniqueUsed for classification, regression and association

Microsoft Linear RegressionFinds the best possible straight line through a series of pointsUsed for prediction analysis

Page 12: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

SQL Server™ 2008 AlgorithmsContinued

Microsoft Neural NetworkMore sophisticated than Decision Trees and Naïve Bayes, this algorithm can explore extremely complex scenariosUsed for classification and regression tasks

Microsoft Logistic RegressionA particular case of the Neural Network algorithm

Microsoft ClusteringFinds natural groupings inside dataSupports segmentation and anomaly detection tasks

Page 13: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

SQL Server™ 2008 AlgorithmsContinued

Microsoft Sequence ClusteringGroups a sequence of discrete events into natural groups based on similarity

Microsoft Time SeriesUsed to predict future values from a time seriesHas been improved in SQL Server 2008 to produce more accurate long-term forecasts

Microsoft Association RulesCommonly supports market basket analysis to learn what products are purchased together

Page 14: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Data Mining Algorithm Usage

What is your task?Predict Variable

•Naïve Bayes•Decision Trees•Neural Network•Logistic Regression

Predict Value

•Decision Trees•Linear Regression•Neural Network•Logistic Regression

Marketing Cluster

•Clustering

Forecast Value

•Time Series

Associate

•Association Rules•Decision Trees

Page 15: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Data Mining Process

Define the ProblemData PreperationModel Validation

AccuracyReliabilityUsefulness

Model Visualisation

Page 16: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Describing the Data Mining Process

Design time

Process time

Query timeMining Model

Page 17: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Describing the Data Mining Process

Design time

Process time

Query timeMining Model

Training Data Data Mining Engine

Page 18: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Data Mining Visualization

Page 19: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Model Creation + Processingdemo

Page 20: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Describing the Data Mining Process

Design time

Process time

Query timeMining Model

Training Data Data Mining Engine

Page 21: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Describing the Data Mining Process

Design time

Process time

Query time

Data Mining Engine

Data to PredictPredicted Data

Mining Model

Page 22: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Predicting the Futuredemo

Page 23: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Data Mining for the Developerdemo

Page 24: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

question & answer

Page 25: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Related Content

Breakout Sessions

Using MDX for Enhanced Scorecards and Dashboards (BIN 307)

Page 26: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Track Resources

www.sqlserverdatamining.com

www.microsoft.com/sql

twitter.com/gavinrr

Page 27: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

www.microsoft.com/teched

Sessions On-Demand & Community

http://microsoft.com/technet

Resources for IT Professionals

http://microsoft.com/msdn

Resources for Developers

www.microsoft.com/learning

Microsoft Certification & Training Resources

Resources

Page 28: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

Complete a session evaluation and enter to win!

10 pairs of MP3 sunglasses to be won

Page 29: Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS,

IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.