The Right Data Warehouse: Automation Now, Business Value Thereafter

Preview:

Citation preview

Grab some coffee and enjoy the pre-show banter before the top of the hour!

The Briefing Room

The Right Data Warehouse: Automation Now, Business Value Thereafter

Twitter Tag: #briefr

The Briefing Room

Welcome

Host: Eric Kavanagh

eric.kavanagh@bloorgroup.com @eric_kavanagh

Twitter Tag: #briefr

The Briefing Room

!   Reveal the essential characteristics of enterprise software, good and bad

!   Provide a forum for detailed analysis of today’s innovative technologies

!   Give vendors a chance to explain their product to savvy analysts

!   Allow audience members to pose serious questions... and get answers!

Mission

Twitter Tag: #briefr

The Briefing Room

Topics

This Month: BIG DATA

May: DATABASE

June: ANALYTICS & MACHINE LEARNING

2014 Editorial Calendar at www.insideanalysis.com/webcasts/the-briefing-room

“ “ The first rule of any technology used

in a business is that automation applied to an efficient operation will magnify the efficiency. The second is that automation applied to an inefficient operation will magnify the inefficiency. -Bill Gates

Twitter Tag: #briefr

The Briefing Room

Analyst: Robin Bloor

Robin Bloor is Chief Analyst at The Bloor Group

robin.bloor@bloorgroup.com @robinbloor

Twitter Tag: #briefr

The Briefing Room

WhereScape

! WhereScape is a data warehousing software company

!   It offers WhereScape 3D, software for planning and reality-testing data warehousing and business intelligence projects; and WhereScape RED, an integrated development environment used for building, deploying and managing data warehouses and data marts.

! WhereScape RED allows developers to automate the data warehousing life cycle

Twitter Tag: #briefr

The Briefing Room

Guest: Mark Budzinski

Mark Budzinski is the President of WhereScape USA. In this role, Mark sees firsthand the challenges that IT organizations face and their desire to better serve the business community with agile, rapid development of data warehouses and related data driven projects. His career has spanned management positions at Intel, Sequent Computer Systems, RadiSys, and Applied Microsystems. He holds an MBA from the Univ. of Oregon, a Master’s Degree in Computer Science from the Univ. of Southern California, a BS in Industrial Engineering from Lehigh Univ., and is a graduate of The Buckley School of Public Speaking.

Mark Budzinski

President

WhereScape USA, Inc.

WhereScape Bloor Group Briefing

Big Data: Are We There Yet?

April 1, 2014

So What Problem Are We Trying to Solve, Again?

¡  Big data = new insights from new sources to answer old questions

¡ Hadoop is a fabulous new technology, not a religion

¡  Business value is *still* where the holy grail is found

¡ Connecting *all* data, in the right form to the business constituency at the pace the business needs it…isn’t *that* the problem worth solving?

Eh, Do You See That Animal Over There?

¡  IT has its hands full

¡  ETL has we know it, isn’t helpful

¡  The business employs all kinds of tricks to work around IT

¡ We hide the whole mess in human capital ¡  Consultants ¡  Politics ¡  Excuses

WhereScape: What We Believe

¡ Getting the data right is critically important, yet business users can seldom articulate what they want until they see it

¡  Business users/consumers of data must be served at their pace

¡  Automate as much of the process as possible

¡  Standards matter

¡ Document everything; later changes demand it

¡  The historical record that “data warehouses take too long to build and are too hard to change” must evolve to a paradigm whereby IT delivers value continuously, manages ongoing changes cost effectively, and mitigates risk throughout

Some of Our Customers

ETL Is Not The Same Thing As…

Move Data

Transform Data

Load Data From Sources including OLTP, files, Hadoop, etc.

Create Database Objects

such as Models, Views, Facts, Dimensions, Indexes

Documentation

WhereScape Value Proposition

¡ We empower IT organizations to build, deploy, and manage their data in a timeframe and manner that creates utter enthusiasm in the business community

¡  Time to results dramatically decreases - faster

¡ Costs decrease: license and human capital - cheaper

¡ Quality is higher: documentation & transparency - better

¡ Risk is mitigated: automation & methodology – with confidence

By automating development, you get more from your “data investment”

Data Warehouse Automation Software: RED

RED is an Automated Data Warehouse Development Platform That Builds Native Target Database Objects, Documents Them, And Schedules Data To Be Loaded. RED Works Well With ETL Tools, or as an ELT Solution Using Native Loaders (e.g. Teradata TPT)

Automation Use Cases

End-to-End Data Warehouse

Development Tool

Including All ETL

Complement to Enterprise ETL, Used

For Rapid/Agile Build Out of a View or Semantic Layer

Green Field

Clean Up Hand-Written

Code and Scripts

Manage Deployments

Rewrite & Recast Heterogeneous

Database Solutions

Big Data Integration

ELT 2.0

Configuration Management

Semantic Layer

“WhereScape RED’s tight integration with the target database and built-in best practices are empowering our resource-constrained development team to tackle a large implementation that will enable us to quickly deliver value to our institution and members.”

Su Rayburn, Director of Business Intelligence Services

“Our results using WhereScape have been extremely impressive. WhereScape enabled us to design, develop, document and deploy a production-ready solution in 6-8 weeks. Using traditional data warehouse development methods would have taken us 6-8 months.

Ryan Fenner, VP, Data Solutions Architect, Union Bank

“We estimate the development lifecycle is 20-25% of what it was previously when we were hand coding.”

Dan Mosher, Director of Enterprise Data Warehousing

Summary

WhereScape’s approach with data warehouse automation yields customer results that are stunning: faster, cheaper, and more manageable

Twitter Tag: #briefr

The Briefing Room

Perceptions & Questions

Analyst: Robin Bloor

Is the Data Warehouse Dead?

u  It’s all about data flow

u  The number of data sources is increasing dramatically; some (like the weather) don’t directly relate to corporate data entities

u  Inconvenient data structures (documents, text, etc.) make a single database engine an impossibility

u  No database product can handle the scale required

Is the Data Warehouse Dead?

u  It’s all about data flow, and there needs to be a control point (or several) for that flow

u  There needs to be a comprehensive integrated map of all data used by the business

u  This means that there needs to be a LOGICAL data warehouse

u  Whether there is a single physical data warehouse is a detail of the implementation

Big Data and the Data Reservoir

The Biological System Analog

u  Our human control system works at different speeds: •  Almost instant reflex •  Swift response •  Considered response

u  Organizations will gradually implement similar control systems

u  This suggests a data-flow- based architecture

u  Memory is the Data Warehouse

Data Warehouse Design

The design issue:

u  In the past a data warehouse design could remain fairly static for a while – it rarely saw dramatic change

u  Those days are probably over for many businesses

u  The warehouse design process now needs to have a maintenance aspect

u  It also needs to embrace Hadoop and think in terms of “logical warehouse”

u  How does the typical WhereScape engagement proceed in terms of: product only/training/consultancy?

u  What is the typical time frame from project initiation to operating warehouse (green field)?

u  What factors impact that time frame?

u  What are your customers (who use Hadoop) using it for?

u  How would you characterize the role of Hadoop in the data warehouse?

u  Do you become involved in database migrations? If so, what are the contexts for such projects?

u  Do you have any direct competition?

Twitter Tag: #briefr

The Briefing Room

Twitter Tag: #briefr

The Briefing Room

Upcoming Topics

www.insideanalysis.com

2014 Editorial Calendar at www.insideanalysis.com/webcasts/the-briefing-room

This Month: BIG DATA

May: DATABASE

June: ANALYTICS & MACHINE LEARNING

Twitter Tag: #briefr

The Briefing Room

THANK YOU for your

ATTENTION!

Images borrowed from the Internet: Slide 29: http://evaporatedpast.com/2013/09/07/art-is-never-finished-only-abandoned-leonardo-da-vinci/

Recommended