Data Vault Modeling DW2.0 & Unstructured Data Big Data Agile DW Ensemble Modeling
Modeling the Agile Data Warehouse with Data Vault
© 2012 Genesee Academy, LLC
Hans Hultgren
Presentation: Agile BI Congres Book Launch
© 2012 Genesee Academy, LLC
Book Launch
• Modeling the Agile Data Warehouse with Data Vault
2
What and Why
• A complete book on Data Vault • An Introduction, a Guide and a Reference • Modeling, Architecture & the Data Warehousing Program • Data & Semantic Integration for Enterprise Central Meaning • Applying Concepts to a successful Agile DWBI Program
© 2012 Genesee Academy, LLC
Book Launch
• Modeling the Agile Data Warehouse with Data Vault
3
• Data Vault Modeling • Agile Data Warehousing BI • Enterprise Data Warehousing • Data Integration and DWBI Architecture • Unified Decomposition™
• Ensemble Modeling™
© 2012 Genesee Academy, LLC
Agile Data Warehousing BI
4
• Agility = Measure of ability to Adapt to Change
• The EDW is constantly needing to adapt to change
– New Sources – New Attributes – Changing Sources – New and Changing Requirements – New and Changing Business Rules – New and Changing Deliveries – Expanding Subject Areas
© 2012 Genesee Academy, LLC
Agile Data Warehousing BI
5
• Current Architectures and Modeling Patterns have challenges adapting to change
– Models become hardened and inflexible – Entities or Conformed Dimensions grow to huge record lengths – Modeling constructs have difficulty adapting to change
IDEA:
– Split the concepts into parts to isolate changing parts and reduce the impact of the changes
© 2012 Genesee Academy, LLC
Unified Decomposition™
6
• Break things out into component parts for flexibility, adaptability, agility, and generally to facilitate the capture of things that are either interpreted in different ways or changing independently of each other.
• At the same time a core premise of data warehousing is integration and moving to a common standard view of unified concepts. So we also want to tie things together – to Unify.
© 2012 Genesee Academy, LLC
Ensemble Modeling™
7
All the parts of a thing taken together, so that each part is considered only in relation to the whole.
• The constellation of component parts acts as a whole – an Ensemble.
• With Ensemble Modeling the Core Business Concepts that we define and model are represented as a whole – an ensemble – including all of the component parts. An Ensemble is based on all things defining a Core Business Concept that can be uniquely and specifically said for one instance of that Concept.
© 2012 Genesee Academy, LLC
The Data Vault Ensemble
8
• The Data Vault Ensemble conforms to a single key – embodied in the Hub construct.
• The component parts for the Data Vault Ensemble include: – Hub The Natural Business Key – Link The Natural Business Relationships – Satellite All Context, Descriptive Data and History
© 2012 Genesee Academy, LLC
Applying the Data Vault Ensemble
9
• Data Vault constructs have been broken out by type of data…
© 2012 Genesee Academy, LLC
Applying the Data Vault Ensemble
10
• Mixing “color types of data” is not Data Vaulting but rather unvaulting • A blended pattern has different dynamics…
Thinking Differently
• Stay with the Ensemble Modeling Pattern. Continue practicing Unified Decomposition. Continue Vaulting. Be aware when you change patterns.
© 2012 Genesee Academy, LLC
Data Vault Modeling
11
• A Data Vault Model for Customer Sales with Employee and Product.
© 2012 Genesee Academy, LLC
Selected Topics Covered
12
• Concept Constellations • Modeling the Core Business Concept • The Specific, Unique, Natural Business Relationship • Attribute Clustering • Managing non-unique Natural Business Keys • Concept level of Abstraction • Model-Driven and Data-Driven constructs • Generic Models and NVP • Integration, Alignment and Reconciliation • The Data Vault EDW and Big Data • Satellite Design and Attribution
© 2012 Genesee Academy, LLC
Data Vault around the World
13
An estimated 600 Data Vault based Data Warehouses around the world
© 2012 Genesee Academy, LLC
Data Vault Certification Course
• Certified Data Vault Data Modeler CDVDM • Course consists of 2 weeks of online, on-demand video training lessons.
These cover the fundamentals of Data Vault and the underlying core concepts.
• Classroom Seminar is 2 days. – Day 1
• Covers the Data Vault constructs of Hub, Link and Satellite • Impact of Data Vault – the benefits of applying this model • Sales Receipt example modeling with Data Vault • Deep dive into Hub design • Group Case: Secaucus Soccer • Deep dive into Link Design • Group Case: Mt Evans Harley • Advanced Constructs • Assign Integration Case
14
© 2012 Genesee Academy, LLC
Data Vault Certification Course
– Day 2 • Group Case: Integration Case • Working with Data Vault • Loading Paradigms • Applying to the EDW • Data Vault in Practice • Exam Review and Discussions • CDVDM Exam
• Course information found on www.GeneseeAcademy.com • In Netherlands: www.Centennium.nl/index.php/data-vault
• Online courses DataVaultAcademy.com
15
Amsterdam
Feb. 25-26
© 2012 Genesee Academy, LLC
Links and Information
CDVDM Training www.GeneseeAcademy.com
Hans Hultgren
Twitter: gohansgo
HansHultgren.wordpress.com
LinkedIn: HansHultgren
YouTube: DataVaultAcademy
17
Online, on-demand training
DataVaultAcademy.com
Amsterdam Feb. 25-26
Congres Special
30% Discount
Purchase anytime on Amazon