Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
TheNewSQLdatabaseforhighvelocityapplica9ons
NYCBigDataMeetup
Bloomberg5/21/2012
Value of Individual Data Item
Aggregate Data Value
DataValueChain
Interac9ve Real9meAnaly9cs RecordLookup HistoricalAnaly9cs ExploratoryAnaly9cs
Milliseconds Hundredthsofseconds Second(s) Minutes Hours
.Placetrade
.Servead
.Enrichstream
.Examinepacket
.Approvetrans.
.Calculaterisk
.Leaderboard
.Aggregate
.Count
.Retrieveclickstream
.Showorders.Backtestalgo.BI.Dailyreports
.Algodiscovery
.Loganalysis
.FraudpaKernmatch
AgeofData
DataValue
Source:VoltDB,Inc.
Tradi9onalRDBMS
MarketLandscape
Interac;ve Real;meAnaly;cs RecordLookup HistoricalAnaly;cs ExploratoryAnaly;cs
SimpleSlowSmall
FastComplexLarge
App
lica9
onCom
plexity
Hadoop,etc.
DataWarehouse
Transac9onal Analy9c
Value of Individual Data Item
Aggregate Data Value
NewSQL
DataValue
NoSQLVoltDB
TheClosed‐Loop‐Big‐Data‐Dream
• Makethebestdecisionevery9methereisaninterac9on
• Real‐9medecisionsareinformedbyopera9onalanaly9cs&pastknowledge
• Some9mescalledOLTP
“itisnotenoughtocapturemassiveamountsofdata;organiza9onsmustalsosiVthroughthedata,extractinforma9onandtransformitintoac;onableknowledge.”
Real‐9meDecisionMaking
Reports&Analy9cs
ExploratoryAnalysis
Who: AutomatedWhat: Transact, Opera9onalAnaly9cs
Who: AnalystWhat: Reports,Drilldown,…
Who: DataScien9stWhat:Discovertrends,rules,…
Knowledge
Knowledge
Real‐9medecisionmakingUseCases
• Highthroughput,relentlessdatafeeds
• Fastopera9onsonhighvaluedata
• Real‐9me,opera9onalanaly9cspresentimmediatevisibility
Real‐9meAnaly9cs
DeepAnaly9cs
Warehouse/Hadoop
Real‐9meOpera9ons
HighVelocityData
Sources
Real‐9meAnaly9csReal‐9meOpera9onsData Feed
NetworkTrafficMonitoring NetworkpacketsExaminepacketbysource/des9na9on
Iden9fybandwidthoutliers
FinancialTradeSupport Marketorders IngesttradedataRecallposttradeordergroupings
Sensortracking&analy9cs Sensorposi9onfeedIden9fica9onandcleansingoftaginfo
No9fica9onandgroupings
MobileGaming OnlinegameGamestateupdatesandusagepaKerns
Leaderboardlookups
DigitalAdTech Adbid/clickstream Bid,op9mizecontentReportadperformance
TheVoltDBDatabase
• High‐performanceRDBMS
• In‐memorydatabase
• Automa9cscale‐outoncommodityservers
• Built‐inhighavailability
• Rela9onalstructures,ACIDandSQL
CostDisrup9on
Ex.Tradi9onalRDBMS VoltDB
SystemSPARCSuperCluster/
Oracle11g18,8‐coreIntelservers
Price/tpmC $1.01 $0.012
VoltDBPerformanceAdvantage
TPC‐Csinglenode(Oracle) 45X
TPC‐Csinglenode(MySQL) 100X
Rela9onshipofthe3V’s
• Uniqueop9miza9onsforeach• Naturalpartnershipsacross• Coretechnologybarriers
• Lookforrighttool,polyglotandpartners!
“Theleadingedgeofbigdataisstreamingdata.”‐TDWI
• UnstructuredData• R&W• Flexibility
Variety
• StructuredData• R&W• Transac;onal
Velocity• Structured&Unstructured• ReadHeavy• Analy;cs
Volume
DW
NoSQL
Hadoop
MarketDisrup9onfromDataCrea9on
Tradi9onaldatabasearchitecturescan’tkeepup
MacroTrend
s
• Creation:EndpointsXprocessingXnetwork• Data:Perceivedascompe99veweapon
• Speed:Valueradicallyincreasesasyouapproachreal‐9medecisioning
• Eco-system:Datastoreintegra9ons
Every60seconds…168millionemailssent694,445searchqueries98,000Tweets695,000+Facebookupdates13,000+iPhoneappsdownloaded
EndPointProlifera;on