1. [SSA] Big Data Analytics Big Data Database Technology
[email protected] 2014. 2. 5.
2. Contents I. II. III. 1
3. 2
4. 1956 IBM (RAMAC) 5MB 5 , 2011 2TB 70 CPU , 2010 50 N (PC, ,
, TV) , , , : 1)
http://en.wikipedia.org/wiki/Memory_storage_density#Effects_on_price
2) MGI(McKinsey Global Institute) 2011.06 Big data: the next
frontier for innovation, competition, and productivity 3
11. . . - [ ] " (noise) (signal) by Claude Shannon " by Gregory
Bateson . . . - [ ] :
http://terms.naver.com/entry.nhn?docId=1526261&cid=3619&categoryId=3623
10
12. vs. Raw, unorganized facts No context Just numbers and text
Processed data Data with context Value added to data summarized
origanized analyzed Example: 51007 Example 5/10/07 The date of your
final exam. $51,007 The average starting salary of an account
manager. :
http://www.slideshare.net/EinsteinX2/data-vs-information,
http://www.diffen.com/difference/Data_vs_Information 11
25. GFS 2003 Google File System: A Distributed Storage
MapReduce 2004 Simplified Data Processing on Large Clusters Sawzall
2005 Interpreting the Data: Parallel Analysis with Sawzall Chubby
2006 The Chubby Lock Service for Loosely-Coupled Distributed
Systems BigTable 2006 A Distributed Storage System for Structured
Data Paxos 2007 Paxos Made Live - An Engineering Perspective
Colossus 2009 GFS II Percolator 2010 Large-scale Incremental
Processing Using Distributed Transactions and Notifications Pregel
2010 A System for Large-Scale Graph Processing Dremel 2010
Interactive Analysis of Web-Scale Datasets Tenzing 2011 A SQL
Implementation On The MapReduce Framework Megastore 2011 Providing
Scalable, Highly Available Storage for Interactive Services Spanner
2012 Google's Globally-Distributed Database F1 2012 The
Fault-Tolerant Distributed RDBMS Supporting Google's Ad Business :
Google researchs 24