17
Hadoop Edit by Cassell Hsu 2013.04.19

Pptx present

Embed Size (px)

Citation preview

Page 1: Pptx present

HadoopEdit by Cassell Hsu

2013.04.19

Page 2: Pptx present

Hadoop

Master

Slave

Slave

Slave

Page 3: Pptx present

Hadoop•Master•NameNode•JobTracker•SecondaryNameNode

Page 4: Pptx present

NameNode•Where is NameNode?•Master•HDFS(Hadoop Distributed File

System)•What is NameNode?•資料之位置資訊 •資料之屬性

??

Page 5: Pptx present

NameNode

•位置資訊 ?•所有資料皆存放在 – DataNode

Page 6: Pptx present

DataNode•What is DataNode•存放資料

•Where is DataNode•HDFS•Slaves (and Master)

Page 7: Pptx present

User

DataNode

NameNode

DataNode

DataNode

128Mb

B64Mb

A64Mb

Check hdfs-site.xml

B64Mb

A64Mb

Page 8: Pptx present

Hadoop

DataNodeNameNode

MasterDataNode

DataNode

DataNode

Slaves

Page 9: Pptx present

Hadoop•Master•NameNode•JobTracker•SecondaryNameNode

Page 10: Pptx present

JobTracker•What is JobTracker?•排程工作

•Where is JobTracker?•Master

誰來工作?

Page 11: Pptx present

JobTracker & TaskTrackerJobTracker TaskTracker

Where Master Slaves

What 排程工作 執行工作

Page 12: Pptx present

Hadoop

DataNodeNameNode

MasterDataNode

Slaves

JobTracker

TaskTracker

Page 13: Pptx present

Hadoop•Master•NameNode•JobTracker•SecondaryNameNode

Page 14: Pptx present

SecondaryNameNode

•What is SecondaryNameNode?•NameNode 發生錯誤時補救

•Where is SecondaryNameNode?•Master

Page 15: Pptx present

Hadoop

DataNodeNameNode

MasterDataNode

Slaves

JobTracker

TaskTracker

SecondaryNode

Page 16: Pptx present

MapReduce

User Master

Slave2

Slave1

A

A1

A2Task

NameNode

Task

Result1Result2

ReduceFinal

ResultHDFS

Page 17: Pptx present

MapReduce

•檔案切割•Hadoop 上區塊切割•程式指定