Big Data And Hadoop
By easydata
easydata - Online Training
• Big data is an asset, often a complex and ambiguous one.
• Hadoop is a software framework designed to work with that asset.
• Big data refers to large data sets that businesses and other organizations assemble for specific goals and operations.
• Businesses collect this data over a period of time.
• This data may include customer identifiers such as name, Social Security number, age group, or location.
• It may include product information in the form of model numbers, sales numbers, inventory numbers, and complaint numbers.
• It may include customer feedback, from both angry and happy customers.
• All of this can be called big data.
• All of it, however, is raw and unsorted.
• Hadoop is one of the tools designed to handle this raw, unsorted big data.
• Hadoop is used to parse and interpret big data.
• It applies a defined set of algorithms and methods, chiefly the MapReduce programming model, to make sense of that data.
• Hadoop is open-source software released under the Apache License and maintained by a global community of contributors.
• To understand Hadoop, you have to understand two fundamental things.
• They are how Hadoop stores files and how it processes data.
• Hadoop's main components include HDFS and MapReduce.
• HDFS (Hadoop Distributed File System): stores the raw and unsorted data.
• MapReduce: processes that data, or more precisely provides a framework for processing it.
• HDFS == Storage
• MapReduce == Processing
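The processing side can be illustrated with a small sketch. The Python below is not Hadoop code and does not use any Hadoop API; it only imitates, in memory on one machine, the map, shuffle, and reduce steps that the MapReduce model is built around. The sample records, the model numbers, and the function names (map_phase, shuffle, reduce_phase, run_job) are all assumptions made up for this illustration.

```python
from collections import defaultdict

# Hypothetical raw, unsorted records: (model number, customer feedback).
records = [
    ("M-100", "happy"),
    ("M-200", "angry"),
    ("M-100", "angry"),
    ("M-100", "happy"),
    ("M-200", "happy"),
]


def map_phase(record):
    """Map step: emit one ((model, feedback), 1) pair per input record."""
    model, feedback = record
    yield (model, feedback), 1


def shuffle(pairs):
    """Shuffle step: group all emitted values under their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: collapse the values for one key into a single count."""
    return key, sum(values)


def run_job(input_records):
    """Run map, shuffle, and reduce in sequence over the input records."""
    mapped = (pair for record in input_records for pair in map_phase(record))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = run_job(records)
for (model, feedback), total in sorted(counts.items()):
    print(model, feedback, total)
```

In a real Hadoop cluster the input would be read from HDFS and the map and reduce steps would run in parallel on many machines; the sketch keeps everything in one process so the data flow is easy to follow.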
Thank You