4
GreyCampus provides the course Online self-learning on Hadoop Administration. The course is intended for System Administrators, DBA’s, Linux admins and Software engineers responsible for managing and maintaining Hadoop clusters. This is designed to provide knowledge for the aspirants who want to become a successful Hadoop Administrator. This course covers Hadoop architecture and its components, Managing, Maintaining, Monitoring and Troubleshooting a Hadoop Cluster. The focus of this course is to give the participants hands on experience, so there would be multiple assignments, quizzes and a project. Online self-learner will get the required support from the livesupport team. COURSE OBJECTIVES Upon successful completion of this course, participants should be able to: f Describe the fundamental concepts of using Big Data f Identify where Hadoop fits into Big Data f Hadoop Architecture and HDFS f Gain insight on YARN and MapReduce f Installing and Configuring Apache Ecosystem Tools f Configuration and Performance Tuning f Learn about Hadoop Cluster f Manage, Maintain, Monitor and Troubleshoot a Hadoop Cluster COURSE INCLUSION ONE YEAR ACCESS Participants will have access to GreyCampus learn platform for a period of one year, this includes access to the Course PPTs, reading material, quizzes, assignments, project and class videos FACT SHEET HADOOP ADMINISTRATOR TRAINING & CERTIFICATION TRAINING & CERTIFICATION - ONLINE SELF LEARNING © www.greycampus.com

TRAINING & CERTIFICATION - ONLINE SELF LEARNING€¦ · module 2: hadoop architecture and hdfs • hive • pig • mahout • hbase • hcatalog/hive • hbase administration module

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: TRAINING & CERTIFICATION - ONLINE SELF LEARNING€¦ · module 2: hadoop architecture and hdfs • hive • pig • mahout • hbase • hcatalog/hive • hbase administration module

GreyCampus provides the course Online self-learning on Hadoop Administration. The course is intended for System Administrators, DBA’s, Linux admins and Software engineers responsible for managing and maintaining Hadoop clusters. This is designed to provide knowledge for the aspirants who want to become a successful Hadoop Administrator. This course covers Hadoop architecture and its components, Managing, Maintaining, Monitoring and Troubleshooting a Hadoop Cluster. The focus of this course is to give the participants hands on experience, so there would be multiple assignments, quizzes and a project. Online self-learner will get the required support from the livesupport team.

COURSE OBJECTIVESUpon successful completion of this course, participants should be able to:

f Describe the fundamental concepts of using Big Data

f Identify where Hadoop fits into Big Data

f Hadoop Architecture and HDFS

f Gain insight on YARN and MapReduce

f Installing and Configuring Apache Ecosystem Tools

f Configuration and Performance Tuning

f Learn about Hadoop Cluster

f Manage, Maintain, Monitor and Troubleshoot a Hadoop Cluster

COURSE INCLUSIONONE YEAR ACCESSParticipants will have access to GreyCampus learn platform for a period of one year, this includes access to the Course PPTs, reading material, quizzes, assignments, project and class videos

FACT SHEET

HADOOP ADMINISTRATOR TRAINING & CERTIFICATION

TRAINING & CERTIFICATION - ONLINE SELF LEARNING

© www.greycampus.com

Page 2: TRAINING & CERTIFICATION - ONLINE SELF LEARNING€¦ · module 2: hadoop architecture and hdfs • hive • pig • mahout • hbase • hcatalog/hive • hbase administration module

DEDICATED SUPPORTParticipants will get the Technical and Nontechnical support through email within 1 business day. Participants can send their queries at [email protected] or they can call the toll free num: 1800 102 0723.

VIRTUAL MACHINEParticipants will be provided instructions to set up their Virtual Machine before the course starts.

HANDS ON PROJECTAt the end of the course participants should submit a project which covers all the key aspects of the course. This allows them to implement techniques they learnt in the course.

COURSE CERTIFICATIONAfter completing 30 hrs of training participants will be provided a Project which they have to submit within 15 days. A successful completion of the project would make the participants eligible for the GreyCampus certificate.

30 PDUS30 PDUs will be sent to PMI credential holders within 2 business days upon request.

2

© www.greycampus.com

Page 3: TRAINING & CERTIFICATION - ONLINE SELF LEARNING€¦ · module 2: hadoop architecture and hdfs • hive • pig • mahout • hbase • hcatalog/hive • hbase administration module

COURSE AGENDA 3

MODULE 1: UNDERSTANDING BIG DATA AND HADOOP

• Big Data

• Limitations and Solutions of existing Data

Analytics Architecture

• Hadoop

• Hadoop Features

• Hadoop Ecosystem

• Hadoop 2.x core components

• Hadoop Storage: HDFS

• Hadoop Processing: MapReduce Framework

• Anatomy of File Write and Read

• Rack Awareness

MODULE 2: HADOOP ARCHITECTURE AND HDFS

• Hive

• Pig

• Mahout

• HBase

• Hcatalog/Hive

• Hbase Administration

MODULE 5: INSTALLING AND CONFIGURING APACHE ECOSYSTEM TOOLS

MODULE 6: ADVANCED CLUSTER CONFIGURATION

MODULE 7: HADOOP SECURITY

MODULE 8: MANAGING AND SCHEDULING JOBS

MODULE 9: CONFIGURATION AND PERFORMANCE TUNING

• OS

• JVM and Hadoop configuration parameters tuning

MODULE 10: INSTALLING AND CONFIGURING APACHE ECOSYSTEM TOOLS

• Checking HDFS Status

• Copying Data between Clusters

• Adding and Removing Cluster Nodes

• Rebalancing the Cluster

• Cluster Upgrading

• General System Monitoring

• Monitoring Hadoop Clusters

• Common Troubleshooting Hadoop Clusters

• Common Misconfigurations

• Checking Logs and Log File Locations

• Managing Running Jobs

• Scheduling Hadoop Jobs

• Configuring the Fair schedulers

• Why Hadoop Security Is Important

• Hadoop’s Security System Concepts

• What Kerberos is and How it Works

• Securing a Hadoop Cluster with Kerberos

• Advanced Configuration Parameters

• Configuring Hadoop Ports

• Explicitly Including and Excluding Hosts

• Configuring HDFS for Rack Awareness

• Configuring HDFS High Availability

MODULE 3: YARN AND MAPREDUCE

MODULE 4: LOAD DATA AND RUN APPLICATIONS

• Data Loading Techniques: Hadoop Copy Commands

• FLUME

• SQOOP

• What Is MapReduce?

• Basic MapReduce Concepts

• YARN Cluster Architecture

• Resource Allocation

• Failure Recovery

• Using the YARN Web UI

• MapReduce Version 1

• Hadoop 2.x Cluster Architecture - Federation and

High Availability

• A Typical Production Hadoop Cluster

• Hadoop Cluster Modes

• Common Hadoop Shell Commands

• Installation of Hadoop on Single Node/Multi

Cluster environment

• Hadoop 2.x Configuration Files

• Password-Less SSH

• MapReduce Job Execution

• Data Loading Techniques: Hadoop Copy Commands

• FLUME

• SQOOP

• Node roles

• Data Processing

• Network configuration

Page 4: TRAINING & CERTIFICATION - ONLINE SELF LEARNING€¦ · module 2: hadoop architecture and hdfs • hive • pig • mahout • hbase • hcatalog/hive • hbase administration module

© www.greycampus.com

TRAINED OVER 15,000PROFESSIONALS

REACH ACROSS50+ COUNTRIES

EXAM PASS RATE OFOVER 97 %

COURSES ACCREDITED BY LEADING GLOBAL BODIES

ABOUT GREYCAMPUS

GreyCampus is a leading provider of on-demand training that address the unique learning needs of professionals, delivered as online self-learning, live online training or in-person classroom training. Our aim is to provide quality training enabling professionals to achieve their certification and career enhancement goals. We offer training for certifications in areas of Big Data & Hadoop, Project Management, IT Service Management, Quality Management, Python Programming, Agile Training Coaching & Certification and Workplace Tools.

DISCLAIMER

“PMI®”, “PMBOK®”, “PMP®” “CAPM®” and “PMI-ACP®” are registered marks of the Project Management Institute, Inc.

The Swirl logo™ is a trade mark of AXELOS Limited.ITIL® is a registered trade mark of AXELOS Limited.PRINCE2® is a Registered Trade Mark of AXELOS Limited.

ACCREDITATIONS & ASSOCIATIONS

Provider ID : 3871