HRDF Funded Course - Apache Hadoop Administrator Training
Details
This four-day administrator training course for Apache Hadoop provides participants with a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster. From installation and configuration through load balancing and tuning. This training course is the best preparation for the real-world challenges faced by Hadoop administrators.
Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as: • The internals of YARN, MapReduce, and HDFS • Determining the correct hardware and infrastructure for your cluster • Proper cluster configuration and deployment to integrate with the data center • How to load data into the cluster from dynamically-generated files using Flume and from RDBMS using Sqoop • Configuring the FairScheduler to provide service-level agreements for multiple users of a cluster • Best practices for preparing and maintaining Apache Hadoop in production • Troubleshooting, diagnosing, tuning, and solving Hadoop issues
Upon completion of the course, attendees can go for CCAH or HDP Administrator. Certification is a great differentiator; it helps establish you as a leader in the field, providing employers and customers with tangible evidence of your skills and expertise.
HRDF SBL Claimable for Employers Registered with HRD
For more details, please visithttps://www.tertiarycourses.com.my/apache-hadoop-administrator-training.html
Outline
Module 1: Get Started on Apache Hadoop
- Why Hadoop?
- Core Hadoop Components
- Fundamental Concepts
Module 2: HDFS
- HDFS Features
- Writing and Reading Files
- NameNode Memory Considerations
- Overview of HDFS Security> Using the Namenode Web UI
- Using the Hadoop File Shell
Module 3: Getting Data into HDFS
- Ingesting Data from External Sources with Flume
- Ingesting Data from Relational Databases with Sqoop
- Best Practices for Importing Data
Module 4: YARN and MapReduce
- What Is MapReduce?
- Basic MapReduce Concepts
- YARN Cluster Architecture
- Resource Allocation
- Failure Recovery
- Using the YARN Web UI
- MapReduce Version 1
Module 5: Planning Your Hadoop Cluster
- General Planning Considerations
- Choosing the Right Hardware
- Network Considerations
- Configuring Nodes
- Planning for Cluster Management
Module 6: Hadoop Installation and Initial Configuration
- Deployment Types
- Installing Hadoop
- Specifying the Hadoop Configuration
- Performing Initial HDFS Configuration
- Performing Initial YARN and MapReduce Configuration
- Hadoop Logging
Module 7: Installing and Configuring Hive, Impala, and Pig
- Hive
- Impala
- Pig
Module 8: Hadoop Clients
- What is a Hadoop Client?
- Installing and Configuring Hadoop Clients
- Installing and Configuring Hue
- Hue Authentication and Authorization
Module 9: Cloudera Manager / APACHE Ambari
- The Motivation for Cloudera Manager /Apache Ambari
- Cloudera Manager/ Apache Ambari Features
- Express and Enterprise Versions
- Cloudera Manager / Apache Ambari Topology
- Installing Cloudera Manager / Apache Ambari
- Installing Hadoop Using Cloudera Manager / Apache Ambari
- Performing Basic Administration Tasks Using Cloudera Manager / Apache Ambari
Module 10: Advanced Cluster Configuration
- Configuring Hadoop Ports
- Explicitly Including and Excluding Hosts
- Configuring HDFS for Rack Awareness
- Configuring HDFS High Availability
Module 11: Hadoop Security
- Why Hadoop Security Is Important
- Hadoop’s Security System Concepts
- What Kerberos Is and How it Works
Module 12: Cluster Maintenance
- Checking HDFS Status
- Copying Data Between Clusters
- Adding and Removing Cluster Nodes
- Rebalancing the Cluster
- Cluster Upgrading
Module 13: Cluster Monitoring and Troubleshooting
- General System Monitoring
- Monitoring Hadoop Clusters
- Common Troubleshooting Hadoop Clusters
- Common Misconfigurations
Speaker/s
All our courses and trainings are funded by HRDF (Human Resources Development Fund Malaysia). Our courses include Infocomm, Digital Media, Robotics, Semiconductor,Telecommunication, Life Science, Horticulture Industries , and Business Administration . Below are some of our popular courses
- Python Programming
- R Programming
- Tableau
- Machine Learning
- Raspberry Pi
- Arduino
- 3D Printing
- iOS Apps Development
- Android Apps Development
- Magento eCommerce
- Wordpress
- Joomla
- Search Engine Optimizatoin
- Web Design
- Google Analytics
- Facebook Marketing