We've noticed this is not your region.
Redirect me to my region
What do you want to learn today?

HRDF Funded Course - Apache Hadoop Administrator Training

Training by  Tertiary Infotech
Inquire Now
On-Site / Training

Details

This four-day administrator training course for Apache Hadoop provides participants with a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster. From installation and configuration through load balancing and tuning. This training course is the best preparation for the real-world challenges faced by Hadoop administrators.

Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as: • The internals of YARN, MapReduce, and HDFS • Determining the correct hardware and infrastructure for your cluster • Proper cluster configuration and deployment to integrate with the data center • How to load data into the cluster from dynamically-generated files using Flume and from RDBMS using Sqoop • Configuring the FairScheduler to provide service-level agreements for multiple users of a cluster • Best practices for preparing and maintaining Apache Hadoop in production • Troubleshooting, diagnosing, tuning, and solving Hadoop issues

Upon completion of the course, attendees can go for CCAH or HDP Administrator. Certification is a great differentiator; it helps establish you as a leader in the field, providing employers and customers with tangible evidence of your skills and expertise.

HRDF SBL Claimable for Employers Registered with HRD

For more details, please visit 
https://www.tertiarycourses.com.my/apache-hadoop-administrator-training.html

Outline

Module 1: Get Started on Apache Hadoop

  • Why Hadoop?
  • Core Hadoop Components
  • Fundamental Concepts

Module 2: HDFS

  • HDFS Features
  • Writing and Reading Files
  • NameNode Memory Considerations
  • Overview of HDFS Security> Using the Namenode Web UI
  • Using the Hadoop File Shell

Module 3: Getting Data into HDFS

  • Ingesting Data from External Sources with Flume
  • Ingesting Data from Relational Databases with Sqoop
  • Best Practices for Importing Data

Module 4: YARN and MapReduce

  • What Is MapReduce?
  • Basic MapReduce Concepts
  • YARN Cluster Architecture
  • Resource Allocation
  • Failure Recovery
  • Using the YARN Web UI
  • MapReduce Version 1

Module 5: Planning Your Hadoop Cluster

  • General Planning Considerations
  • Choosing the Right Hardware
  • Network Considerations
  • Configuring Nodes
  • Planning for Cluster Management

Module 6: Hadoop Installation and Initial Configuration

  • Deployment Types
  • Installing Hadoop
  • Specifying the Hadoop Configuration
  • Performing Initial HDFS Configuration
  • Performing Initial YARN and MapReduce Configuration
  • Hadoop Logging

Module 7: Installing and Configuring Hive, Impala, and Pig

  • Hive
  • Impala
  • Pig

Module 8: Hadoop Clients

  • What is a Hadoop Client?
  • Installing and Configuring Hadoop Clients
  • Installing and Configuring Hue
  • Hue Authentication and Authorization

Module 9: Cloudera Manager / APACHE Ambari

  • The Motivation for Cloudera Manager /Apache Ambari
  • Cloudera Manager/ Apache Ambari Features
  • Express and Enterprise Versions
  • Cloudera Manager / Apache Ambari Topology
  • Installing Cloudera Manager / Apache Ambari
  • Installing Hadoop Using Cloudera Manager / Apache Ambari
  • Performing Basic Administration Tasks Using Cloudera Manager / Apache Ambari

Module 10: Advanced Cluster Configuration

  • Configuring Hadoop Ports
  • Explicitly Including and Excluding Hosts
  • Configuring HDFS for Rack Awareness
  • Configuring HDFS High Availability

Module 11: Hadoop Security

  • Why Hadoop Security Is Important
  • Hadoop’s Security System Concepts
  • What Kerberos Is and How it Works

Module 12: Cluster Maintenance

  • Checking HDFS Status
  • Copying Data Between Clusters
  • Adding and Removing Cluster Nodes
  • Rebalancing the Cluster
  • Cluster Upgrading

Module 13: Cluster Monitoring and Troubleshooting

  • General System Monitoring
  • Monitoring Hadoop Clusters
  • Common Troubleshooting Hadoop Clusters
  • Common Misconfigurations

Speaker/s

Jason is a native of Kuala Lumpur, Malaysia; studied Bachelor’s Degree in Accounting and Finance from the London School of Economics Program, University of London. Raised in a typical Chinese family with entrepreneurial business background that is involved in manufacturing and real estate development. Worked as an Executive at the Asset and License Management Department in Standard Chartered, Malaysia; promoted to Data Analyst six months later. Later joined Tune Hotels Regional Services, a hotel management and hotel chain operator; served as Senior Revenue Executive. Served as Research Analyst with Wealth-X, a company that provides prospecting, intelligence and wealth due diligence on ultra-high net worth individuals. Thereafter served as Senior Data Analyst with Xchanging Malaysia, a joint venture between Xchanging and YTL Communications to develop and deliver enhanced mobile internet and cloud-based hosting offerings in Malaysia. Currently working as a Data Analyst with GoQuO, a full service e-commerce solutions provider to airlines and OTAs. Community Organizer of Big Data Malaysia, a professional network for individuals with interest in all aspects of Big Data, and Member of the Founder Institute for Malaysian Chapter, the world’s largest entrepreneur training and startup launch program. Occasionally participates in marathons and is an avid off-road cyclist. Passionate about technology, economics and enjoys social events.
Reviews
Be the first to write a review about this course.
Write a Review
Tertiary Courses Malaysia is a HRDF Approved Training Provider in Malaysia. We offers wide range of classroom instructor-led technical training courses for working professionals and executives in Malaysia.

All our courses and trainings are funded by HRDF (Human Resources Development Fund Malaysia). Our courses include Infocomm, Digital Media, Robotics, Semiconductor,Telecommunication, Life Science, Horticulture Industries , and Business Administration . Below are some of our popular courses

  1. Python Programming
  2. R Programming
  3. Tableau
  4. Machine Learning
  5. Raspberry Pi
  6. Arduino
  7. 3D Printing
  8. iOS Apps Development
  9. Android Apps Development
  10. Magento eCommerce
  11. Wordpress
  12. Joomla
  13. Search Engine Optimizatoin
  14. Web Design
  15. Google Analytics
  16. Facebook Marketing
Sending Message
Please wait...
× × Speedycourse.com uses cookies to deliver our services. By continuing to use the site, you are agreeing to our use of cookies, Privacy Policy, and our Terms & Conditions.