Add-on Software Packages
In addition to the four core modules of the Hadoop framework, several add-on software packages are commonly installed alongside a Hadoop system. They are:
Apache Mahout
Apache Mahout provides machine-learning libraries that can be used to derive meaningful patterns from the big data stored in Hadoop clusters.
Apache Hive
Apache Hive is a data warehouse application that works with the Hadoop framework to manage huge datasets in HDFS (Hadoop Distributed File System). It provides an interactive interface in which SQL-like queries (HiveQL) can be used to access and manage the datasets.
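To make "SQL-like queries" concrete, here is a minimal sketch of the kind of aggregate query one might run against a dataset. It is only a stand-in: it uses Python's built-in sqlite3 module rather than Hive, and the table name page_views and its columns are hypothetical. The query text is limited to basic SELECT, GROUP BY, and ORDER BY constructs, which HiveQL also supports.

```python
import sqlite3

# Stand-in only: sqlite3 is NOT Hive. It is used here so the example runs
# anywhere; the table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE page_views (user_id TEXT, url TEXT)")
conn.executemany(
    "INSERT INTO page_views VALUES (?, ?)",
    [("u1", "/home"), ("u2", "/home"), ("u1", "/docs"), ("u3", "/home")],
)

# A SQL-like aggregate query: count the views per URL, most viewed first.
QUERY = """
SELECT url, COUNT(*) AS views
FROM page_views
GROUP BY url
ORDER BY views DESC, url
"""

rows = conn.execute(QUERY).fetchall()
for url, views in rows:
    print(url, views)
```

On a real cluster, a query like this would be submitted through a Hive client, and the data would live in HDFS rather than in an in-memory database.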
Apache Pig
With Apache Pig, data analysis programs can be written in its scripting language, Pig Latin, to analyze the data stored in the Hadoop Distributed File System.
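A Pig program is typically written as a sequence of data-flow steps: load the records, filter them, group them, and compute an aggregate per group. The sketch below mimics that shape in plain Python. It is not Pig Latin and does not touch HDFS; the sample records and field layout are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample records: (user, url, http_status).
records = [
    ("alice", "/home", 200),
    ("bob", "/missing", 404),
    ("alice", "/docs", 200),
    ("carol", "/home", 200),
]

# Step 1 (similar in spirit to FILTER): keep only successful requests.
ok_requests = [r for r in records if r[2] == 200]

# Step 2 (similar in spirit to GROUP ... BY): collect users per URL.
grouped = defaultdict(list)
for user, url, _status in ok_requests:
    grouped[url].append(user)

# Step 3 (similar in spirit to FOREACH ... GENERATE COUNT): hits per URL.
hits_per_url = {url: len(users) for url, users in grouped.items()}
print(hits_per_url)
```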
Apache ZooKeeper
Apache ZooKeeper provides the configuration and coordination service for the Hadoop distributed environment.
Apache HBase
Apache HBase is a column-oriented NoSQL database that runs on top of HDFS. It can store data from sources with differing structures and schemas in one place, and it provides random, real-time read and write access to that data.
Apache Oozie
Apache Oozie helps manage the jobs created in the Hadoop distributed environment. It also acts as a workflow scheduler system that integrates the various components of the Apache Hadoop system.
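The core idea behind a workflow scheduler is that jobs form a dependency graph and each job runs only after the jobs it depends on have finished. The sketch below illustrates that ordering with Python's standard graphlib module (Python 3.9 or later). It is not the Oozie API, and the job names are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical workflow: each job maps to the set of jobs it depends on.
workflow = {
    "ingest": set(),
    "clean": {"ingest"},
    "hive_report": {"clean"},
    "pig_stats": {"clean"},
    "publish": {"hive_report", "pig_stats"},
}


def execution_order(dag):
    """Return one valid run order in which every job follows its dependencies."""
    return list(TopologicalSorter(dag).static_order())


order = execution_order(workflow)
print(order)
```

Jobs with no dependency on each other, such as hive_report and pig_stats here, may appear in either order. A real scheduler could run them in parallel.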
Apache Ambari
Apache Ambari can be used alongside Apache Hadoop to simplify tasks such as provisioning, managing, and monitoring a Hadoop system.