Flume: To import Streaming Data
OOZIE: To Schedule the Jobs(Hadoop) and Define
Workflows
ZOOKEEPER: State Maintenance (Controlling Data Locks)
To
store the records, failure of records nd Controlling the data locks.
Map
Reduce: Map
Reduce is a
software framework that allows
developers to write programs that process
massive amounts of unstructured data in parallel across a distributed
cluster of processors or
stand-alone computers.
The
framework is divided into two parts:
Map Process:
In this process input is taken by the master node,which divides it into smaller tasks and distribute them to the workers nodes. The workers nodes process these sub tasks and pass them back to the master node.
In this process input is taken by the master node,which divides it into smaller tasks and distribute them to the workers nodes. The workers nodes process these sub tasks and pass them back to the master node.
Reduce Process :
In this the master node combines all the answers provided by the worker nodes to get the results of the original task. The main advantage of Map reduce is that the map and reduce are performed in distributed mode. Since each operation is independent, so each map can be
In this the master node combines all the answers provided by the worker nodes to get the results of the original task. The main advantage of Map reduce is that the map and reduce are performed in distributed mode. Since each operation is independent, so each map can be
performed in parallel and hence reducing the net
computing time.
This information which you provided is very much useful for us.It was very interesting and useful for hadoop online training.We also providing qa online training institute in USA.
ReplyDeleteThis information which you provided is very much useful for us.It was very interesting and useful for Hadoop training Persons.We also providing ccna ccnp online training institute in worldwide.
ReplyDelete