Friday, June 7, 2013

Echo Systems Definitions

SQOOP: Used to import Data from RDBMS and also Export data into RDBMS.     
Flume:  To import Streaming Data
OOZIE:  To Schedule the Jobs(Hadoop) and Define Workflows


ZOOKEEPER:  State Maintenance (Controlling Data Locks)
 To store the records, failure of records nd Controlling the data locks.

Map Reduce: Map Reduce is a software framework that  allows developers to write programs that process  massive amounts of unstructured data in parallel across a distributed  cluster of processors or stand-alone computers.
The framework is divided into two parts:
Map Process:
In this process input is taken by the master
node,which divides it into smaller tasks and distribute them to the workers nodes. The workers nodes process these sub tasks and pass them back to the master node.
Reduce Process :
In this the master node combines all the answers provided by the worker nodes to get the results of the original task. The main advantage of Map reduce is that the map and
reduce are performed in distributed mode. Since each operation is independent, so each map can be
performed in parallel and hence reducing the net computing time.








2 comments:

  1. This information which you provided is very much useful for us.It was very interesting and useful for hadoop online training.We also providing qa online training institute in USA.

    ReplyDelete
  2. This information which you provided is very much useful for us.It was very interesting and useful for Hadoop training Persons.We also providing ccna ccnp online training institute in worldwide.

    ReplyDelete