Posted On: Dec 31, 2020
Apache Hadoop is used to process a huge amount of data. The architecture of Apache Hadoop consists of Hadoop components and various technologies which is helpful to solve complex data problems easily.
The description of each component of the architecture of the Hadoop ecosystem are as follows:
Namenode controls the operation of data.
Datanode writes the data to local storage. To store all data at a single place is not always recommended, as it may cause loss of data in case of an outage situation.
Task tracker accepts tasks assigned to the slave node.
The map takes data from a stream and each line is processed after splitting it into various fields.
Reduce: The fields, obtained through Map are grouped together or concatenated with each other.
Never Miss an Articles from us.