What is distributed Cache in MapReduce?

Sharad Jaiswal
Sharad Jaiswal

Posted On: Mar 02, 2020

 

The Distributed cache in the Hadoop MapReduce framework is used to cache the files that are needed by the applications.

It is used to cache read-only text files, archives, jar files, etc. The cached file in the MapReduce distributed cache is available on each data node where the map/reduce tasks are running. Distributed cache provides many benefits such as storing complex data, eliminating a single point of failure, and ensuring data consistency.

    Related Questions

    Please Login or Register to leave a response.

    Related Questions

    Mapreduce Interview Questions

    What is Hadoop Map Reduce?

    Map Reduce is the core of Hadoop. It is one of the programming paradigms that acknowledge into consideration enormous a..

    Mapreduce Interview Questions

    What is the need of Map Reduce?

    Map Reduce is information handling paradigms in itself. This was one of its kind information handlings and has been tra..

    Mapreduce Interview Questions

    Clarify what is shuffling in Map Reduce?

    The procedure by which the framework lays out the sort and transfers the map outputs to the reducer as sources of inform..