Can you define RDD?

devquora
devquora

Posted On: Feb 22, 2018

 

The acronym for Resale in Distributed Datasheets is RDD. It is a fault-tolerant collection for all of the elements that run parallel. The sorted data in RDD is immutable and primarily of two types –

  • Parallelized collections
  • Hadoop datasets

    Related Questions

    Please Login or Register to leave a response.

    Related Questions

    Apache Spark Interview Questions

    What is Apache Spark?

    Apache Spark is basically a processing framework which is extremely fast and convenient to use...

    Apache Spark Interview Questions

    Can you mention some features of spark?

    On a general note, the most essential features of Apache Spark are-..

    Apache Spark Interview Questions

    Do you know the comparative differences between Apache Spark and Hadoop?

    Yes there are several segments on which they can be differentiated. Few of them are-..