Name the operations supported by RDD?

Sharad Jaiswal
Sharad Jaiswal

Posted On: Feb 22, 2018

 

As the major logical data units in Apache Spark, RDD possesses a distributed collection of data. It is a read-only data structure and you cannot change the original format but it can always be transformed into a different form with the changes. The two operations which are supported by RDD are -

  • Transformation - It creates a new RDD from the former one. They are executed only on demand.
  • Actions - The final outcomes of the RDD computations are returned by actions.

    Related Questions

    Please Login or Register to leave a response.

    Related Questions

    Apache Spark Interview Questions

    What is Apache Spark?

    Apache Spark is basically a processing framework which is extremely fast and convenient to use...

    Apache Spark Interview Questions

    Can you mention some features of spark?

    On a general note, the most essential features of Apache Spark are-..

    Apache Spark Interview Questions

    Can you define RDD?

    The acronym for Resale in Distributed Datasheets is RDD...