Spark Transformations

Lecture 17: DataFrame Transformations in Spark Part I

Data in a DataFrame is stored as Row() objects, in the form of bytes.

Columns are expressions. An expression is a set of transformations on one or more values in a record.

Ways to select values / columns


Column Manipulations


Other methods

selectExpr

Aliasing Columns

Lecture 18: DataFrame Transformations in Spark Part II

filter() / where(): no difference


Multiple filter conditions


Literals in Spark

A literal is used to pass the same constant value to every row of a column.

Adding Columns

If a column with the same name already exists, it is overwritten.

Renaming Columns
