Skip to content

What Is Spark

Lecture 1 : What is Apache Spark🔗

image

Unified :🔗

image

Computing Engine:🔗

image

Spark is not storage platform we can store the data in hdfs, rdbms etc...

Spark can process terabytes of data in distributed manner.

Compute Cluster:🔗

image

  • each slave has 16 GB RAM, 1TB storage and 4 core CPU
  • even master has some data and RAM
  • the above cluster can compute 64 gb of data at a time.
  • the master divides the data among the slave nodes and then slaves process the data.