What Is Spark
Lecture 1 : What is Apache Spark🔗
Unified :🔗
Computing Engine:🔗
Spark is not storage platform we can store the data in hdfs, rdbms etc...
Spark can process terabytes of data in distributed manner.
Compute Cluster:🔗
- each slave has 16 GB RAM, 1TB storage and 4 core CPU
- even master has some data and RAM
- the above cluster can compute 64 gb of data at a time.
- the master divides the data among the slave nodes and then slaves process the data.