Introduction to Apache Spark

Hello friends! In this blog, we will look at why we need Apache Spark and the benefits of using it. So, let's get started.

Basic Info about Spark

The main motivation for creating Spark was that MapReduce is not well suited to iterative and interactive applications, since every MapReduce job reads its input from disk and writes its output back to disk.

That raises another question: what actually is Spark? Spark is a cluster computing framework. It is fast, supports in-memory computation (the biggest advantage of using Spark), and is designed to cover a wide range of workloads such as batch applications, interactive queries, streaming, and machine learning.

In-memory computation means that Spark loads data into RAM, processes it there, and returns the result. Processing data in RAM makes Spark faster because it avoids costly disk I/O operations.

Spark also handles different types of workloads. For example, we can process batches, structured data, streaming data, and graphs, and run machine learning workloads. In contrast to Spark, in Hadoop we can do all the above-mentioned tasks