Apache Spark has emerged over the years as an efficient big data platform with a loyal following of its own. It is often considered a rival of Hadoop, but that is not the case: the two platforms complement each other, and each can be used with or without the other. Leaving that topic for another time, let’s focus on the big...