Problem Statement: Design a horizontally scalable and highly concurrent Movie Ticket Booking Platform with the following features.

Continuing from my last blog, I added some more features to the MovieBuzz project. Here is the link to the previous blog:

New Features

In any machine learning model, the objective is to find the cost function for that algorithm and then minimize it. In simple terms, the higher the cost of the model, the worse the algorithm performs, and vice versa. The cost function is made up of some parameters. …
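As a rough illustration of "find the cost function, then minimize it", here is a minimal sketch in plain Python. It assumes a one-variable linear model and mean squared error as the cost; the toy data, the learning rate, and the function names `mse_cost` and `gradient_step` are made up for this example and are not taken from the blog itself.

```python
def mse_cost(w, b, xs, ys):
    """Mean squared error of the line y = w*x + b over the data."""
    n = len(xs)
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / n


def gradient_step(w, b, xs, ys, lr=0.05):
    """One gradient-descent update of the parameters w and b."""
    n = len(xs)
    grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
    grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
    return w - lr * grad_w, b - lr * grad_b


# Toy data (an assumption for this sketch), generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

w, b = 0.0, 0.0                      # start from an arbitrary guess
initial_cost = mse_cost(w, b, xs, ys)

for _ in range(2000):                # repeatedly step downhill on the cost
    w, b = gradient_step(w, b, xs, ys)

final_cost = mse_cost(w, b, xs, ys)
print(f"w={w:.3f}, b={b:.3f}, cost {initial_cost:.3f} -> {final_cost:.6f}")
```

The parameters here are `w` and `b`: the cost is a function of them, and a lower cost means the line fits the data better.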

Structured Streaming is a scalable and fault-tolerant stream processing engine built on the Spark SQL engine. In short, Structured Streaming provides fast, scalable, fault-tolerant, end-to-end exactly-once stream processing without the user having to reason about streaming.

Stateful stream processing means that a “state” is shared between events and therefore past…
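To make the idea of shared state concrete, here is a small sketch in plain Python. It is not Spark's Structured Streaming API; it only assumes a stream of string keys and keeps a running count per key, so that the result for each event depends on the events seen before it. The name `running_counts` and the sample events are hypothetical.

```python
from collections import defaultdict


def running_counts(events):
    """Yield (key, count so far) for each event in arrival order."""
    state = defaultdict(int)         # the "state" shared between events
    for key in events:
        state[key] += 1              # update state using the new event
        yield key, state[key]        # output depends on past events too


# Hypothetical stream of events, each identified by a key.
results = list(running_counts(["a", "b", "a", "a", "b"]))
print(results)
```

A stateless version would emit the same output for every `"a"`; here the third `"a"` produces a different result from the first because the earlier events are remembered.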

In this blog, we’ll analyze the ‘Google Play Store Apps User Reviews’ dataset, which is available for free. You can find the dataset at this link:

Here is the link to download the dataset as a zip file:

This dataset consists of two CSV files…

Bhushan Gosavi

Big Data Engineer. Software Engineer. Datastax Certified Cassandra Developer.
