Did you know that you can apply machine learning algorithms to big data quite easily? Spark and its machine learning library, MLlib, make this simple, and it gets even simpler with PySpark, Spark's Python API.
To see how this works in practice, please take a look at this notebook:
Spark_MLlib_Classification