In this notebook we will be working with spotify songs Dataset from Kaggle. Specifically we will work with nested data types where the columns are of type ARRAYS or MAPS.
Recently, I needed to work with Spark dataframes having Map datatypes for one of our projects. I realized that
Array are the two most commonly used datatypes. So, I explored in detail how can we
implode columns of
map datatypes. I created this notebook to be a handy reference for myself. Please feel free to checkout this notebook on if you also…
Machine Learning Platform Engineer at Lyft Inc.