The Collection of Data
We collected our data from three sources: music data from Spotify, text data from Wikipedia, and connection data from AllMusic.com.
Spotify
Spotify provides APIs that let developers use its service and build their own applications for end users. To support better recommendations, these APIs expose in-depth audio features on a normalized scale. With this data, we can analyze patterns across genres, artists, eras, and more.
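As a rough illustration of the kind of pattern analysis this enables, the sketch below averages two audio features per genre. The track records are made-up placeholder values, not our collected data, and the `genre` label is an assumption: it would have to be joined onto the audio features from elsewhere. The helper name `mean_features_by_genre` is hypothetical.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: placeholder values, not real Spotify data.
# "danceability" and "energy" are audio features on a 0-1 scale;
# the "genre" label is assumed to come from a separate source.
tracks = [
    {"genre": "rock", "danceability": 0.50, "energy": 0.90},
    {"genre": "rock", "danceability": 0.40, "energy": 0.80},
    {"genre": "jazz", "danceability": 0.60, "energy": 0.30},
    {"genre": "jazz", "danceability": 0.70, "energy": 0.40},
]


def mean_features_by_genre(rows, features):
    """Average each requested feature over the tracks of every genre."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["genre"]].append(row)
    return {
        genre: {f: mean(r[f] for r in members) for f in features}
        for genre, members in grouped.items()
    }


profiles = mean_features_by_genre(tracks, ["danceability", "energy"])
for genre, profile in profiles.items():
    print(genre, profile)
```

Because the features share a normalized scale, per-genre averages like these can be compared directly. The same grouping would work for artists or eras by changing the grouping key.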
Click to download the data
Wikipedia
We also collected text data from Wikipedia, specifically two parts of each musician's article: the summary and the influence section. By applying text clustering to the summaries, we can analyze the similarities and differences between musicians and draw a portrait of each one.
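One possible first step toward such clustering is a pairwise similarity score between summaries. The sketch below uses bag-of-words vectors and cosine similarity, which is only one of many options and not necessarily the method used in this project. The summaries are invented placeholder strings, and the function names are hypothetical.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count each alphabetic word."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Invented placeholder summaries, not text taken from Wikipedia.
summaries = {
    "musician_a": "blues guitarist and singer",
    "musician_b": "blues guitarist and songwriter",
    "musician_c": "electronic music producer",
}

vectors = {name: bag_of_words(text) for name, text in summaries.items()}
sim_ab = cosine_similarity(vectors["musician_a"], vectors["musician_b"])
sim_ac = cosine_similarity(vectors["musician_a"], vectors["musician_c"])
print(sim_ab, sim_ac)
```

A clustering algorithm would then group musicians whose summaries score high against each other. On real summaries, stop-word removal or TF-IDF weighting would likely be needed so that common words do not dominate the score.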
Click to download the data
AllMusic.com
We also found a dataset that records which artists musicians themselves name as their influences. It is a very dense dataset that maps the connections between musicians who are considered to have influenced one another.
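Connection data of this kind can be treated as a directed graph. The sketch below assumes the dataset can be reduced to (follower, influence) pairs, which is an assumption about its layout; the artist names are placeholders and the function names are hypothetical.

```python
from collections import Counter

# Hypothetical (follower, influence) pairs: the follower names the
# influence as someone who shaped their music. Placeholder names only.
claims = [
    ("artist_b", "artist_a"),
    ("artist_c", "artist_a"),
    ("artist_c", "artist_b"),
    ("artist_d", "artist_b"),
    ("artist_d", "artist_a"),
]


def build_influence_graph(pairs):
    """Map every artist to the set of artists they name as influences."""
    graph = {}
    for follower, influence in pairs:
        graph.setdefault(follower, set()).add(influence)
        graph.setdefault(influence, set())  # keep artists who name no one
    return graph


def count_followers(graph):
    """Count how many artists name each artist as an influence."""
    counts = Counter()
    for influences in graph.values():
        counts.update(influences)
    return counts


graph = build_influence_graph(claims)
followers = count_followers(graph)
print(followers.most_common())
```

Storing influences as sets removes duplicate claims, and the follower count gives a simple first ranking of the most frequently cited artists.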
Click to download the data