Predict the rhythm of the genre using Spotify's rich musical dataset. Drawing data from Kaggle, the project is an ensemble of analytical processes aimed at predicting the pulse of the top music genres.
Gear up by installing:
- pandas
- numpy
- matplotlib
- seaborn
- sklearn
To install, use the following pip command:
- pip install pandas numpy matplotlib seaborn scikit-learn
With musical features like tempo, speechiness, and acousticness accentuating each track, the dataset paints a melodious picture. Every song is labelled with its respective genre, and the quest is to predict the genre.
- ID: Track's unique code.
- title: Song's name.
- artist: Maestro behind the melody.
- top genre: The very heart of the song (our target).
- Other sonic signatures like
tempo,speechiness,acousticness, and more.
- Data Tune-up: Initial pre-processing involved removing any discordant missing values and visually inspecting the dataset's composition.
- Harmonizing Features: Used one-hot encoding for artists, realizing that certain maestros have distinct genre imprints.
- Orchestrating Models: Crafted a harmonious ensemble using a
VotingClassifierthat resonates with Logistic Regression, Random Forest, Support Vector Machines, and Extra Trees. - Final Score: Assessed the symphony using accuracy metrics and a confusion matrix.
Our ensemble rendered a sonorous prediction, harmonizing perfectly with the genres. The minimal misclassifications echoed the model's finesse.
- Sync the repository to your local ensemble.
- Steer to the directory echoing with the notebook.
- Initiate Jupyter Notebook and tune into the shared notebook.
- Play the cells in harmony to experience the musical analysis.
