Album-Popularity-Predictor

A project for CSC 422 Automated Learning & Data Analysis at NC State.

Introduction

We hoped to be able to predict an albums popularity on the year end Billboard top charts based on various acoustic features. Our models assumed an album was popular if the rank was ≤ 25 or not popular if the rank was > 25.

In order to assess whether or not an album is popular, we utilized different machine learning models:

Naive-Bayes
Decision Tree (utilizing Information Gain and Entropy)
Support Vector Machine
Deep Neural Networks

Dataset

Full Album Data with Acoustic Features (Link to Dataset)

Created using data from:

Acoustic and meta features of albums and songs on the Billboard 200
The Billboard Year End Top Albums List

The first dataset was used for the acoustic features and the the Top Albums List was scraped for the album name

Traning & Testing

We performed a 70/30 training testing spilt and standardized the data

Model Results

Model	Accuracy
Naive Bayes Model (Gaussian)	74.9%
Decision Tree Model (Gini)	86.5%
SVM Model	85.3 %
2-NN + 10-Fold CV	85.58%
Deep Neural Network	86.00%

Setup

Auto Installation using pip!

Make sure you have installed virtualenv, or if not then run pip3 install virtualenv
Create the python three virtual environment virtualenv venv
Start the environment source venv/bin/activate
Automatically install all relevant dependencies using the following command pip install -r requirements.txt

Download Testing and Training Data

Allow dataset_download.sh permission to execute by running

$ chmod +x dataset_download.sh

Download the data byt running

$ ./dataset_download.sh

The training and testing data should be available in data/

Usage

In the root folder of the program run this command to start the virtual environment

$ source venv/bin/activate

After the virtual environment has started run this command to start the program

$ python models/decision_tree.py
$ python models/knn_model.py
$ python models/naive_bayes_model.py
$ python models/neural_net.py
$ python models/svm_model.py

Additional Resources

Data Science for Hit Song Prediction

Song Popularity Predictor

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
data_parsers		data_parsers
models		models
output		output
.gitignore		.gitignore
README.md		README.md
dataset_download.sh		dataset_download.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Album-Popularity-Predictor

Introduction

Dataset

Traning & Testing

Model Results

Setup

Auto Installation using pip!

Download Testing and Training Data

Usage

Additional Resources

About

Releases

Packages

Contributors 4

Languages

alaydeliwala/Album-Popularity-Predictor

Folders and files

Latest commit

History

Repository files navigation

Album-Popularity-Predictor

Introduction

Dataset

Traning & Testing

Model Results

Setup

Auto Installation using pip!

Download Testing and Training Data

Usage

Additional Resources

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages