🎻 Genre Classifier

Train a classifier that accepts the name of a musician and predicts the most likely genre of their music out of the set:

{ jazz, opera, country, electronic, metal, rap, classical, reggae }

You will train this classifier to operate on top of a pretrained text encoder that converts artist names into embedding vectors. In support of this task, you will assemble a small dataset of artist names for each genre using GPT.

Steps

Create a dataset of artists by genre and save to JSON. We suggest using the OpenAI GPT API and will provide you an access key.

python create_dataset.py --count 20 --output data.json

Compute an embedding vector from each artist name and save to disk. We suggest using the text encoder from open_clip and saving as a pandas dataframe.

python compute_embeddings.py --input data.json --output embeddings.pkl

Visualize the embeddings in a 2D projection space using umap. We provide the code for this.

python visualize_embeddings.py --input embeddings.pkl

Train a simple classifier to predict the genre from the embedding vector of the artist's name. It is up to you to pick an architecture, train, and evaluate the model.

python train_classifier.py --input embeddings.pkl

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
compute_embeddings.py		compute_embeddings.py
create_dataset.py		create_dataset.py
requirements.txt		requirements.txt
train_classifier.py		train_classifier.py
visualize_embeddings.py		visualize_embeddings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎻 Genre Classifier

Steps

About

Uh oh!

Releases

Packages

Languages

ysaatchi/genre-classifier

Folders and files

Latest commit

History

Repository files navigation

🎻 Genre Classifier

Steps

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages