GitHub - fotwo/sampleCNN-pytorch: Pytorch implementation of "Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms"

Sample-level Deep CNN

PyTorch implementation of "Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms"

Data

Used tag annotations and audio data

Model

3^9 model with an input size of 59049 samples (3^10)
3 : stride length of the first conv layer (together with a filter size of 3, it reduces the input length from 59049 to 19683 = 3^9)
9 : number of hidden conv layers
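
The length arithmetic above can be checked with a short sketch. It assumes, following the paper's description rather than anything stated in this README, that each of the 9 hidden layers is a length-preserving convolution (filter size 3, stride 1, padding 1) followed by max pooling of size 3. The function names are hypothetical and do not come from model.py.

```python
def conv_out_len(length, kernel, stride, padding=0):
    """Output length of a 1-D convolution (dilation 1)."""
    return (length + 2 * padding - kernel) // stride + 1


def sample_cnn_lengths(input_len=59049, n_hidden=9, pool=3):
    """Temporal length after the first strided conv and after each hidden layer.

    Assumption: each hidden layer keeps the length (kernel 3, stride 1,
    padding 1) and is followed by max pooling of size `pool`.
    """
    # First conv layer: filter size 3, stride 3, no padding.
    lengths = [conv_out_len(input_len, kernel=3, stride=3)]
    for _ in range(n_hidden):
        after_conv = conv_out_len(lengths[-1], kernel=3, stride=1, padding=1)
        lengths.append(after_conv // pool)  # max pooling divides the length by 3
    return lengths


if __name__ == "__main__":
    print(sample_cnn_lengths())
```

Under these assumptions the length goes from 59049 to 19683 after the first layer and shrinks by a factor of 3 per hidden layer until a single time step remains.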

Procedures

Data processing
- audio (reads the audio signal from the mp3 files and saves it as npy): python process_audio.py
- annotation (merges redundant tags and selects the top N=50 tags): python process_annotations.py
  - this creates and saves the train/valid/test annotation files
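
The annotation step can be illustrated with a minimal sketch of merging redundant tags and keeping the N most frequent ones. This is not the code in process_annotations.py; the function name, the synonym mapping and the toy data are hypothetical and only show the idea.

```python
from collections import Counter


def top_n_tags(annotations, synonyms, n=50):
    """Merge redundant tags, then keep only the n most frequent tags.

    annotations: dict mapping a clip id to its list of tags
    synonyms:    dict mapping a redundant tag to its canonical form (hypothetical)
    """
    # Replace each tag by its canonical form and drop duplicates per clip.
    merged = {
        clip: sorted({synonyms.get(tag, tag) for tag in tags})
        for clip, tags in annotations.items()
    }
    # Count how many clips carry each tag and keep the n most common.
    counts = Counter(tag for tags in merged.values() for tag in tags)
    top = [tag for tag, _ in counts.most_common(n)]
    keep = set(top)
    filtered = {
        clip: [tag for tag in tags if tag in keep]
        for clip, tags in merged.items()
    }
    return top, filtered


if __name__ == "__main__":
    toy = {
        "a": ["female vocal", "rock"],
        "b": ["female vocals", "rock", "guitar"],
        "c": ["rock"],
    }
    print(top_n_tags(toy, {"female vocals": "female vocal"}, n=2))
```

The filtered annotations would then be split into train/valid/test files; how that split is made is not specified here.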
Training
- python main.py --device_num 0
Testing
- predict tags for given songs
- python evaluate.py --device_num 0

Tag prediction

python eval_tags.py --device_num 0 --mp3_file "path/to/mp3file/to/predict.mp3"
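
Because the model takes a fixed input of 59049 samples, a full song has to be cut into segments before tags can be predicted. The sketch below shows one common approach, which is an assumption and not confirmed to be what eval_tags.py does: split the waveform into non-overlapping segments, score each segment, and average the scores per tag. All function names are hypothetical, and the per-segment scores are passed in as plain lists instead of coming from the model.

```python
SEGMENT_LEN = 59049  # input sample size of the 3^9 model


def split_into_segments(samples, segment_len=SEGMENT_LEN):
    """Cut a waveform into non-overlapping segments, dropping the remainder."""
    n_full = len(samples) // segment_len
    return [samples[i * segment_len:(i + 1) * segment_len] for i in range(n_full)]


def average_scores(segment_scores):
    """Average per-tag scores over all segments of one song."""
    if not segment_scores:
        raise ValueError("need at least one segment")
    n_segments = len(segment_scores)
    n_tags = len(segment_scores[0])
    return [sum(s[k] for s in segment_scores) / n_segments for k in range(n_tags)]


def top_k_tags(scores, tag_names, k=5):
    """Return the k tag names with the highest averaged score."""
    ranked = sorted(zip(scores, tag_names), key=lambda pair: pair[0], reverse=True)
    return [name for _, name in ranked[:k]]


if __name__ == "__main__":
    song_scores = average_scores([[0.0, 1.0], [0.5, 0.5]])
    print(top_k_tags(song_scores, ["rock", "guitar"], k=1))
```

In practice the tag names would come from a file such as 50_tags.txt and the segment scores from the trained model.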

