A LeNet-5 inspired convolutional network applied to the popular GTZAN musical genre dataset. Audio files are broken into slices of ~3 seconds, re-sampled to 16000 Hz, and an 80-channel log-magnitude Mel spectrogram representation is computed on 25-millisecond windows with a stride of 10 milliseconds.
-
Notifications
You must be signed in to change notification settings - Fork 0
michailmelonas/musical-genre-recog
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published