Identification of the Gender of a Vocalist with Mel-spectrogram and 1D CNN. Sample vocal WAV files from DSD100 and ccmixter.org were split into 20s segments. In some cases these files were isolated, in others they were source files. 800 Samples were then fed into a 1D CNN with a train, validation, and test set.
- Librosa
- Numpy
- Pandas
- Keras
- Scikit-learn
- MongoDB
- Python 3.6+
- Scipy
- Chunk into 20s
- Librosa Mel-Spectrogram
- Librosa power_to_db
- Cepstral Mean Normalization
- Flatten
Precision: 81.8%, Recall: 97.6%, Test Accuracy 89.7%
For more information on results, see this post https://mblance.com/blog/identification-of-a-vocalist-s-gender-with-cnn/