STM Model Zoo Audio Event Detection
I want to train one of the pretrained AI models (say, YAMNet) on my own data. I want to use the MIMII dataset, which consists of .wav recordings of different types of industrial equipment in normal and anomalous working conditions. I only want to test on the valve sounds, so I only need two classes: normal and anomalous. How do I convert the dataset to the ESC-50 format, as described in the tutorial? I see that the ESC-50 dataset contains .csv files as well as .wav files.
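
To make the question concrete, here is a rough sketch of the conversion I have in mind. It is not taken from the tutorial. It assumes the MIMII valve data is unpacked as `valve/id_XX/normal/*.wav` and `valve/id_XX/abnormal/*.wav`, and that the ESC-50 layout is a flat `audio/` folder plus a `meta/esc50.csv` file with the columns `filename, fold, target, category, esc10, src_file, take`. The function name `build_esc50_style_dataset`, the output file names, and the round-robin fold assignment are my own choices. I do not know which of these columns the model zoo actually reads, so the `esc10` and `take` values are placeholders.

```python
import csv
import shutil
from pathlib import Path

# Class name written to the CSV -> MIMII sub-folder name (assumed layout).
SOURCE_DIRS = {"normal": "normal", "anomalous": "abnormal"}
ESC50_COLUMNS = ["filename", "fold", "target", "category", "esc10", "src_file", "take"]


def build_esc50_style_dataset(mimii_root, out_dir, machine="valve", n_folds=5):
    """Copy clips into out_dir/audio and write out_dir/meta/esc50.csv.

    Returns the path of the CSV file that was written.
    """
    mimii_root, out_dir = Path(mimii_root), Path(out_dir)
    audio_dir = out_dir / "audio"
    meta_dir = out_dir / "meta"
    audio_dir.mkdir(parents=True, exist_ok=True)
    meta_dir.mkdir(parents=True, exist_ok=True)

    rows = []
    for target, (category, subdir) in enumerate(SOURCE_DIRS.items()):
        wavs = sorted((mimii_root / machine).glob(f"id_*/{subdir}/*.wav"))
        for i, wav in enumerate(wavs):
            machine_id = wav.parent.parent.name  # e.g. "id_00"
            # MIMII reuses file names across folders, so build a unique one.
            new_name = f"{machine}_{machine_id}_{category}_{wav.name}"
            shutil.copy2(wav, audio_dir / new_name)
            rows.append({
                "filename": new_name,
                "fold": i % n_folds + 1,   # simple round-robin split per class
                "target": target,          # 0 = normal, 1 = anomalous
                "category": category,
                "esc10": False,            # placeholder, not meaningful here
                "src_file": machine_id,    # placeholder: source machine id
                "take": "A",               # placeholder
            })

    csv_path = meta_dir / "esc50.csv"
    with open(csv_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=ESC50_COLUMNS)
        writer.writeheader()
        writer.writerows(rows)
    return csv_path
```

I would call it as `build_esc50_style_dataset("path/to/mimii", "valve_esc50")`, where both paths are made up. The sketch only renames files and writes the metadata. It does not touch the audio itself (channel count, sample rate, clip length), so any resampling or downmixing would still have to happen somewhere else.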
