README.md
May 7, 2024
This repository contains the CSV files for the processed dataset used to train VoiceLDM. These files include the transcriptions generated using the Whisper model.
Speech Segments
as_speech_en.csv
cv1.csv (cv.csv has been split into two files due to file size limitations on GitHub.)
cv2.csv
voxceleb.csv
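Since cv.csv is shipped as two parts, you may want to rejoin them before use. The following is a minimal sketch using only the Python standard library. It assumes that both cv1.csv and cv2.csv begin with the same header row, which this README does not confirm, so check the files first. The helper name `merge_csv_parts` is hypothetical and not part of this repository.

```python
import csv
from pathlib import Path


def merge_csv_parts(part_paths, merged_path):
    """Concatenate CSV parts into one file and return the number of data rows.

    Assumption: every part starts with the same header row. The header is
    written once, and a part with a different header raises ValueError.
    """
    rows_written = 0
    header = None
    with open(merged_path, "w", newline="", encoding="utf-8") as out_file:
        writer = csv.writer(out_file)
        for part_path in part_paths:
            with open(part_path, newline="", encoding="utf-8") as in_file:
                reader = csv.reader(in_file)
                part_header = next(reader, None)
                if part_header is None:
                    continue  # empty part, nothing to copy
                if header is None:
                    header = part_header
                    writer.writerow(header)
                elif part_header != header:
                    raise ValueError(f"{part_path} has a different header row")
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written


if __name__ == "__main__":
    # Only runs the merge if both parts are present in the working directory.
    if Path("cv1.csv").exists() and Path("cv2.csv").exists():
        count = merge_csv_parts(["cv1.csv", "cv2.csv"], "cv.csv")
        print(f"Wrote {count} rows to cv.csv")
```

If the second part turns out to have no header row, the header check will fail, and the rows of that part should instead be appended as-is.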
Non-Speech Segments
as_noise.csv
noise_demand.csv
Evaluation Segments
Additionally, I've included the CSV file corresponding to the ac_filtered test set, which was specifically used to evaluate VoiceLDM.
ac_filtered.csv