README.md

May 7, 2024 ยท View on GitHub

This repository contains the CSV files for the processed dataset used to train VoiceLDM. These files include the transcriptions generated using the Whisper model.

Speech Segments

  • as_speech_en.csv
  • cv1.csv (cv.csv has been split into two due to file size limitations on GitHub.)
  • cv2.csv
  • voxceleb.csv

Non-Speech Segments

  • as_noise.csv
  • noise_demand.csv

Evaluation Segments

Additionally, I've included the CSV file corresponding to the ac_filtered test set, which was specifically used to evaluate VoiceLDM.

  • ac_filtered.csv