README.md

August 19, 2025 ยท View on GitHub

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Faegheh Sardari, Armin Mustafa, Philip J.B. Jacksonn, Adrian Hilton

Code for ECCV 2024 paper CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing




Prerequisites

  • Linux
  • Python 3
  • CPU or NVIDIA GPU + CUDA CuDNN

Data Preparation

  1. Create a folder named 'features' inside the data folder
  2. Download the audio and visual features from https://github.com/YapengTian/AVVP-ECCV20 and transfer them to the 'features' folder.

Train & Test

Run main.py

Test our pretrianed model

Run main.py --mode test

Project | Paper

Citation

If you use this code for your research, please cite our papers.

@article{sardari2024coleaf,
  title={CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing},
  author={Sardari, Faegheh and Mustafa, Armin and Jackson, Philip JB and Hilton, Adrian},
  journal={European Conference on Computer Vision},
  year={2024}
} 

License

Copyright (C) 2024 University of Surrey. The code repository is published under the CC-BY-NC 4.0 license.

Acknowledgments

This repository includes the modified codes from:

We are grateful to the creators of these repositories.

This research is also supported by UKRI EPSRC Platform Grant EP/P022529/1, and EPSRC BBC Prosperity Partnership AI4ME: Future Personalised ObjectBased Media Experiences Delivered at Scale Anywhere EP/V038087/1.