RECODE for SGG in Pytorch

October 21, 2025 · View on GitHub

Our paper Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models has been accepted by NIPS 2023.

Installation

Check INSTALL.md for installation instructions.

Dataset

Check DATASET.md for instructions of dataset preprocessing.

Extract CLIP Visual Features

bash scripts/extract_clip_obj_feature.sh

Generate Spatial Images and Offline Spatial Logits

bash scripts/draw_imgs_and_generate_spatial_logits.sh

Inference with RECODE

bash scripts/infer.sh

Generated Files

We provide the extracted clip visual feature, visual cue descriptions, and some spatial information, you can download from here*.

Due to the expiration of the previous cloud disk membership, the data has been cleared. The new link is as here*.

Citations

If you find this project helps your research, please kindly consider citing our paper in your publications.

@article{li2023zero,
  title={Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models},
  author={Li, Lin and Xiao, Jun and Chen, Guikun and Shao, Jian and Zhuang, Yueting and Chen, Long},
  journal={arXiv preprint arXiv:2305.12476},
  year={2023}
}

Credits

Our codebase is based on Scene-Graph-Benchmark.pytorch.