Official Implementation of
"Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation"
For simplicity, you can directly run:
bash install.shwhich includes the following steps:
- Install PyTorch 1.9.1 and other dependencies:
pip install torch==1.9.1+cu111 torchvision==0.10.1+cu111 torchaudio==0.9.1 -f https://download.pytorch.org/whl/torch_stable.html
pip install -r requirements.txt(Adjust CUDA version if necessary.)
- Install GroundingDINO:
cd GroundingDINO && python3 setup.py installPrepare the dataset under data/ folder following the instruction.
bash scripts/gen_sg_triplets.shbash scripts/gen_pseudo_triplets.shbash scripts/train.shAdjust
CUDA_VISIBLE_DEVICESif needed. Effective batch size = batch size × number of GPUs.
bash scripts/DINO_eval.sh vg [config file] [data path] [output path] [checkpoint]The checkpoints are released at here.
We thank:
- Scene-Graph-Benchmark.pytorch
- GroundingDINO
- OvSGTR for their awesome open-source codes and models.
If you find our work helpful, please cite:
@inproceedings{chen2024expanding,
title={Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation},
author={Li, Lin and Zhang, Chuhan and Zhang, Dong and Sun, Chong and Li, Chen and Chen, Long},
booktitle={NeurIPS},
year={2025}
}