July 22, 2024
Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding
School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen
*Equal contribution †Corresponding author
Updates
- [07/2024] arXiv paper released.
:fire: The code will be released soon.
Citation
If you find this work useful for your research, please cite our paper:
@misc{zhang2024token,
title={Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding},
author={Renshan Zhang and Yibo Lyu and Rui Shao and Gongwei Chen and Weili Guan and Liqiang Nie},
year={2024},
eprint={2407.14439},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2407.14439},
}