README.md

July 22, 2024 · View on GitHub

Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding

School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen
*Equal contribution †Corresponding author

arXiv

Updates

:fire: The codes will be released soon

Citation

If you find this work useful for your research, please kindly cite our paper:

@misc{zhang2024token,
      title={Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding}, 
      author={Renshan Zhang, Yibo Lyu, Rui Shao, Gongwei Chen, Weili Guan and Liqiang Nie},
      year={2024},
      eprint={2407.14439},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2407.14439}, 
}