AnyRef

December 26, 2024 · View on GitHub

[CVPR 2024] Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception [paper]