MouSi: Poly-Visual-Expert Vision-Language Models
March 7, 2024 ยท View on GitHub
The code is coming soon.
MouSi: Poly-Visual-Expert Vision-Language Models
Multimodal large language model with integration of multiple vision experts