ComfyUI-VisualQueryTemplate

August 28, 2024 ยท View on GitHub

A ComfyUI node for transforming images into descriptive text using templated visual question answering. Leverages Hugging Face's VQA models with transformers

Screenshot 2024-08-28 144142

image

image