title: Visual Question Answering
type: community
group: Computer Vision
image: /static/templates/visual-question-answering.png
details: |
Answer the questions related to what you see on the picture
- Industry Applications
- accessibility tools, educational assessment, autonomous systems, robotics navigation, medical image analysis, surveillance intelligence, content moderation, e-commerce search, museum guide systems, smart home automation, industrial quality control, retail analytics
- Associated Models
- ViLBERT, LXMERT, VisualBERT, UNITER, BLIP, GPT-4V, Flamingo
- Domain Terminology
- multimodal reasoning, visual reasoning, scene understanding, visual intelligence, cross-modal fusion
config: |