SOTAVerified

Medical Visual Question Answering

Papers

Showing 5197 of 97 papers

TitleStatusHype
Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models0
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering0
Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation0
Free Form Medical Visual Question Answering in Radiology0
Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering0
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis0
GEMeX-ThinkVG: Towards Thinking with Visual Grounding in Medical VQA via Reinforcement Learning0
Grounded Knowledge-Enhanced Medical VLP for Chest X-Ray0
Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion0
Interpretable Medical Image Visual Question Answering via Multi-Modal Relationship Graph Learning0
Leveraging Medical Visual Question Answering with Supporting Facts0
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound0
Med-2E3: A 2D-Enhanced 3D Medical Multimodal Large Language Model0
Medical Visual Question Answering: A Survey0
Medical visual question answering using joint self-supervised learning0
MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility0
MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale0
MF2-MVQA: A Multi-stage Feature Fusion method for Medical Visual Question Answering0
MuVAM: A Multi-View Attention-based Model for Medical Visual Question Answering0
OVQA: A Clinically Generated Visual Question Answering Dataset0
Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models0
Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis0
UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering0
UnICLAM:Contrastive Representation Learning with Adversarial Masking for Unified and Interpretable Medical Vision Question Answering0
Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering0
Visual Question Answering in the Medical Domain0
V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL0
WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models0
Which Client is Reliable?: A Reliable and Personalized Prompt-based Federated Learning for Medical Image Question Answering0
Does CLIP Benefit Visual Question Answering in the Medical Domain as Much as it Does in the General Domain?0
Prompt-based Personalized Federated Learning for Medical Visual Question Answering0
Q2ATransformer: Improving Medical VQA via an Answer Querying Decoder0
RepsNet: Combining Vision with Language for Automated Medical Reports0
R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest0
SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning0
Structure Causal Models and LLMs Integration in Medical Visual Question Answering0
TM-PATHVQA:90000+ Textless Multilingual Questions for Medical Visual Question Answering0
Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next ParadigmCode0
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal EndoscopyCode0
FEDMEKI: A Benchmark for Scaling Medical Foundation Models via Federated Knowledge InjectionCode0
A Dual-Attention Learning Network with Word and Sentence Embedding for Medical Visual Question AnsweringCode0
Kvasir-VQA: A Text-Image Pair GI Tract DatasetCode0
ClinKD: Cross-Modal Clinical Knowledge Distiller For Multi-Task Medical ImagesCode0
Targeted Visual Prompting for Medical Visual Question AnsweringCode0
Hierarchical Deep Multi-modal Network for Medical Visual Question AnsweringCode0
Hallucination Benchmark in Medical Visual Question AnsweringCode0
ViLMedic: a framework for research at the intersection of vision and language in medical AICode0
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.