SOTAVerified

AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0)

A more reliable and balanced version of original MUSIC-AVQA benchmark for Audio-Visual Question Answering, proposed in the paper "Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering"

Papers

Showing 15 of 5 papers

TitleStatusHype
Learning to Answer Questions in Dynamic Audio-Visual ScenariosCode1
Meerkat: Audio-Visual Large Language Model for Grounding in Space and TimeCode1
Question-Aware Gaussian Experts for Audio-Visual Question AnsweringCode1
Vision Transformers are Parameter-Efficient Audio-Visual LearnersCode1
Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-AnsweringCode0
Show:102550

No leaderboard results yet.