SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 48014850 of 10817 papers

TitleStatusHype
BinaryVQA: A Versatile Test Set to Evaluate the Out-of-Distribution Generalization of VQA ModelsCode0
ACL-Fig: A Dataset for Scientific Figure Classification0
Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation0
ThoughtSource: A central hub for large language model reasoning dataCode3
A Comparative Study of Pretrained Language Models for Long Clinical TextCode1
Graph Attention with Hierarchies for Multi-hop Question Answering0
Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering0
ViDeBERTa: A powerful pre-trained language model for VietnameseCode1
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language ModelsCode0
Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute0
Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction0
PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and DevelopmentCode2
HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial Images0
Ensemble Transfer Learning for Multilingual Coreference Resolution0
Champion Solution for the WSDM2023 Toloka VQA ChallengeCode3
Weakly-Supervised Questions for Zero-Shot Relation ExtractionCode0
Rationalization for Explainable NLP: A Survey0
Reversing The Twenty Questions Game0
Temporal Perceiving Video-Language Pre-training0
Towards Models that Can See and Read0
Curriculum Script Distillation for Multilingual Visual Question Answering0
SlideVQA: A Dataset for Document Visual Question Answering on Multiple ImagesCode1
Explaining ELH Concept Descriptions through Counterfactual Reasoning0
Multimodal Inverse Cloze Task for Knowledge-based Visual Question AnsweringCode1
Semantic Web Enabled Geographic Question Answering Framework: GeoTR0
Towards Answering Climate Questionnaires from Unstructured Climate ReportsCode0
Recommending Root-Cause and Mitigation Steps for Cloud Incidents using Large Language Models0
Language Models sounds the Death Knell of Knowledge Graphs0
There is No Big Brother or Small Brother: Knowledge Infusion in Language Models for Link Prediction and Question AnsweringCode0
MAQA: A Multimodal QA Benchmark for Negation0
Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-shot Logical Reasoning over TextCode1
A Brain-inspired Memory Transformation based Differentiable Neural Computer for Reasoning-based Question Answering0
Knowledge Reasoning via Jointly Modeling Knowledge Graphs and Soft Rules0
RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm0
Adaptively Clustering Neighbor Elements for Image-Text GenerationCode0
Topic Segmentation Model Focusing on Local Context0
Emotion-Cause Pair Extraction as Question Answering0
SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout GraphCode1
Learning Trajectory-Word Alignments for Video-Language Tasks0
PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora0
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-ShotCode4
Exploring Temporal Concurrency for Video-Language Representation LearningCode0
Variational Causal Inference Network for Explanatory Visual Question AnsweringCode1
PromptCap: Prompt-Guided Image Captioning for VQA with GPT-30
Knowledge Proxy Intervention for Deconfounded Video Question Answering0
Toward Multi-Granularity Decision-Making: Explicit Visual Reasoning with Hierarchical KnowledgeCode0
Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering0
IS-GGT: Iterative Scene Graph Generation With Generative Transformers0
From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models0
Exploring the Effect of Primitives for Compositional Generalization in Vision-and-LanguageCode0
Show:102550
← PrevPage 97 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified