SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 37013750 of 10817 papers

TitleStatusHype
A Technical Question Answering System with Transfer LearningCode0
ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation FusionCode0
A Deep Architecture for Semantic Matching with Multiple Positional Sentence RepresentationsCode0
Match-Prompt: Improving Multi-task Generalization Ability for Neural Text Matching via Prompt LearningCode0
Improving Question Answering with External KnowledgeCode0
Improving Machine Reading Comprehension with General Reading StrategiesCode0
Conversational BrowsingCode0
Combining Data Generation and Active Learning for Low-Resource Question AnsweringCode0
Improving language models by retrieving from trillions of tokensCode0
Accurate and Nuanced Open-QA Evaluation Through Textual EntailmentCode0
An Adaptive Framework for Generating Systematic Explanatory Answer in Online Q&A PlatformsCode0
SemEval-2019 Task 8: Fact Checking in Community Question Answering ForumsCode0
Interactive Text Ranking with Bayesian Optimisation: A Case Study on Community QA and SummarisationCode0
KEPR: Knowledge Enhancement and Plausibility Ranking for Generative Commonsense Question AnsweringCode0
Improving Consistency in Large Language Models through Chain of GuidanceCode0
Improving Complex Knowledge Base Question Answering via Question-to-Action and Question-to-Question AlignmentCode0
Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted LearningCode0
Addressing Semantic Drift in Question Generation for Semi-Supervised Question AnsweringCode0
A BERT Baseline for the Natural QuestionsCode0
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible AdapterCode0
Improved Word Sense Disambiguation Using Pre-Trained Contextualized Word RepresentationsCode0
Contrastive Representation Learning for Conversational Question Answering over Knowledge GraphsCode0
Improved RAMEN: Towards Domain Generalization for Visual Question AnsweringCode0
Improve Query Focused Abstractive Summarization by Incorporating Answer RelevanceCode0
Characterizing the Efficiency vs. Accuracy Trade-off for Long-Context NLP ModelsCode0
Improving Differentiable Neural Computers Through Memory Masking, De-allocation, and Link Distribution Sharpness ControlCode0
Awakening Augmented Generation: Learning to Awaken Internal Knowledge of Large Language Models for Question AnsweringCode0
Contrastive Learning for Task-Independent SpeechLLM-PretrainingCode0
Imitation Learning of Agenda-based Semantic ParsersCode0
TANDA: Transfer and Adapt Pre-Trained Transformer Models for Answer Sentence SelectionCode0
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question AnsweringCode0
TAPE: Assessing Few-shot Russian Language UnderstandingCode0
Image Question Answering using Convolutional Neural Network with Dynamic Parameter PredictionCode0
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question AnsweringCode0
Attacking Open-domain Question Answering by Injecting MisinformationCode0
AMUSE: Multilingual Semantic Parsing for Question Answering over Linked DataCode0
Image Content Generation with Causal ReasoningCode0
IIU: Independent Inference Units for Knowledge-based Visual Question AnsweringCode0
ILLUME: Rationalizing Vision-Language Models through Human InteractionsCode0
Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation TasksCode0
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Question Answering and Implicit Dialogue IdentificationCode0
Extractive Summarization with SWAP-NET: Sentences and Words from Alternating Pointer NetworksCode0
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual IllusionsCode0
If the Sources Could Talk: Evaluating Large Language Models for Research Assistance in HistoryCode0
Continual VQA for Disaster Response SystemsCode0
Identifying Unclear Questions in Community Question Answering WebsitesCode0
A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete ReasoningCode0
Idiom Paraphrases: Seventh Heaven vs Cloud NineCode0
II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question AnsweringCode0
IMAD: IMage-Augmented multi-modal DialogueCode0
Show:102550
← PrevPage 75 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified