SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 64516500 of 10817 papers

TitleStatusHype
How Generative-AI can be Effectively used in Government Chatbots0
A Survey on Legal Question Answering Systems0
How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?0
Modelling Long-distance Node Relations for KBQA with Global Dynamic Graph0
Modelling the Semantics of Adjectives in the Ontology-Lexicon Interface0
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants0
Contextualized Query Embeddings for Conversational Search0
Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models0
Modern Question Answering Datasets and Benchmarks: A Survey0
Modular Blended Attention Network for Video Question Answering0
A Multi-Stage Memory Augmented Neural Network for Machine Reading Comprehension0
Modular Graph Attention Network for Complex Visual Relational Reasoning0
Addressing Hallucinations with RAG and NMISS in Italian Healthcare LLM Chatbots0
Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge0
How do QA models combine knowledge from LM and 100 passages?0
How do Negation and Modality Impact on Opinions?0
Contextualized Embeddings based Convolutional Neural Networks for Duplicate Question Identification0
A Survey on Large Language Models with some Insights on their Capabilities and Limitations0
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications0
How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations0
MOKA: Open-World Robotic Manipulation through Mark-Based Visual Prompting0
Bridging the Gap Between Information Seeking and Product Search Systems: Q&A Recommendation for E-commerce0
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering0
Mondrian: Prompt Abstraction Attack Against Large Language Models for Cheaper API Pricing0
Mongolian Named Entity Recognition System with Rich Features0
Mongolian Questions Classification Based on Mulit-Head Attention0
Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models0
Contextual Evaluation of Large Language Models for Classifying Tropical and Infectious Diseases0
A Survey on Knowledge-Oriented Retrieval-Augmented Generation0
How Context Affects Language Models' Factual Predictions0
Monolingual Social Media Datasets for Detecting Contradiction and Entailment0
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation0
A Multi-Source Retrieval Question Answering Framework Based on RAG0
How Can Objects Help Video-Language Understanding?0
Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling0
A Survey on Knowledge Graph Embeddings with Literals: Which model links better Literal-ly?0
Accounting for Focus Ambiguity in Visual Questions0
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering0
Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing0
Morpho-syntactic Lexical Generalization for CCG Semantic Parsing0
Morpho-Syntactic Study of Errors from Speech Recognition System0
A Survey on Graph Neural Networks for Knowledge Graph Completion0
MoRS at SemEval-2017 Task 3: Easy to use SVM in Ranking Tasks0
MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles0
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision0
MOSMOS: Multi-organ segmentation facilitated by medical report supervision0
Motion-Appearance Co-Memory Networks for Video Question Answering0
HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions0
Contextual Code Switching for Machine Translation using Language Models0
A Multi-Resolution Word Embedding for Document Retrieval from Large Unstructured Knowledge Bases0
Show:102550
← PrevPage 130 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified