SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1015110200 of 10817 papers

TitleStatusHype
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding0
SPBERTQA: A Two-Stage Question Answering System Based on Sentence Transformers for Medical Texts0
Spectral Graph-Based Method of Multimodal Word Embedding0
Speech Act Modeling of Written Asynchronous Conversations with Task-Specific Embeddings and Conditional Structured Models0
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering0
SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering0
Speech-Enabled Hybrid Multilingual Translation for Mobile Devices0
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models0
Speech Retrieval-Augmented Generation without Automatic Speech Recognition0
Speeding Up Question Answering Task of Language Models via Inverted Index0
SPFresh: Incremental In-Place Update for Billion-Scale Vector Search0
Sphere Neural-Networks for Rational Reasoning0
sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting0
Spinning Straw into Gold: Using Free Text to Train Monolingual Alignment Models for Non-factoid Question Answering0
SplatTalk: 3D VQA with Gaussian Splatting0
SplaXBERT: Leveraging Mixed Precision Training and Context Splitting for Question Answering0
Spoken Conversational Search for General Knowledge0
SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs0
Spoken question answering for visual queries0
Sports Intelligence: Assessing the Sports Understanding Capabilities of Language Models through Question Answering from Text to Video0
Spot the Odd Man Out: Exploring the Associative Power of Lexical Resources0
SPred: Large-scale Harvesting of Semantic Predicates0
SQALER: Scaling Question Answering by Decoupling Multi-Hop and Logical Reasoning0
SQUARE: Automatic Question Answering Evaluation using Multiple Positive and Negative References0
SQuARE: Semantics-based Question Answering and Reasoning Engine0
Squibs: What Is a Paraphrase?0
SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph0
sranjans : Semantic Textual Similarity using Maximal Weighted Bipartite Graph Matching0
SRDF: Extracting Lexical Knowledge Graph for Preserving Sentence Meaning0
SSP: Semantic Space Projection for Knowledge Graph Embedding with Text Descriptions0
Stable Code Technical Report0
Stacked Latent Attention for Multimodal Reasoning0
Stacking with Auxiliary Features for Visual Question Answering0
StackOverflowVQA: Stack Overflow Visual Question Answering Dataset0
STAMPsy: Towards SpatioTemporal-Aware Mixed-Type Dialogues for Psychological Counseling0
STAR: A Benchmark for Situated Reasoning in Real-World Videos0
Stars at Qur’an QA 2022: Building Automatic Extractive Question Answering Systems for the Holy Qur’an with Transformer Models and Releasing a New Dataset0
Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework0
Statistical Machine Translation Improves Question Retrieval in Community Question Answering via Matrix Factorization0
Statistical Script Learning with Recurrent Neural Networks0
Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks0
Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation0
SteLLA: A Structured Grading System Using LLMs with RAG0
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training0
Steps are all you need: Rethinking STEM Education with Prompt Engineering0
Steps to Knowledge Graphs Quality Assessment0
STL-CQA: Structure-based Transformers with Localization and Encoding for Chart Question Answering0
STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training0
Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks0
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization0
Show:102550
← PrevPage 204 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified