SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
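
As a rough illustration of the two metrics named above, here is a minimal Python sketch of exact match and token-level F1 for a single prediction and reference pair. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace). The function names are illustrative and not taken from any benchmark's official scorer, and individual benchmarks may normalize or aggregate scores differently.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for
    SQuAD-style scoring; other benchmarks may differ.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower."))
    print(f1_score("in Paris France", "Paris"))
```

In this sketch, a prediction that differs from the reference only in case, articles, or punctuation still counts as an exact match, while a prediction that contains the reference plus extra tokens gets EM of 0 but partial F1 credit. Leaderboards usually report the average of these per-question scores, often taking the maximum over several reference answers.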


Papers

Showing 4351–4400 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering | | 0 |
| Detecting Ongoing Events Using Contextual Word and Sentence Embeddings | | 0 |
| DHP Benchmark: Are LLMs Good NLG Evaluators? | | 0 |
| DGRAG: Distributed Graph-based Retrieval-Augmented Generation in Edge-Cloud Systems | | 0 |
| Barriers in Integrating Medical Visual Question Answering into Radiology Workflows: A Scoping Review and Clinicians' Insights | | 0 |
| A Neural Question Answering Model Based on Semi-Structured Tables | | 0 |
| Development of Hybrid Algorithm for Automatic Extraction of Multiword Expressions from Monolingual and Parallel Corpus of English and Punjabi | | 0 |
| A Neural Network Model for Low-Resource Universal Dependency Parsing | | 0 |
| Developing Question-Answering Models in Low-Resource Languages: A Case Study on Turkish Medical Texts Using Transformer-Based Approaches | | 0 |
| Developing ML-based Systems to Extract Medical Information from Japanese Medical History Summaries | | 0 |
| Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses | | 0 |
| A Competence-aware Curriculum for Visual Concepts Learning via Question Answering | | 0 |
| Improving Differentiable Neural Computers Through Memory Masking, De-allocation, and Link Distribution Sharpness Control | | 0 |
| Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering | | 0 |
| Developing a How-to Tip Machine Comprehension Dataset and its Evaluation in Machine Comprehension by BERT | | 0 |
| BanglaQuAD: A Bengali Open-domain Question Answering Dataset | | 0 |
| Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods | | 0 |
| Detours for Navigating Instructional Videos | | 0 |
| A Neural Network for Factoid Question Answering over Paragraphs | | 0 |
| REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models | | 0 |
| Determining is-a relationships for Textual Entailment | | 0 |
| Determining Event Durations: Models and Error Analysis | | 0 |
| Detect, Retrieve, Comprehend: A Flexible Framework for Zero-Shot Document-Level Question Answering | | 0 |
| Detection-based Intermediate Supervision for Visual Question Answering | | 0 |
| BanditRank: Learning to Rank Using Contextual Bandits | | 0 |
| 3D Concept Grounding on Neural Fields | | 0 |
| Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning | | 0 |
| A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections | | 0 |
| Detecting Subevent Structure for Event Coreference Resolution | | 0 |
| Towards Understanding Domain Adapted Sentence Embeddings for Document Retrieval | | 0 |
| Detecting Spammers in Community Question Answering | | 0 |
| Detecting Semantically Equivalent Questions in Online User Forums | | 0 |
| Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization | | 0 |
| A Neural Method for Goal-Oriented Dialog Systems to interact with Named Entities | | 0 |
| Detecting Bot-Answerable Questions in Ubuntu Chat | | 0 |
| Balancing Performance and Efficiency in Zero-shot Robotic Navigation | | 0 |
| Improving Conditioning in Context-Aware Sequence to Sequence Models | | 0 |
| Improving Cross-lingual Representation for Semantic Retrieval with Code-switching | | 0 |
| Improving Data Augmentation for Robust Visual Question Answering with Effective Curriculum Learning | | 0 |
| Detecting Asymmetric Semantic Relations in Context: A Case-Study on Hypernymy Detection | | 0 |
| Detecting and Resolving Shell Nouns in German | | 0 |
| Detecting and Evaluating Medical Hallucinations in Large Vision Language Models | | 0 |
| Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE | | 0 |
| A neural document language modeling framework for spoken document retrieval | | 0 |
| Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation | | 0 |
| Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs | | 0 |
| A Neural Comprehensive Ranker (NCR) for Open-Domain Question Answering | | 0 |
| Despite "super-human" performance, current LLMs are unsuited for decisions about ethics and safety | | 0 |
| Bactrainus: Optimizing Large Language Models for Multi-hop Complex Question Answering Tasks | | 0 |
| Adversarial Attacks Beyond the Image Space | | 0 |

Benchmark Results

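| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |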