SOTAVerified

Answer Generation

Papers

Showing 150 of 280 papers

TitleStatusHype
Small Encoders Can Rival Large Decoders in Detecting GroundednessCode0
GEMeX-ThinkVG: Towards Thinking with Visual Grounding in Medical VQA via Reinforcement Learning0
RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition0
RMIT-ADM+S at the SIGIR 2025 LiveRAG ChallengeCode1
FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design0
CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making0
LLM-Driven Personalized Answer Generation and Evaluation0
TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document ReasoningCode2
Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering0
CC-RAG: Structured Multi-Hop Reasoning via Theme-Based Causal Graphs0
A Survey on Large Language Models for Mathematical Reasoning0
ECoRAG: Evidentiality-guided Compression for Long Context RAGCode1
CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering0
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time0
DGRAG: Distributed Graph-based Retrieval-Augmented Generation in Edge-Cloud Systems0
The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems0
O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question AnsweringCode1
Grounding Chest X-Ray Visual Question Answering with Generated Radiology Reports0
BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law0
GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI AgentsCode1
When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning0
The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval AugmentationCode1
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement LearningCode1
NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search0
KG-QAGen: A Knowledge-Graph-Based Framework for Systematic Question Generation and Long-Context LLM EvaluationCode0
RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR EnvironmentsCode0
Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG0
Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation PatchingCode0
MHTS: Multi-Hop Tree Structure Framework for Generating Difficulty-Controllable QA Datasets for RAG Evaluation0
A Retrieval-Augmented Knowledge Mining Method with Deep Thinking LLMs for Biomedical Research and Clinical Support0
The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction0
A Survey of Large Language Model Agents for Question Answering0
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning TransferCode1
RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning0
Conversational Gold: Evaluating Personalized Conversational Search System using Gold NuggetsCode0
MoEMoE: Question Guided Dense and Scalable Sparse Mixture-of-Expert for Multi-source Multi-modal Answering0
Zero-Shot Complex Question-Answering on Long Scientific DocumentsCode0
Enhancing Multi-hop Reasoning in Vision-Language Models via Self-Distillation with Multi-Prompt Ensembling0
EgoNormia: Benchmarking Physical Social Norm UnderstandingCode1
AgentRM: Enhancing Agent Generalization with Reward Modeling0
A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory TextsCode0
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines0
Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering0
TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition0
TrustRAG: An Information Assistant with Retrieval Augmented GenerationCode5
PeerQA: A Scientific Question Answering Dataset from Peer ReviewsCode1
QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval0
ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language ModelsCode0
Show:102550
← PrevPage 1 of 6Next →

No leaderboard results yet.