SOTAVerified

Answer Generation

Papers

Showing 1-25 of 280 papers

Title | Status | Hype
Small Encoders Can Rival Large Decoders in Detecting Groundedness | Code | 0
GEMeX-ThinkVG: Towards Thinking with Visual Grounding in Medical VQA via Reinforcement Learning | - | 0
RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Code | 1
RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition | - | 0
FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design | - | 0
CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making | - | 0
LLM-Driven Personalized Answer Generation and Evaluation | - | 0
Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering | - | 0
TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning | Code | 2
CC-RAG: Structured Multi-Hop Reasoning via Theme-Based Causal Graphs | - | 0
A Survey on Large Language Models for Mathematical Reasoning | - | 0
ECoRAG: Evidentiality-guided Compression for Long Context RAG | Code | 1
CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG | - | 0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering | - | 0
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time | - | 0
DGRAG: Distributed Graph-based Retrieval-Augmented Generation in Edge-Cloud Systems | - | 0
The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems | - | 0
O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering | Code | 1
Grounding Chest X-Ray Visual Question Answering with Generated Radiology Reports | - | 0
BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law | - | 0
The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation | Code | 1
GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents | Code | 1
When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning | - | 0
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning | Code | 1
NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search | - | 0

No leaderboard results yet.