SOTAVerified

Long Form Question Answering

Long-form question answering is a task requiring elaborate and in-depth answers to open-ended questions.

Papers

Showing 125 of 61 papers

TitleStatusHype
GenerationPrograms: Fine-grained Attribution with Executable ProgramsCode0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering0
Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs0
Atomic Consistency Preference Optimization for Long-Form Question AnsweringCode0
An Empirical Study of Evaluating Long-form Question AnsweringCode0
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration0
Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution0
On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation SystemsCode0
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the WildCode0
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language ModelsCode0
Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization0
To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation0
A Claim Decomposition Benchmark for Long-form Answer VerificationCode0
Retrieving Contextual Information for Long-Form Question Answering using Weak Supervision0
CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations0
Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian PhilosophyCode0
Putting People in LLMs' Shoes: Generating Better Answers via Question RewriterCode0
Localizing and Mitigating Errors in Long-form Question AnsweringCode0
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language ModelsCode2
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation0
CaLMQA: Exploring culturally specific long-form question answering across 23 languagesCode0
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering0
OLAPH: Improving Factuality in Biomedical Long-form Question AnsweringCode1
FinTextQA: A Dataset for Long-form Financial Question Answering0
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.