SOTAVerified

Long Form Question Answering

Long-form question answering is a task requiring elaborate and in-depth answers to open-ended questions.

Papers

Showing 150 of 61 papers

TitleStatusHype
GenerationPrograms: Fine-grained Attribution with Executable ProgramsCode0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering0
Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs0
Atomic Consistency Preference Optimization for Long-Form Question AnsweringCode0
An Empirical Study of Evaluating Long-form Question AnsweringCode0
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration0
Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution0
On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation SystemsCode0
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the WildCode0
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language ModelsCode0
Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization0
To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation0
A Claim Decomposition Benchmark for Long-form Answer VerificationCode0
Retrieving Contextual Information for Long-Form Question Answering using Weak Supervision0
CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations0
Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian PhilosophyCode0
Putting People in LLMs' Shoes: Generating Better Answers via Question RewriterCode0
Localizing and Mitigating Errors in Long-form Question AnsweringCode0
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language ModelsCode2
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation0
CaLMQA: Exploring culturally specific long-form question answering across 23 languagesCode0
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering0
OLAPH: Improving Factuality in Biomedical Long-form Question AnsweringCode1
FinTextQA: A Dataset for Long-form Financial Question Answering0
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study0
Learning to Plan and Generate Text with Citations0
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systemsCode1
Attribute First, then Generate: Locally-attributable Grounded Text GenerationCode1
Multi-Review Fusion-in-Context0
ALaRM: Align Language Models via Hierarchical Rewards ModelingCode1
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking TechniquesCode2
Genie: Achieving Human Parity in Content-Grounded Datasets Generation0
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement LearningCode0
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback0
Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach0
SEMQA: Semi-Extractive Multi-Source Question AnsweringCode1
Adapting Pre-trained Generative Models for Extractive Question Answering0
PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering0
Understanding Retrieval Augmentation for Long-Form Question Answering0
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference OptimizationCode1
A Novel Computational and Modeling Foundation for Automatic Coherence Assessment0
Investigating Answerability of LLMs for Long-Form Question Answering0
Fine-Grained Human Feedback Gives Better Rewards for Language Model TrainingCode2
Concise Answers to Complex Questions: Summarization of Long-form AnswersCode0
A Critical Evaluation of Evaluations for Long-form Question AnsweringCode1
Revisiting Sentence Union Generation as a Testbed for Text ConsolidationCode0
WebCPM: Interactive Web Search for Chinese Long-form Question AnsweringCode2
Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive TasksCode1
LongForm: Effective Instruction Tuning with Reverse InstructionsCode2
Generative Long-form Question Answering: Relevance, Faithfulness and Succinctness0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.