SOTAVerified

Long Form Question Answering

Long-form question answering is a task requiring elaborate and in-depth answers to open-ended questions.

Papers

Showing 125 of 61 papers

TitleStatusHype
Fine-Grained Human Feedback Gives Better Rewards for Language Model TrainingCode2
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking TechniquesCode2
LongForm: Effective Instruction Tuning with Reverse InstructionsCode2
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language ModelsCode2
WebCPM: Interactive Web Search for Chinese Long-form Question AnsweringCode2
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systemsCode1
OLAPH: Improving Factuality in Biomedical Long-form Question AnsweringCode1
A Critical Evaluation of Evaluations for Long-form Question AnsweringCode1
Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive TasksCode1
Attribute First, then Generate: Locally-attributable Grounded Text GenerationCode1
SEMQA: Semi-Extractive Multi-Source Question AnsweringCode1
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference OptimizationCode1
ELI5: Long Form Question AnsweringCode1
Hurdles to Progress in Long-form Question AnsweringCode1
D2S: Document-to-Slide Generation Via Query-Based Text SummarizationCode1
Controllable Generation from Pre-trained Language Models via Inverse PromptingCode1
ALaRM: Align Language Models via Hierarchical Rewards ModelingCode1
Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization0
Adapting Pre-trained Generative Models for Extractive Question Answering0
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation0
How State-Of-The-Art Models Can Deal With Long-Form Question Answering0
Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs0
CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations0
Generative Long-form Question Answering: Relevance, Faithfulness and Succinctness0
Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.