Form

Papers

Showing 1–25 of 1618 papers

Title	Date	Tasks	Status	Hype
FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation	Jul 11, 2025	Audio Generation, Data Augmentation	Unverified	0
Controlled Retrieval-augmented Context Evaluation for Long-form RAG	Jun 24, 2025	Diagnostic, Form	Unverified	0
FormGym: Doing Paperwork with Agents	Jun 17, 2025	Form, Information Retrieval	Unverified	0
Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks	Jun 16, 2025	Form, Math	Unverified	0
FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding	Jun 16, 2025	Form, Graph Generation	Unverified	0
LLM Unlearning Should Be Form-Independent	Jun 9, 2025	Form, Large Language Model	Unverified	0
ARGUS: Hallucination and Omission Evaluation in Video-LLMs	Jun 9, 2025	Descriptive, Form	Unverified	0
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning	Jun 6, 2025	Form, Scheduling	Code Available	0
Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation	Jun 5, 2025	Depth Estimation, Form	Unverified	0
Fifteen Years of Child-Centered Long-Form Recordings: Promises, Resources, and Remaining Challenges to Validity	Jun 4, 2025	Form	Unverified	0
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models	Jun 4, 2025	Form, Text Generation	Code Available	1
Unpacking Let Alone: Human-Scale Models Generalize to a Rare Construction in Form but not Meaning	Jun 4, 2025	Form	Unverified	0
Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths	Jun 3, 2025	Form, Specificity	Unverified	0
Brain-Like Processing Pathways Form in Models With Heterogeneous Experts	Jun 3, 2025	Form, Mixture-of-Experts	Unverified	0
Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs	Jun 3, 2025	Form, Script Generation	Unverified	0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents	Jun 2, 2025	Benchmarking, Form	Unverified	0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists	Jun 2, 2025	Benchmarking, Form	Unverified	0
Self-supervised Latent Space Optimization with Nebula Variational Coding	Jun 2, 2025	Form, Metric Learning	Unverified	0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering	May 30, 2025	Answer Generation, Form	Unverified	0
Input-Power-to-State Stability of Time-Varying Systems	May 30, 2025	Form	Unverified	0
HiCaM: A Hierarchical-Causal Modification Framework for Long-Form Text Modification	May 30, 2025	Form	Unverified	0
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization	May 30, 2025	Form, Language Modeling	Unverified	0
NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization	May 30, 2025	Descriptive, Form	Unverified	0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation	May 29, 2025	Form, Hallucination	Unverified	0
How Does Response Length Affect Long-Form Factuality	May 29, 2025	Form, Text Generation	Code Available	0

