SOTAVerified

Form

Papers

Showing 125 of 1618 papers

TitleStatusHype
FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation0
Controlled Retrieval-augmented Context Evaluation for Long-form RAG0
FormGym: Doing Paperwork with Agents0
Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks0
FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding0
LLM Unlearning Should Be Form-Independent0
ARGUS: Hallucination and Omission Evaluation in Video-LLMs0
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement LearningCode0
Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation0
Fifteen Years of Child-Centered Long-Form Recordings: Promises, Resources, and Remaining Challenges to Validity0
SuperWriter: Reflection-Driven Long-Form Generation with Large Language ModelsCode1
Unpacking Let Alone: Human-Scale Models Generalize to a Rare Construction in Form but not Meaning0
Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths0
Brain-Like Processing Pathways Form in Models With Heterogeneous Experts0
Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists0
Self-supervised Latent Space Optimization with Nebula Variational Coding0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering0
Input-Power-to-State Stability of Time-Varying Systems0
HiCaM: A Hierarchical-Causal Modification Framework for Long-Form Text Modification0
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization0
NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation0
How Does Response Length Affect Long-Form FactualityCode0
Show:102550
← PrevPage 1 of 65Next →

No leaderboard results yet.