SOTAVerified

Form

Papers

Showing 150 of 1618 papers

TitleStatusHype
FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation0
Controlled Retrieval-augmented Context Evaluation for Long-form RAG0
FormGym: Doing Paperwork with Agents0
FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding0
Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks0
LLM Unlearning Should Be Form-Independent0
ARGUS: Hallucination and Omission Evaluation in Video-LLMs0
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement LearningCode0
Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation0
Fifteen Years of Child-Centered Long-Form Recordings: Promises, Resources, and Remaining Challenges to Validity0
SuperWriter: Reflection-Driven Long-Form Generation with Large Language ModelsCode1
Unpacking Let Alone: Human-Scale Models Generalize to a Rare Construction in Form but not Meaning0
Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths0
Brain-Like Processing Pathways Form in Models With Heterogeneous Experts0
Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists0
Self-supervised Latent Space Optimization with Nebula Variational Coding0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering0
Input-Power-to-State Stability of Time-Varying Systems0
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization0
HiCaM: A Hierarchical-Causal Modification Framework for Long-Form Text Modification0
NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation0
How Does Response Length Affect Long-Form FactualityCode0
GUST: Quantifying Free-Form Geometric Uncertainty of Metamaterials Using Small Data0
A Characterization of Reny's Weakly Sequentially Rational Equilibrium through -Perfect γ-Weakly Sequentially Rational Equilibrium0
MedScore: Factuality Evaluation of Free-Form Medical AnswersCode0
Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding0
Frankentext: Stitching random text fragments into long-form narrativesCode1
UNCLE: Uncertainty Expressions in Long-Form Generation0
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement LearningCode1
VeriFastScore: Speeding up long-form factuality evaluationCode0
Long-Form Information Alignment Evaluation Beyond Atomic FactsCode0
Generating Realistic Multi-Beat ECG Signals0
Representation of perceived prosodic similarity of conversational feedback0
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation0
Historical and psycholinguistic perspectives on morphological productivity: A sketch of an integrative approach0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documentsCode0
VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts0
Atomic Consistency Preference Optimization for Long-Form Question AnsweringCode0
Closed-Form Information Capacity of Canonical Signaling ModelsCode0
STORYANCHORS: Generating Consistent Multi-Scene Story Frames for Long-Form Narratives0
VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language ModelsCode0
Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information0
Binding threshold units with artificial oscillatory neuronsCode0
BLAB: Brutally Long Audio Bench0
Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat0
Scaling and shape of financial returns distributions modeled as conditionally independent random variables0
Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation0
Show:102550
← PrevPage 1 of 33Next →

No leaderboard results yet.