SOTAVerified

Form

Papers

Showing 201-225 of 1618 papers

Title | Status | Hype
Brain-Like Processing Pathways Form in Models With Heterogeneous Experts | | 0
Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs | | 0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents | | 0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists | | 0
Self-supervised Latent Space Optimization with Nebula Variational Coding | | 0
Input-Power-to-State Stability of Time-Varying Systems | | 0
NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization | | 0
HiCaM: A Hierarchical-Causal Modification Framework for Long-Form Text Modification | | 0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering | | 0
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization | | 0
How Does Response Length Affect Long-Form Factuality | Code | 0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation | | 0
GUST: Quantifying Free-Form Geometric Uncertainty of Metamaterials Using Small Data | | 0
A Characterization of Reny's Weakly Sequentially Rational Equilibrium through -Perfect γ-Weakly Sequentially Rational Equilibrium | | 0
MedScore: Factuality Evaluation of Free-Form Medical Answers | Code | 0
Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | | 0
UNCLE: Uncertainty Expressions in Long-Form Generation | | 0
VeriFastScore: Speeding up long-form factuality evaluation | Code | 0
Long-Form Information Alignment Evaluation Beyond Atomic Facts | Code | 0
Representation of perceived prosodic similarity of conversational feedback | | 0
Generating Realistic Multi-Beat ECG Signals | | 0
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation | | 0
Historical and psycholinguistic perspectives on morphological productivity: A sketch of an integrative approach | | 0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | Code | 0
Atomic Consistency Preference Optimization for Long-Form Question Answering | Code | 0
Page 9 of 65

No leaderboard results yet.