SOTAVerified

Form

Papers

Showing 201-250 of 1618 papers

Title | Status | Hype
Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths | - | 0
Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs | - | 0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents | - | 0
Self-supervised Latent Space Optimization with Nebula Variational Coding | - | 0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists | - | 0
HiCaM: A Hierarchical-Causal Modification Framework for Long-Form Text Modification | - | 0
Input-Power-to-State Stability of Time-Varying Systems | - | 0
LaMP-QA: A Benchmark for Personalized Long-form Question Answering | - | 0
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization | - | 0
NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization | - | 0
How Does Response Length Affect Long-Form Factuality | Code | 0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation | - | 0
GUST: Quantifying Free-Form Geometric Uncertainty of Metamaterials Using Small Data | - | 0
A Characterization of Reny's Weakly Sequentially Rational Equilibrium through ε-Perfect γ-Weakly Sequentially Rational Equilibrium | - | 0
MedScore: Factuality Evaluation of Free-Form Medical Answers | Code | 0
Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | - | 0
UNCLE: Uncertainty Expressions in Long-Form Generation | - | 0
VeriFastScore: Speeding up long-form factuality evaluation | Code | 0
Long-Form Information Alignment Evaluation Beyond Atomic Facts | Code | 0
Generating Realistic Multi-Beat ECG Signals | - | 0
Representation of perceived prosodic similarity of conversational feedback | - | 0
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation | - | 0
Historical and psycholinguistic perspectives on morphological productivity: A sketch of an integrative approach | - | 0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | Code | 0
VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts | - | 0
Atomic Consistency Preference Optimization for Long-Form Question Answering | Code | 0
STORYANCHORS: Generating Consistent Multi-Scene Story Frames for Long-Form Narratives | - | 0
VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models | Code | 0
Closed-Form Information Capacity of Canonical Signaling Models | Code | 0
Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information | - | 0
Binding threshold units with artificial oscillatory neurons | Code | 0
BLAB: Brutally Long Audio Bench | - | 0
Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat | - | 0
Scaling and shape of financial returns distributions modeled as conditionally independent random variables | - | 0
A Dictionary of Closed-Form Kernel Mean Embeddings | Code | 0
Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation | - | 0
An Empirical Study of Evaluating Long-form Question Answering | Code | 0
Axiomatic Equilibrium Selection: The Case of Generic Extensive Form Games | - | 0
Optimal Procurement Design: A Reduced Form Approach | - | 0
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL | - | 0
Compton Form Factor Extraction using Quantum Deep Neural Networks | - | 0
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: KwaiSR Dataset and Study | - | 0
Dominated Actions in Imperfect-Information Games | - | 0
Density Approximation of Affine Jump Diffusions via Closed-Form Moment Matching | - | 0
Optimal Bayesian Affine Estimator and Active Learning for the Wiener Model | Code | 0
Rotation Invariance in Floor Plan Digitization using Zernike Moments | - | 0
Recent Advances in Real-Time Models for UWB Transmission Systems | - | 0
Gaussian Process Tilted Nonparametric Density Estimation using Fisher Divergence Score Matching | - | 0
SemEval-2025 Task 4: Unlearning sensitive content from Large Language Models | - | 0
A Characterization of Nash Equilibrium in Behavioral Strategies through Local Sequential Rationality | - | 0

No leaderboard results yet.