SOTAVerified

valid

Papers

Showing 451500 of 3589 papers

TitleStatusHype
The 4th Dimension for Scaling Model Size0
Broad Validity of the First-Order Approach in Moral Hazard0
Auto-Regressive Surface Cutting0
Symbolic Reduction for Formal Synthesis of Global Lyapunov Functions0
Identifying economic narratives in large text corpora -- An integrated approach using Large Language Models0
Performative Validity of Recourse Explanations0
Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation0
S^4C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models0
Reimagining Target-Aware Molecular Generation through Retrieval-Enhanced Aligned Diffusion0
On the relationship between prediction intervals, tests of sharp nulls and inference on realized treatment effects in settings with few treated units0
HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance0
Geometric Jensen-Shannon Divergence Between Gaussian Measures On Hilbert Space0
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?0
General Reference Frame Identification and Transformation in Unbalanced Power Systems0
Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMsCode0
Generalizing Supervised Contrastive learning: A Projection Perspective0
Vector Representations of Vessel Trees0
Employing self-supervised learning models for cross-linguistic child speech maturity classificationCode0
Asymptotic Normality of Infinite Centered Random Forests -Application to Imbalanced Classification0
Policy-Based Trajectory Clustering in Offline Reinforcement Learning0
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists0
Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework0
PhysiInter: Integrating Physical Mapping for High-Fidelity Human Interaction Generation0
Language Models over Canonical Byte-Pair Encodings0
Can LLMs Generate Reliable Test Case Generators? A Study on Competition-Level Programming Problems0
Inference on the value of a linear program0
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive RandomizationCode0
Speech Neurophysiology in Realistic Contexts: Big Hype or Big Leap?0
Does It Make Sense to Speak of Introspection in Large Language Models?0
SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL0
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience0
DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation0
Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages0
Quantization-based Bounds on the Wasserstein Metric0
Behavioral Augmentation of UML Class Diagrams: An Empirical Study of Large Language Models for Method GenerationCode0
Clinical Annotations for Automatic Stuttering Severity AssessmentCode0
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs0
Stable Thompson Sampling: Valid Inference via Variance Inflation0
Conformal Object Detection by Sequential Risk Control0
Generalizability vs. Counterfactual Explainability Trade-Off0
Maximum Likelihood Learning of Latent Dynamics Without Reconstruction0
What Has Been Lost with Synthetic Evaluation?0
Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language ModelsCode0
STACI: Spatio-Temporal Aleatoric Conformal Inference0
PrivATE: Differentially Private Confidence Intervals for Average Treatment Effects0
On the Robustness of RSMA to Adversarial BD-RIS-Induced Interference0
We Need to Measure Data Diversity in NLP -- Better and Broader0
Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach0
PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation0
Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners0
Show:102550
← PrevPage 10 of 72Next →

No leaderboard results yet.