SOTAVerified

valid

Papers

Showing 2650 of 3589 papers

TitleStatusHype
Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMsCode0
Policy-Based Trajectory Clustering in Offline Reinforcement Learning0
Employing self-supervised learning models for cross-linguistic child speech maturity classificationCode0
Asymptotic Normality of Infinite Centered Random Forests -Application to Imbalanced Classification0
Language Models over Canonical Byte-Pair Encodings0
PhysiInter: Integrating Physical Mapping for High-Fidelity Human Interaction Generation0
Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework0
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists0
Can LLMs Generate Reliable Test Case Generators? A Study on Competition-Level Programming Problems0
Inference on the value of a linear program0
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive RandomizationCode0
Speech Neurophysiology in Realistic Contexts: Big Hype or Big Leap?0
Does It Make Sense to Speak of Introspection in Large Language Models?0
DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation0
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience0
SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL0
Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages0
Behavioral Augmentation of UML Class Diagrams: An Empirical Study of Large Language Models for Method GenerationCode0
Quantization-based Bounds on the Wasserstein Metric0
Clinical Annotations for Automatic Stuttering Severity AssessmentCode0
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs0
Conformal Object Detection by Sequential Risk Control0
Generalizability vs. Counterfactual Explainability Trade-Off0
Maximum Likelihood Learning of Latent Dynamics Without Reconstruction0
Stable Thompson Sampling: Valid Inference via Variance Inflation0
Show:102550
← PrevPage 2 of 144Next →

No leaderboard results yet.