SOTAVerified

valid

Papers

Showing 150 of 3589 papers

TitleStatusHype
A Directed Lazy Random Walk Model to Three-Way Dynamic Matching Problem0
Fast and Simplex: 2-Simplicial Attention in Triton0
Potemkin Understanding in Large Language Models0
Model-Based Real-Time Pose and Sag Estimation of Overhead Power Lines Using LiDAR for Drone InspectionCode0
The kernel of graph indices for vector searchCode0
Valid Selection among Conformal Sets0
A Sharp and Robust Test for Selective Reporting0
PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket ConditioningCode2
Broad Validity of the First-Order Approach in Moral Hazard0
The 4th Dimension for Scaling Model Size0
Symbolic Reduction for Formal Synthesis of Global Lyapunov Functions0
Auto-Regressive Surface Cutting0
Identifying economic narratives in large text corpora -- An integrated approach using Large Language Models0
Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation0
Performative Validity of Recourse Explanations0
On the relationship between prediction intervals, tests of sharp nulls and inference on realized treatment effects in settings with few treated units0
S^4C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models0
Reimagining Target-Aware Molecular Generation through Retrieval-Enhanced Aligned Diffusion0
HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance0
General Reference Frame Identification and Transformation in Unbalanced Power Systems0
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation BenchmarksCode2
Geometric Jensen-Shannon Divergence Between Gaussian Measures On Hilbert Space0
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?0
Vector Representations of Vessel Trees0
Generalizing Supervised Contrastive learning: A Projection Perspective0
Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMsCode0
Policy-Based Trajectory Clustering in Offline Reinforcement Learning0
Employing self-supervised learning models for cross-linguistic child speech maturity classificationCode0
Asymptotic Normality of Infinite Centered Random Forests -Application to Imbalanced Classification0
Language Models over Canonical Byte-Pair Encodings0
PhysiInter: Integrating Physical Mapping for High-Fidelity Human Interaction Generation0
Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework0
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists0
Can LLMs Generate Reliable Test Case Generators? A Study on Competition-Level Programming Problems0
Inference on the value of a linear program0
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive RandomizationCode0
Speech Neurophysiology in Realistic Contexts: Big Hype or Big Leap?0
Does It Make Sense to Speak of Introspection in Large Language Models?0
DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation0
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience0
SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL0
Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages0
Behavioral Augmentation of UML Class Diagrams: An Empirical Study of Large Language Models for Method GenerationCode0
Quantization-based Bounds on the Wasserstein Metric0
Clinical Annotations for Automatic Stuttering Severity AssessmentCode0
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs0
Conformal Object Detection by Sequential Risk Control0
Generalizability vs. Counterfactual Explainability Trade-Off0
Maximum Likelihood Learning of Latent Dynamics Without Reconstruction0
Stable Thompson Sampling: Valid Inference via Variance Inflation0
Show:102550
← PrevPage 1 of 72Next →

No leaderboard results yet.