SOTAVerified

valid

Papers

Showing 5175 of 3589 papers

TitleStatusHype
What Has Been Lost with Synthetic Evaluation?0
Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language ModelsCode0
STACI: Spatio-Temporal Aleatoric Conformal Inference0
PrivATE: Differentially Private Confidence Intervals for Average Treatment Effects0
Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners0
On the Robustness of RSMA to Adversarial BD-RIS-Induced Interference0
Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach0
HomeBench: Evaluating LLMs in Smart Homes with Valid and Invalid Instructions Across Single and Multiple DevicesCode0
We Need to Measure Data Diversity in NLP -- Better and Broader0
PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation0
Optimal Conformal Prediction under Epistemic UncertaintyCode0
NTIRE 2025 Challenge on Video Quality Enhancement for Video Conferencing: Datasets, Methods and ResultsCode0
Efficient Long CoT Reasoning in Small Language Models0
MedScore: Factuality Evaluation of Free-Form Medical AnswersCode0
Graph Style Transfer for Counterfactual ExplainabilityCode0
Flexible MOF Generation with Torsion-Aware Flow Matching0
Anytime-valid, Bayes-assisted,Prediction-Powered Inference0
Efficient Adaptive Experimentation with Non-ComplianceCode0
Applications of Modular Co-Design for De Novo 3D Molecule Generation0
Effects of auditory distance cues and reverberation on spatial perception and listening strategiesCode0
Statistical Inference for Online AlgorithmsCode0
MuseRAG: Idea Originality Scoring At ScaleCode0
A collaborative constrained graph diffusion model for the generation of realistic synthetic moleculesCode0
Statistical Test for Saliency Maps of Graph Neural Networks via Selective Inference0
Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack0
Show:102550
← PrevPage 3 of 144Next →

No leaderboard results yet.