SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8726-8750 of 177340 papers

Title | Status | Hype
NGBoost: Natural Gradient Boosting for Probabilistic Prediction | Code | 2
Minusformer: Improving Time Series Forecasting by Progressively Learning Residuals | Code | 2
Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods | Code | 2
Linear-time Minimum Bayes Risk Decoding with Reference Aggregation | Code | 2
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion | Code | 2
A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation | Code | 2
Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses | Code | 2
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints | Code | 2
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Code | 2
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning | Code | 2
PyVRP: a high-performance VRP solver package | Code | 2
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts | Code | 2
GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Code | 2
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators | Code | 2
Debating with More Persuasive LLMs Leads to More Truthful Answers | Code | 2
Learning Continuous 3D Words for Text-to-Image Generation | Code | 2
Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference | Code | 2
LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset | Code | 2
Universal Machine Learning Kohn-Sham Hamiltonian for Materials | Code | 2
MultiMedEval: A Benchmark and a Toolkit for Evaluating Medical Vision-Language Models | Code | 2
Conditional Diffusion Probabilistic Model for Speech Enhancement | Code | 2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram | Code | 2
A StrongREJECT for Empty Jailbreaks | Code | 2
Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships | Code | 2
GREEN: a lightweight architecture using learnable wavelets and Riemannian geometry for biomarker exploration | Code | 2
Page 350 of 7094