SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 43514400 of 177340 papers

TitleStatusHype
dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive CachingCode3
SkillMimic: Learning Basketball Interaction Skills from DemonstrationsCode3
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat GenerationCode3
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal ModelCode3
RMPE: Regional Multi-person Pose EstimationCode3
Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective TasksCode3
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion ModelsCode3
PAL: Program-aided Language ModelsCode3
HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous DrivingCode3
Learning and discovering multiple solutions using physics-informed neural networks with random initialization and deep ensembleCode3
3D Facial Expressions through Analysis-by-Neural-SynthesisCode3
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense PredictionsCode3
GLU Variants Improve TransformerCode3
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning ResearchCode3
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video GenerationCode3
ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal RateCode3
FlashSpeech: Efficient Zero-Shot Speech SynthesisCode3
Momentum Contrast for Unsupervised Visual Representation LearningCode3
Characterization of Excess Risk for Locally Strongly Convex Population RiskCode3
wav2letter++: The Fastest Open-source Speech Recognition SystemCode3
Identifying Audio Adversarial Examples via Anomalous Pattern DetectionCode3
Towards VQA Models That Can ReadCode3
First Order Motion Model for Image AnimationCode3
Transformers in Medical Imaging: A SurveyCode3
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsCode3
Pythia v0.1: the Winning Entry to the VQA Challenge 2018Code3
SQLFlow: A Bridge between SQL and Machine LearningCode3
Mesh R-CNNCode3
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and AudioCode3
Efficient and Robust Automated Machine LearningCode3
MM-Agent: LLM as Agents for Real-world Mathematical Modeling ProblemCode3
SynSin: End-to-end View Synthesis from a Single ImageCode3
An Extensible Framework for Open Heterogeneous Collaborative PerceptionCode3
Multi-Head RAG: Solving Multi-Aspect Problems with LLMsCode3
Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligenceCode3
MMLSpark: Unifying Machine Learning Ecosystems at Massive ScalesCode3
Simulating the Real World: A Unified Survey of Multimodal Generative ModelsCode3
AlphaEvolve: A Learning Framework to Discover Novel Alphas in Quantitative InvestmentCode3
VideoRoPE: What Makes for Good Video Rotary Position Embedding?Code3
Green AICode3
Bag of Freebies for Training Object Detection Neural NetworksCode3
Characterizing signal propagation to close the performance gap in unnormalized ResNetsCode3
SnapKV: LLM Knows What You are Looking for Before GenerationCode3
Towards Next-Generation LLM-based Recommender Systems: A Survey and BeyondCode3
Distributional Generalization: A New Kind of GeneralizationCode3
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept SpaceCode3
ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and ReasoningCode3
Bilinear Attention NetworksCode3
Caption Anything: Interactive Image Description with Diverse Multimodal ControlsCode3
Show:102550
← PrevPage 88 of 3547Next →