SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 44014450 of 661570 papers

TitleStatusHype
Reason-RFT: Reinforcement Fine-Tuning for Visual ReasoningCode3
Designing and building the mlpack open-source machine learning libraryCode3
One-step Diffusion with Distribution Matching DistillationCode3
EAFormer: Scene Text Segmentation with Edge-Aware TransformersCode3
Accurate clinical and biomedical Named entity recognition at scaleCode3
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1Code3
EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language ModelsCode3
LRM: Large Reconstruction Model for Single Image to 3DCode3
GluonTS: Probabilistic Time Series Models in PythonCode3
Practical Deep Reinforcement Learning Approach for Stock TradingCode3
CodeBLEU: a Method for Automatic Evaluation of Code SynthesisCode3
Aguvis: Unified Pure Vision Agents for Autonomous GUI InteractionCode3
Merlin: A Vision Language Foundation Model for 3D Computed TomographyCode3
Text Embeddings Reveal (Almost) As Much As TextCode3
dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive CachingCode3
SkillMimic: Learning Basketball Interaction Skills from DemonstrationsCode3
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat GenerationCode3
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal ModelCode3
RMPE: Regional Multi-person Pose EstimationCode3
Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective TasksCode3
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion ModelsCode3
PAL: Program-aided Language ModelsCode3
HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous DrivingCode3
Learning and discovering multiple solutions using physics-informed neural networks with random initialization and deep ensembleCode3
3D Facial Expressions through Analysis-by-Neural-SynthesisCode3
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense PredictionsCode3
GLU Variants Improve TransformerCode3
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning ResearchCode3
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video GenerationCode3
ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal RateCode3
FlashSpeech: Efficient Zero-Shot Speech SynthesisCode3
Momentum Contrast for Unsupervised Visual Representation LearningCode3
Characterization of Excess Risk for Locally Strongly Convex Population RiskCode3
wav2letter++: The Fastest Open-source Speech Recognition SystemCode3
Identifying Audio Adversarial Examples via Anomalous Pattern DetectionCode3
Towards VQA Models That Can ReadCode3
First Order Motion Model for Image AnimationCode3
Transformers in Medical Imaging: A SurveyCode3
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsCode3
Pythia v0.1: the Winning Entry to the VQA Challenge 2018Code3
SQLFlow: A Bridge between SQL and Machine LearningCode3
Mesh R-CNNCode3
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and AudioCode3
Efficient and Robust Automated Machine LearningCode3
MM-Agent: LLM as Agents for Real-world Mathematical Modeling ProblemCode3
SynSin: End-to-end View Synthesis from a Single ImageCode3
An Extensible Framework for Open Heterogeneous Collaborative PerceptionCode3
Multi-Head RAG: Solving Multi-Aspect Problems with LLMsCode3
Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligenceCode3
Show:102550
← PrevPage 89 of 13232Next →