The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1201–1250 of 659983 papers

Title	Date	Tasks	Status	Hype
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models	Dec 10, 2024		CodeCode Available	5
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model	Feb 11, 2024		CodeCode Available	5
Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions	Jan 7, 2024	BenchmarkingImage Segmentation	CodeCode Available	5
aeon: a Python toolkit for learning from time series	Jun 20, 2024	Anomaly DetectionModel Selection	CodeCode Available	5
Controllable Generation with Text-to-Image Diffusion Models: A Survey	Mar 7, 2024	Denoising	CodeCode Available	5
Datasets for Large Language Models: A Comprehensive Survey	Feb 28, 2024	Language ModellingLarge Language Model	CodeCode Available	5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding	Jan 15, 2024	Language ModelingLanguage Modelling	CodeCode Available	5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis	Jan 16, 2024	3D ReconstructionFace Generation	CodeCode Available	5
Make Your LLM Fully Utilize the Context	Apr 25, 2024	4kInformation Retrieval	CodeCode Available	5
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training	May 23, 2023	Contrastive LearningSelf-Supervised Learning	CodeCode Available	5
Unified Training of Universal Time Series Forecasting Transformers	Feb 4, 2024	Time SeriesTime Series Forecasting	CodeCode Available	5
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework	Apr 16, 2025	Image Generation	CodeCode Available	5
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis	Oct 21, 2024	Anomaly DetectionImputation	CodeCode Available	5
Learning Flow Fields in Attention for Controllable Person Image Generation	Dec 11, 2024	AttributeImage Generation	CodeCode Available	5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts	Apr 13, 2024	DiversityLanguage Modeling	CodeCode Available	5
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens	Mar 2, 2026		—Unverified	4
Unified Personalized Reward Model for Vision Generation	Feb 10, 2026		—Unverified	4
Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills	Mar 9, 2026		—Unverified	4
Reinforcement Learning via Self-Distillation	Feb 16, 2026		—Unverified	4
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks	Mar 13, 2026		—Unverified	4
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models	Feb 18, 2026		—Unverified	4
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery	Mar 18, 2026		—Unverified	4
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining	Feb 6, 2026		—Unverified	4
VideoWorld 2: Learning Transferable Knowledge from Real-world Videos	Feb 10, 2026		—Unverified	4
R-Zero: Self-Evolving Reasoning LLM from Zero Data	Feb 13, 2026		—Unverified	4
ATOM: AdapTive and OptiMized dynamic temporal knowledge graph construction using LLMs	Jan 24, 2026		—Unverified	4
Precise Object and Effect Removal with Adaptive Target-Aware Attention	Mar 16, 2026		—Unverified	4
MOSS-TTS Technical Report	Mar 18, 2026		—Unverified	4
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations	Feb 2, 2026		—Unverified	4
MotionStream: Real-Time Video Generation with Interactive Motion Controls	Mar 5, 2026		—Unverified	4
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models	Feb 3, 2026		—Unverified	4
Closing the Loop: Universal Repository Representation with RPG-Encoder	Feb 3, 2026		—Unverified	4
MOVA: Towards Scalable and Synchronized Video-Audio Generation	Feb 10, 2026		—Unverified	4
Cautious Weight Decay	Feb 24, 2026		—Unverified	4
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs	Jan 22, 2026		—Unverified	4
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching	Mar 17, 2026		—Unverified	4
SkillNet: Create, Evaluate, and Connect AI Skills	Feb 26, 2026		—Unverified	4
TTT3R: 3D Reconstruction as Test-Time Training	Mar 3, 2026		—Unverified	4
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation	Feb 6, 2026		—Unverified	4
SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds	Jan 22, 2026		—Unverified	4
UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers	Mar 1, 2026		—Unverified	4
Utonia: Toward One Encoder for All Point Clouds	Mar 3, 2026		—Unverified	4
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models	Jan 29, 2026		—Unverified	4
On the Theoretical Limitations of Embedding-Based Retrieval	Mar 12, 2026		—Unverified	4
MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator	Mar 16, 2026		—Unverified	4
Hyperagents	Mar 19, 2026		—Unverified	4
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations	Feb 28, 2026		—Unverified	4
Masked Depth Modeling for Spatial Perception	Jan 25, 2026		—Unverified	4
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research	Feb 6, 2026		—Unverified	4
Learning to Discover at Test Time	Feb 5, 2026		—Unverified	4