The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–700 of 659983 papers

Title	Date	Tasks	Status	Hype
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery	Mar 9, 2026		—Unverified	5
OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data	Feb 14, 2026		—Unverified	5
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE	Feb 4, 2026		—Unverified	5
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length	Mar 16, 2026		—Unverified	5
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery	Feb 9, 2026		—Unverified	5
FireRed-Image-Edit-1.0 Technical Report	Feb 12, 2026		—Unverified	5
SAMTok: Representing Any Mask with Two Words	Jan 22, 2026		—Unverified	5
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning	Feb 5, 2026		—Unverified	5
World Action Models are Zero-shot Policies	Feb 17, 2026		—Unverified	5
Helios: Real Real-Time Long Video Generation Model	Mar 4, 2026		—Unverified	5
Rethinking the Design of Reinforcement Learning-Based Deep Research Agents	Feb 21, 2026		—Unverified	5
Kimi K2.5: Visual Agentic Intelligence	Feb 2, 2026		—Unverified	5
Training Large Language Models to Reason in a Continuous Latent Space	Dec 9, 2024	Logical Reasoning	CodeCode Available	5
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception	Jun 21, 2025	Computational Efficiencyobject-detection	CodeCode Available	5
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications	Sep 7, 2022	GPUObject Detection	CodeCode Available	5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification	Oct 14, 2024	Image Generation	CodeCode Available	5
OminiControl2: Efficient Conditioning for Diffusion Transformers	Mar 11, 2025	Conditional Image GenerationDenoising	CodeCode Available	5
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B	Jun 11, 2024	Decision MakingGSM8K	CodeCode Available	5
Semantic Operators: A Declarative Model for Rich, AI-based Data Processing	Jul 16, 2024	Extreme Multi-Label ClassificationFact Checking	CodeCode Available	5
OMG-Seg: Is One Model Good Enough For All Segmentation?	Jan 18, 2024	AllDecoder	CodeCode Available	5
Ferret: Refer and Ground Anything Anywhere at Any Granularity	Oct 11, 2023	HallucinationLanguage Modeling	CodeCode Available	5
TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting	May 23, 2024	Future predictionTime Series	CodeCode Available	5
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI	Nov 27, 2023	Complex Query AnsweringLogical Reasoning	CodeCode Available	5
SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition	May 21, 2025		CodeCode Available	5
Masked Completion via Structured Diffusion with White-Box Transformers	Apr 3, 2024	Representation Learning	CodeCode Available	5
Inpaint Anything: Segment Anything Meets Image Inpainting	Apr 13, 2023	Image Inpainting	CodeCode Available	5
Extreme Compression of Large Language Models via Additive Quantization	Jan 11, 2024	CPUGPU	CodeCode Available	5
Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning	Jul 8, 2024		CodeCode Available	5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning	Feb 29, 2024	GPULanguage Modeling	CodeCode Available	5
CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling	May 26, 2024		CodeCode Available	5
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities	May 5, 2025	Image GenerationSurvey	CodeCode Available	5
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation	Apr 16, 2023	Instruction Following	CodeCode Available	5
MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model	Sep 4, 2024	Language ModelingLanguage Modelling	CodeCode Available	5
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search	Dec 24, 2024		CodeCode Available	5
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models	Jul 21, 2024	AllFashion Synthesis	CodeCode Available	5
Arbitrary-steps Image Super-resolution via Diffusion Inversion	Dec 12, 2024	Image Super-ResolutionSuper-Resolution	CodeCode Available	5
SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks	Apr 15, 2024	Quantization	CodeCode Available	5
SymbolicAI: A framework for logic-based approaches combining generative models and solvers	Feb 1, 2024	Few-Shot LearningIn-Context Learning	CodeCode Available	5
That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design	Nov 15, 2024	Deep Reinforcement Learning	CodeCode Available	5
GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation	Nov 27, 2024	Depth EstimationDiversity	CodeCode Available	5
Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction	May 31, 2024	Speech Synthesis	CodeCode Available	5
A quantum semantic framework for natural language processing	Jun 11, 2025		CodeCode Available	5
Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers	May 10, 2024		CodeCode Available	5
The Path To Autonomous Cyber Defense	Apr 12, 2024		CodeCode Available	5
CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians	Apr 1, 2024	3DGS3D Scene Reconstruction	CodeCode Available	5
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions	Mar 12, 2024	Model Editing	CodeCode Available	5
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond	Apr 11, 2023	Text to 3D	CodeCode Available	5
Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values	Jun 30, 2022	Additive modelsBIG-bench Machine Learning	CodeCode Available	5
Magic Clothing: Controllable Garment-Driven Image Synthesis	Apr 15, 2024	Image Generation	CodeCode Available	5
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation	Jun 25, 2024	DiversityNatural Language Understanding	CodeCode Available	5