The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 1401–1450 of 177,339 papers

Title	Date	Tasks	Status	Hype	Score
SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models	Mar 14, 2024	Blocking, GPU	Code Available	4	5
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers	Aug 12, 2024	GSM8K, Math	Code Available	4	5
Data quality dimensions for fair AI	May 11, 2023	Classification, Fairness	Code Available	4	5
AnyText: Multilingual Visual Text Generation And Editing	Nov 6, 2023	Image Generation, Optical Character Recognition (OCR)	Code Available	4	5
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders	Dec 12, 2024	Gaze Target Estimation	Code Available	4	5
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation	May 26, 2022	3D Multi-Object Tracking, 3D Object Detection	Code Available	4	5
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing	May 7, 2024	Image Manipulation, Language Modeling	Code Available	4	5
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control	Feb 24, 2025	reinforcement-learning, Reinforcement Learning	Code Available	4	5
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models	Mar 24, 2025		Code Available	4	5
Kubric: A scalable dataset generator	Mar 7, 2022	Fairness, NeRF	Code Available	4	5
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation	Aug 8, 2024	Chunking, Fact Checking	Code Available	4	5
R^2-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction	May 31, 2024	3DGS, NeRF	Code Available	4	5
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments	Jun 6, 2024	Language Modeling, Language Modelling	Code Available	4	5
RecBole 2.0: Towards a More Up-to-Date Recommendation Library	Jun 15, 2022	Benchmarking, Data Augmentation	Code Available	4	5
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates	Feb 10, 2025	Hierarchical Reinforcement Learning, Language Modeling	Code Available	4	5
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching	Sep 1, 2024	Patch Matching, Stereo Matching	Code Available	4	5
Long Context Transfer from Language to Vision	Jun 24, 2024	Language Modeling, Language Modelling	Code Available	4	5
RealisDance: Equip controllable character animation with realistic hands	Sep 10, 2024		Code Available	4	5
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals	Jul 18, 2024	Experimental Design, GPU	Code Available	4	5
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning	May 6, 2025	Image Generation	Code Available	4	5
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters	Oct 30, 2024	model	Code Available	4	5
A Closer Look at Deep Learning Methods on Tabular Datasets	Jul 1, 2024	Attribute, Deep Learning	Code Available	4	5
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking	Mar 14, 2024	GSM8K, Language Modelling	Code Available	4	5
Magicoder: Empowering Code Generation with OSS-Instruct	Dec 4, 2023	Code Generation, HumanEval	Code Available	4	5
Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models	Apr 15, 2025	Humanoid Control, Reinforcement Learning (RL)	Code Available	4	5
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference	Oct 6, 2023	GPU, Image Generation	Code Available	4	5
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL	Jul 7, 2025	Text to SQL, Text-To-SQL	Code Available	4	5
VM-UNet: Vision Mamba UNet for Medical Image Segmentation	Feb 4, 2024	Image Segmentation, Mamba	Code Available	4	5
FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy	Jul 1, 2023	Federated Learning, Personalized Federated Learning	Code Available	4	5
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering	Feb 26, 2024	Evidence Selection, Open-Ended Question Answering	Code Available	4	5
NExT-GPT: Any-to-Any Multimodal LLM	Sep 11, 2023	AI Agent	Code Available	4	5
Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning	Jun 3, 2025	Code Generation, reinforcement-learning	Code Available	4	5
Eliminating Domain Bias for Federated Learning in Representation Space	Nov 25, 2023	Federated Learning, Privacy Preserving	Code Available	4	5
MotionClone: Training-Free Motion Cloning for Controllable Video Generation	Jun 8, 2024	Denoising, Motion Generation	Code Available	4	5
Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation	Feb 23, 2025	Benchmarking	Code Available	4	5
GIM: Learning Generalizable Image Matcher From Internet Videos	Feb 16, 2024	3D Reconstruction, Camera Pose Estimation	Code Available	4	5
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach	Dec 4, 2024	Image Super-Resolution, Super-Resolution	Code Available	4	5
Pearl: A Production-ready Reinforcement Learning Agent	Dec 6, 2023	Benchmarking, reinforcement-learning	Code Available	4	5
Towards All-in-One Medical Image Re-Identification	Mar 11, 2025	All	Code Available	4	5
LocAgent: Graph-Guided LLM Agents for Code Localization	Mar 12, 2025	GitHub issue resolution, Navigate	Code Available	4	5
GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated Learning	Aug 20, 2023	Fairness, Federated Learning	Code Available	4	5
Data-centric Artificial Intelligence: A Survey	Mar 17, 2023	Survey	Code Available	4	5
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement	Mar 9, 2025	Domain Generalization, Object Detection	Code Available	4	5
KeyPoint Relative Position Encoding for Face Recognition	Mar 21, 2024	Face Recognition, Gait Recognition	Code Available	4	5
Diffusion Model-Based Image Editing: A Survey	Feb 27, 2024	Denoising, Image Generation	Code Available	4	5
Planning-oriented Autonomous Driving	Dec 20, 2022	Autonomous Driving, Bench2Drive	Code Available	4	5
Generation of Training Data from HD Maps in the Lanelet2 Framework	Jul 24, 2024		Code Available	4	5
NAFSSR: Stereo Image Super-Resolution Using NAFNet	Apr 19, 2022	Image Restoration, Image Super-Resolution	Code Available	4	5
Visual Mamba: A Survey and New Outlooks	Apr 29, 2024	Mamba, Survey	Code Available	4	5
Weighted-Reward Preference Optimization for Implicit Model Fusion	Dec 4, 2024	model	Code Available	4	5