The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,323 code links · 4,818 tasks

Papers

Showing 3551–3600 of 661,570 papers

Title	Date	Tasks	Status	Hype
Long-Context Autoregressive Video Modeling with Next-Frame Prediction	Mar 25, 2025	Text Generation, Video Generation	Code Available	3
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation	Apr 23, 2024	Attribute, Video Generation	Code Available	3
Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion	Mar 20, 2024	3DGS, Novel View Synthesis	Code Available	3
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow	Jun 12, 2023		Code Available	3
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer	Jul 2, 2019	Depth Estimation, Monocular Depth Estimation	Code Available	3
Consistency Models Made Easy	Jun 20, 2024	Computational Efficiency, GPU	Code Available	3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers	Sep 6, 2024	Experimental Design, scientific discovery	Code Available	3
UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction	Mar 22, 2024	Diversity, Prediction	Code Available	3
Scalable Optimization in the Modular Norm	May 23, 2024		Code Available	3
SupeRANSAC: One RANSAC to Rule Them All	Jun 5, 2025	All, Pose Estimation	Code Available	3
Wordflow: Social Prompt Engineering for Large Language Models	Jan 25, 2024	Prompt Engineering	Code Available	3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing	Dec 2, 2024	Language Modeling, Language Modelling	Code Available	3
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines	Jun 20, 2024	Diversity, object-detection	Code Available	3
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models	Jun 19, 2024	Instruction Following	Code Available	3
Face Anonymization Made Simple	Nov 1, 2024	Attribute, Face Anonymization	Code Available	3
Locating and Editing Factual Associations in GPT	Feb 10, 2022	counterfactual, Model Editing	Code Available	3
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network	Sep 10, 2022	Continual Learning, Object	Code Available	3
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos	May 3, 2024	Depth Estimation, Depth Prediction	Code Available	3
ImageInWords: Unlocking Hyper-Detailed Image Descriptions	May 5, 2024	Image Generation, Specificity	Code Available	3
Flow Q-Learning	Feb 4, 2025	Action Generation, D4RL	Code Available	3
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs	Apr 1, 2025	Knowledge Graphs, Mathematical Reasoning	Code Available	3
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility	Mar 18, 2024	Image Inpainting, Video Alignment	Code Available	3
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition	Dec 12, 2024	EgoSchema	Code Available	3
The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report	Apr 16, 2024	Image Super-Resolution, Super-Resolution	Code Available	3
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding	Oct 22, 2024	Token Reduction, Video Question Answering	Code Available	3
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation	Jun 2, 2025	4k, Descriptive	Code Available	3
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers	Feb 13, 2024	Question Answering, Retrieval	Code Available	3
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning	May 15, 2022	Fairness, Semi-Supervised Image Classification	Code Available	3
Unlimited-Size Diffusion Restoration	Mar 1, 2023	Image Generation, Image Restoration	Code Available	3
TorchSparse: Efficient Point Cloud Inference Engine	Apr 21, 2022	Autonomous Driving	Code Available	3
Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis	Jul 18, 2023	NeRF	Code Available	3
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection	Jul 22, 2024	Anomaly Detection, Language Modeling	Code Available	3
From Matching to Generation: A Survey on Generative Information Retrieval	Apr 23, 2024	Incremental Learning, Information Retrieval	Code Available	3
SAM-Med2D	Aug 30, 2023	Decoder, Image Segmentation	Code Available	3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents	Feb 8, 2024	Autonomous Driving, Language Modeling	Code Available	3
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation	Apr 8, 2024	Image Generation, Image-to-Image Translation	Code Available	3
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations	Mar 11, 2024	Disentanglement	Code Available	3
GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation	Jun 10, 2024	3D Generation, NeRF	Code Available	3
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details	Jun 19, 2025	Texture Synthesis	Code Available	3
ResearchTown: Simulator of Human Research Community	Dec 23, 2024		Code Available	3
From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision	Dec 15, 2024	Active Learning	Code Available	3
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs	Jan 12, 2024		Code Available	3
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion	Nov 4, 2023	Benchmarking, Imitation Learning	Code Available	3
TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery	Feb 16, 2022	BIG-bench Machine Learning, Drug Discovery	Code Available	3
MathArena: Evaluating LLMs on Uncontaminated Math Competitions	May 29, 2025	Math, Mathematical Reasoning	Code Available	3
Frequency-aware Feature Fusion for Dense Image Prediction	Aug 23, 2024	Prediction	Code Available	3
VoiceBench: Benchmarking LLM-Based Voice Assistants	Oct 22, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	3
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation	Mar 18, 2024	3D Generation, 3D Reconstruction	Code Available	3
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents	Jan 24, 2025	Benchmarking	Code Available	3
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction	Mar 13, 2025	Autonomous Driving, Surface Reconstruction	Code Available	3