The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers


Showing 4,051–4,100 of 661,570 papers

Title	Date	Tasks	Status	Hype
DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks	Jun 13, 2024	Benchmarking	Code Available	3
CORL: Research-oriented Deep Offline Reinforcement Learning Library	Oct 13, 2022	Benchmarking, D4RL	Code Available	3
Data Filtering Networks	Sep 29, 2023	Language Modeling, Language Modelling	Code Available	3
FastMap: Revisiting Dense and Scalable Structure from Motion	May 7, 2025	GPU	Code Available	3
ToRL: Scaling Tool-Integrated RL	Mar 30, 2025	Math, reinforcement-learning	Code Available	3
Safety of Multimodal Large Language Models on Images and Texts	Feb 1, 2024	Survey	Code Available	3
Low-Rank Few-Shot Adaptation of Vision-Language Models	May 28, 2024	Few-Shot Learning, parameter-efficient fine-tuning	Code Available	3
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text	Jun 12, 2024	In-Context Learning	Code Available	3
Large Spatial Model: End-to-end Unposed Images to Semantic 3D	Oct 24, 2024	3D Reconstruction, Attribute	Code Available	3
FlatQuant: Flatness Matters for LLM Quantization	Oct 12, 2024	Quantization	Code Available	3
Optimal Stepsize for Diffusion Sampling	Mar 27, 2025	Denoising, Image Generation	Code Available	3
DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus	May 22, 2024	3DGS, 3D Reconstruction	Code Available	3
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model	Apr 30, 2024	Motion Generation, Motion Synthesis	Code Available	3
Benchmarking LLMs via Uncertainty Quantification	Jan 23, 2024	Benchmarking, Uncertainty Quantification	Code Available	3
Olympus: A Universal Task Router for Computer Vision Tasks	Dec 12, 2024		Code Available	3
A guide to convolution arithmetic for deep learning	Mar 23, 2016	Deep Learning	Code Available	3
ARC Prize 2024: Technical Report	Dec 5, 2024	ARC, Program Synthesis	Code Available	3
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation	Apr 15, 2024	Contrastive Learning, Descriptive	Code Available	3
LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis	Apr 3, 2024	3D Reconstruction, 4D reconstruction	Code Available	3
Defeating Prompt Injections by Design	Mar 24, 2025		Code Available	3
SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks	Nov 20, 2023	Diversity, Image Segmentation	Code Available	3
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models	May 31, 2024	Language Modeling, Language Modelling	Code Available	3
Faithful Logical Reasoning via Symbolic Chain-of-Thought	May 28, 2024	Logical Reasoning	Code Available	3
Multimodal Table Understanding	Jun 12, 2024	Language Modeling, Language Modelling	Code Available	3
KV-Edit: Training-Free Image Editing for Precise Background Preservation	Feb 24, 2025	Text-based Image Editing	Code Available	3
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving	Aug 1, 2024		Code Available	3
VideoGen-Eval: Agent-based System for Video Generation Evaluation	Mar 30, 2025	Diversity, Video Generation	Code Available	3
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis	May 5, 2025	Chatbot, Decoder	Code Available	3
JAFAR: Jack up Any Feature at Any Resolution	Jun 10, 2025	Feature Upsampling	Code Available	3
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation	Dec 30, 2024	Video Generation, Video Quality Assessment	Code Available	3
GENERator: A Long-Context Generative Genomic Foundation Model	Feb 11, 2025	model	Code Available	3
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models	Feb 10, 2025	Decoder	Code Available	3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model	Nov 21, 2024	Language Modeling, Language Modelling	Code Available	3
Half-Inverse Gradients for Physical Deep Learning	Mar 18, 2022	Deep Learning	Code Available	3
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction	Dec 19, 2023	3D Reconstruction, Generalizable Novel View Synthesis	Code Available	3
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models	Feb 15, 2024	Language Modeling, Language Modelling	Code Available	3
DisCo: Disentangled Control for Realistic Human Dance Generation	Jun 30, 2023	Attribute	Code Available	3
∇²DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials	Jun 20, 2024	Drug Discovery, Molecular Property Prediction	Code Available	3
DARWIN 1.5: Large Language Models as Materials Science Adapted Learners	Dec 16, 2024	Large Language Model, Multi-Task Learning	Code Available	3
NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal Simulation	May 27, 2025	Computational Efficiency, Graph Neural Network	Code Available	3
A Comprehensive Survey on Segment Anything Model for Vision and Beyond	May 14, 2023		Code Available	3
HLOB -- Information Persistence and Structure in Limit Order Books	May 29, 2024	Deep Learning	Code Available	3
Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving	Aug 14, 2024	3D Object Detection, 3D Object Tracking	Code Available	3
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap	Jan 3, 2025	Recommendation Systems, World Knowledge	Code Available	3
Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians	Mar 21, 2024	Binarization	Code Available	3
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow	Oct 9, 2024		Code Available	3
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams	Jun 12, 2024	cross-modal alignment, Language Modelling	Code Available	3
Opportunities and Risks of LLMs for Scalable Deliberation with Polis	Jun 20, 2023		Code Available	3
RePlay: a Recommendation Framework for Experimentation and Production Use	Sep 11, 2024	Recommendation Systems	Code Available	3
Deep Reinforcement Learning	Oct 15, 2018	Deep Reinforcement Learning, Management	Code Available	3