The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7051–7100 of 661570 papers

Title	Date	Tasks	Status	Hype
Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation	Mar 17, 2025	Domain AdaptationDomain Generalization	CodeCode Available	2
DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry	Mar 17, 2025	valid	CodeCode Available	2
Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt	May 14, 2025	Anomaly DetectionAnomaly Segmentation	CodeCode Available	2
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models	May 15, 2025	Mathreinforcement-learning	CodeCode Available	2
Relational Graph Transformer	May 16, 2025	Graph Neural Network	CodeCode Available	2
AdaptThink: Reasoning Models Can Learn When to Think	May 19, 2025	Math	CodeCode Available	2
AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection	May 19, 2025	Anomaly DetectionCode Generation	CodeCode Available	2
FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models	May 19, 2025	Disaster ResponseVision and Language Navigation	CodeCode Available	2
GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent	May 22, 2025		CodeCode Available	2
Ranked Entropy Minimization for Continual Test-Time Adaptation	May 22, 2025	Test-time Adaptation	CodeCode Available	2
Training Long-Context LLMs Efficiently via Chunk-wise Optimization	May 22, 2025	16kGPU	CodeCode Available	2
Training-Free Multi-Step Audio Source Separation	May 26, 2025	Audio Source SeparationDenoising	CodeCode Available	2
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning	May 26, 2025	Decision MakingHierarchical Reinforcement Learning	CodeCode Available	2
WeatherEdit: Controllable Weather Editing with 4D Gaussian Field	May 26, 2025	3D Generation3DGS	CodeCode Available	2
HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions	May 29, 2025	Image AnimationVideo Generation	CodeCode Available	2
Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization and Temporal Motion Modulation	May 29, 2025	Portrait AnimationVideo Alignment	CodeCode Available	2
TC-GS: A Faster Gaussian Splatting Module Utilizing Tensor Cores	May 30, 2025	3DGS	CodeCode Available	2
When Large Multimodal Models Confront Evolving Knowledge:Challenges and Pathways	May 30, 2025	Continual LearningImage Augmentation	CodeCode Available	2
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization	May 30, 2025	Story Visualization	CodeCode Available	2
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention	Apr 8, 2025		CodeCode Available	2
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes	Jun 2, 2025	Natural Language QueriesNavigate	CodeCode Available	2
Savage-Dickey density ratio estimation with normalizing flows for Bayesian model comparison	Jun 4, 2025	Density Ratio Estimation	CodeCode Available	2
VideoMolmo: Spatio-Temporal Grounding Meets Pointing	Jun 5, 2025	Autonomous DrivingAutonomous Navigation	CodeCode Available	2
ORV: 4D Occupancy-centric Robot Video Generation	Jun 3, 2025	Video Generation	CodeCode Available	2
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better	Jun 10, 2025	Image Generation	CodeCode Available	2
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction	Jun 9, 2025	Reinforcement Learning (RL)	CodeCode Available	2
Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20^th century Urban Landscapes with Satellite Imageries	Jun 11, 2025	SegmentationSelf-Supervised Learning	CodeCode Available	2
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting	Jun 11, 2025	DiversityRepresentation Learning	CodeCode Available	2
CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models	Jun 11, 2025	counterfactualDescriptive	CodeCode Available	2
Language Modeling by Language Models	Jun 25, 2025	Code GenerationLanguage Modeling	CodeCode Available	2
PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning	Jun 24, 2025	BenchmarkingDrug Discovery	CodeCode Available	2
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS	Nov 28, 2023	Knowledge DistillationNeRF	CodeCode Available	2
RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking	Jun 20, 2025	6D Pose EstimationObject	CodeCode Available	2
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning	Jun 18, 2025	Caption GenerationDescriptive	CodeCode Available	2
AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration	Mar 16, 2025	Camera Calibration	CodeCode Available	2
Learning to See in the Extremely Dark	Jun 26, 2025	DenoisingExposure Correction	CodeCode Available	2
Closed-form Continuous-time Neural Models	Jun 25, 2021	FormSentiment Analysis	CodeCode Available	2
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning	Jan 25, 2025	Answer GenerationMulti-agent Reinforcement Learning	CodeCode Available	2
When Language Model Meets Private Library	Oct 31, 2022	Code GenerationLanguage Modeling	CodeCode Available	2
H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models	Jun 24, 2023	GPU	CodeCode Available	2
MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning	Apr 14, 2025	Machine TranslationReinforcement Learning (RL)	CodeCode Available	2
Visual Reinforcement Learning with Imagined Goals	Jul 12, 2018	reinforcement-learningReinforcement Learning	CodeCode Available	2
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics	Nov 18, 2024	Vision-Language-Action	CodeCode Available	2
The Replica Dataset: A Digital Replica of Indoor Spaces	Jun 13, 2019	3D Scene ReconstructionInstruction Following	CodeCode Available	2
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model	Mar 8, 2025	Image Quality AssessmentLanguage Modeling	CodeCode Available	2
Multi-Objective Molecule Generation using Interpretable Substructures	Feb 8, 2020	DiversityDrug Design	CodeCode Available	2
Neural Network Compression Framework for fast model inference	Feb 20, 2020	BinarizationCPU	CodeCode Available	2
Towards Backdoor Attacks and Defense in Robust Machine Learning Models	Feb 25, 2020	BIG-bench Machine LearningClustering	CodeCode Available	2
Adversarial Attacks and Defenses on Graphs: A Review, A Tool and Empirical Studies	Mar 2, 2020	Adversarial Attack	CodeCode Available	2
On the Planning Abilities of Large Language Models - A Critical Investigation	Sep 21, 2023		CodeCode Available	2