The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5,401–5,450 of 661,570 papers

Title	Date	Tasks	Status	Hype
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models	May 15, 2025	General Knowledge, Prompt Engineering	Code Available	2
A Tutorial on Structural Identifiability of Epidemic Models Using StructuralIdentifiability.jl	May 15, 2025	parameter estimation	Code Available	2
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection	May 15, 2025	Anomaly Detection	Code Available	2
VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality	May 15, 2025	3DGS, GPU	Code Available	2
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models	May 15, 2025	Math, reinforcement-learning	Code Available	2
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning	May 14, 2025	Anomaly Detection, Anomaly Segmentation	Code Available	2
Recent Advances in Medical Imaging Segmentation: A Survey	May 14, 2025	Domain Adaptation, Few-Shot Learning	Code Available	2
Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt	May 14, 2025	Anomaly Detection, Anomaly Segmentation	Code Available	2
Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation	May 14, 2025	Anomaly Classification, Anomaly Detection	Code Available	2
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators	May 14, 2025	Spoken Dialogue Systems	Code Available	2
Reproducibility Study of "Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents"	May 14, 2025		Code Available	2
BAT: Benchmark for Auto-bidding Task	May 13, 2025		Code Available	2
Behind Maya: Building a Multilingual Vision Language Model	May 13, 2025	Language Modeling, Language Modelling	Code Available	2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement	May 13, 2025	Benchmarking, Language Modeling	Code Available	2
CodePDE: An Inference Framework for LLM-driven PDE Solver Generation	May 13, 2025	Code Generation	Code Available	2
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation	May 12, 2025	Language Modeling, Language Modelling	Code Available	2
Unified Continuous Generative Models	May 12, 2025	Image Generation	Code Available	2
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs	May 12, 2025	AI Agent, Knowledge Distillation	Code Available	2
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models	May 12, 2025		Code Available	2
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection	May 12, 2025	Anomaly Detection	Code Available	2
Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent	May 12, 2025	RAG, Reinforcement Learning (RL)	Code Available	2
Adaptive Latent-Space Constraints in Personalized FL	May 12, 2025	Federated Learning	Code Available	2
Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving	May 12, 2025	Math, Mathematical Problem-Solving	Code Available	2
LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning	May 12, 2025		Code Available	2
MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering	May 12, 2025	Large Language Model, reinforcement-learning	Code Available	2
Piloting Structure-Based Drug Design via Modality-Specific Optimal Schedule	May 12, 2025	Drug Design, Scheduling	Code Available	2
YuLan-OneSim: Towards the Next Generation of Social Simulator with Large Language Models	May 12, 2025	Large Language Model, Sociology	Code Available	2
GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance	May 11, 2025	Language Modeling, Language Modelling	Code Available	2
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities	May 10, 2025	Spatial Reasoning	Code Available	2
ReplayCAD: Generative Diffusion Replay for Continual Anomaly Detection	May 10, 2025	Anomaly Detection, continual anomaly detection	Code Available	2
Understanding and Mitigating Toxicity in Image-Text Pretraining Datasets: A Case Study on LLaVA	May 9, 2025		Code Available	2
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation	May 9, 2025	Image Generation, Image Segmentation	Code Available	2
InstanceGen: Image Generation with Instance-level Instructions	May 8, 2025	Image Generation	Code Available	2
Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging	May 8, 2025		Code Available	2
Foam-Agent: Towards Automated Intelligent CFD Workflows	May 8, 2025		Code Available	2
SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	May 8, 2025	3DGS, Data Augmentation	Code Available	2
StabStitch++: Unsupervised Online Video Stitching with Spatiotemporal Bidirectional Warps	May 8, 2025	Image Stitching, Video Stabilization	Code Available	2
Diffusion Model Quantization: A Review	May 8, 2025	model, Quantization	Code Available	2
TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization	May 7, 2025	3D Reconstruction, Fairness	Code Available	2
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World	May 7, 2025		Code Available	2
Steerable Scene Generation with Post Training and Inference-Time Search	May 7, 2025	Scene Generation	Code Available	2
EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning	May 7, 2025	Multiple-choice, Question Answering	Code Available	2
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation	May 7, 2025	3D Generation, Attribute	Code Available	2
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration	May 7, 2025	Computational Efficiency	Code Available	2
Non-stationary Diffusion For Probabilistic Time Series Forecasting	May 7, 2025	Denoising, Probabilistic Time Series Forecasting	Code Available	2
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception	May 7, 2025	object-detection, Object Detection	Code Available	2
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization	May 6, 2025	Active Speaker Detection, Audio-Visual Speech Recognition	Code Available	2
Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation	May 6, 2025	Boundary Detection, Decoder	Code Available	2
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing	May 5, 2025	Triplet	Code Available	2
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models	May 5, 2025	Active Learning	Code Available	2