The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 15351–15400 of 474278 papers

Title	Date	Tasks	Status	Hype
Hybrid Meta-learners for Estimating Heterogeneous Treatment Effects	Jun 16, 2025	POS	CodeCode Available	0
Automatic Multi-View X-Ray/CT Registration Using Bone Substructure Contours	Jun 16, 2025		CodeCode Available	0
Variational Inference with Mixtures of Isotropic Gaussians	Jun 16, 2025	Bayesian InferenceVariational Inference	CodeCode Available	0
Curriculum Learning for Biological Sequence Prediction: The Case of De Novo Peptide Sequencing	Jun 16, 2025	de novo peptide sequencing	CodeCode Available	1
We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems	Jun 16, 2025	PositionRed Teaming	CodeCode Available	0
Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts	Jun 16, 2025	Image Generation	—Unverified	0
OneRec Technical Report	Jun 16, 2025	Recommendation Systems	—Unverified	0
CALM: Consensus-Aware Localized Merging for Multi-Task Learning	Jun 16, 2025	Multi-Task LearningTask Arithmetic	CodeCode Available	0
Value-Free Policy Optimization via Reward Partitioning	Jun 16, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
EUNIS Habitat Maps: Enhancing Thematic and Spatial Resolution for Europe through Machine Learning	Jun 16, 2025		CodeCode Available	0
Meta-learning how to Share Credit among Macro-Actions	Jun 16, 2025	Atari GamesMeta-Learning	CodeCode Available	0
TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning	Jun 16, 2025	Reinforcement Learning (RL)Time Series	CodeCode Available	2
Simple is what you need for efficient and accurate medical image segmentation	Jun 16, 2025	feature selectionImage Segmentation	CodeCode Available	0
PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images	Jun 16, 2025	3D Human ReconstructionImage Reconstruction	—Unverified	0
RelTopo: Enhancing Relational Modeling for Driving Scene Topology Reasoning	Jun 16, 2025	Autonomous DrivingContrastive Learning	—Unverified	0
Characterizing Linguistic Shifts in Croatian News via Diachronic Word Embeddings	Jun 16, 2025	ArticlesDiachronic Word Embeddings	CodeCode Available	0
COME: Adding Scene-Centric Forecasting Control to Occupancy World Model	Jun 16, 2025	Autonomous DrivingRepresentation Learning	CodeCode Available	1
IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation	Jun 16, 2025	Text Generation	CodeCode Available	0
MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering	Jun 16, 2025	Conformal PredictionHardware Aware Neural Architecture Search	—Unverified	0
Flexible-length Text Infilling for Discrete Diffusion Models	Jun 16, 2025	PositionText Infilling	—Unverified	0
Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry	Jun 16, 2025	Novel View Synthesis	—Unverified	0
UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions	Jun 16, 2025	4k8k	—Unverified	0
Audio-Visual Driven Compression for Low-Bitrate Talking Head Videos	Jun 16, 2025	Neural RenderingVideo Compression	—Unverified	0
Understanding Learning Invariance in Deep Linear Networks	Jun 16, 2025	Data Augmentation	—Unverified	0
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning	Jun 16, 2025	Reinforcement Learning (RL)	—Unverified	0
Machine Learning-Driven Compensation for Non-Ideal Channels in AWG-Based FBG Interrogator	Jun 16, 2025	regression	—Unverified	0
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects	Jun 16, 2025	BenchmarkingInstance Segmentation	—Unverified	0
Computational lower bounds in latent models: clustering, sparse-clustering, biclustering	Jun 16, 2025	Clustering	—Unverified	0
FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception	Jun 16, 2025	ObjectPneumonia Detection	—Unverified	0
Limited-Angle CBCT Reconstruction via Geometry-Integrated Cycle-domain Denoising Diffusion Probabilistic Models	Jun 16, 2025	ARCDenoising	—Unverified	0
Instruction Following by Boosting Attention of Large Language Models	Jun 16, 2025	Instruction FollowingPrompt Engineering	—Unverified	0
Active Multimodal Distillation for Few-shot Action Recognition	Jun 16, 2025	Action RecognitionFew-Shot action recognition	—Unverified	0
Intelligent Metasurface-Enabled Integrated Sensing and Communication: Unified Framework and Key Technologies	Jun 16, 2025	Integrated sensing and communicationISAC	—Unverified	0
ViT-NeBLa: A Hybrid Vision Transformer and Neural Beer-Lambert Framework for Single-View 3D Reconstruction of Oral Anatomy from Panoramic Radiographs	Jun 16, 2025	3D ReconstructionAnatomy	—Unverified	0
Micro-macro Gaussian Splatting with Enhanced Scalability for Unconstrained Scene Reconstruction	Jun 16, 2025	3D ReconstructionDiversity	CodeCode Available	0
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention	Jun 16, 2025	Mixture-of-ExpertsReinforcement Learning (RL)	CodeCode Available	7
Adversarial Disentanglement by Backpropagation with Physics-Informed Variational Autoencoder	Jun 16, 2025	Disentanglement	CodeCode Available	0
The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning	Jun 16, 2025	Deep Reinforcement LearningMuJoCo	—Unverified	0
Honesty in Causal Forests: When It Helps and When It Hurts	Jun 16, 2025	Causal Inference	—Unverified	0
SAGDA: Open-Source Synthetic Agriculture Data for Africa	Jun 16, 2025	Data Augmentation	CodeCode Available	0
Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization	Jun 16, 2025	Deep Reinforcement Learning	CodeCode Available	0
Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models	Jun 16, 2025	ColorizationDenoising	—Unverified	0
Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs	Jun 16, 2025	Model Compression	CodeCode Available	0
SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer	Jun 16, 2025	Style Transfer	CodeCode Available	1
Multipole Attention for Efficient Long Context Reasoning	Jun 16, 2025		CodeCode Available	0
Few-Shot Learning for Industrial Time Series: A Comparative Analysis Using the Example of Screw-Fastening Process Monitoring	Jun 16, 2025	BenchmarkingFew-Shot Learning	—Unverified	0
Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations	Jun 16, 2025	Diagnostic	—Unverified	0
Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography	Jun 16, 2025	breast density classificationClassification	—Unverified	0
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences	Jun 16, 2025	Document SummarizationGPU	CodeCode Available	3
Lost in the Mix: Evaluating LLM Understanding of Code-Switched Text	Jun 16, 2025		CodeCode Available	0