The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 16701–16750 of 474278 papers

Title	Date	Tasks	Status	Hype
MAC: An Efficient Gradient Preconditioning using Mean Activation Approximated Curvature	Jun 10, 2025		CodeCode Available	0
Urban Incident Prediction with Graph Neural Networks: Integrating Government Ratings and Crowdsourced Reports	Jun 10, 2025		CodeCode Available	0
IMAGIC-500: IMputation benchmark on A Generative Imaginary Country (500k samples)	Jun 10, 2025	Computational EfficiencyImputation	CodeCode Available	0
Agile Reinforcement Learning for Real-Time Task Scheduling in Edge Computing	Jun 10, 2025	Edge-computingreinforcement-learning	CodeCode Available	0
InfoDPCCA: Information-Theoretic Dynamic Probabilistic Canonical Correlation Analysis	Jun 10, 2025	Representation Learning	CodeCode Available	0
On Finetuning Tabular Foundation Models	Jun 10, 2025	In-Context LearningRetrieval	CodeCode Available	1
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams	Jun 10, 2025	3DGS3D Reconstruction	CodeCode Available	2
Image Demoiréing Using Dual Camera Fusion on Mobile Phones	Jun 10, 2025		CodeCode Available	1
SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything	Jun 10, 2025		CodeCode Available	0
HSG-12M: A Large-Scale Spatial Multigraph Dataset	Jun 10, 2025	Graph Learningscientific discovery	CodeCode Available	1
EtiCor++: Towards Understanding Etiquettical Bias in LLMs	Jun 10, 2025	Sensitivity	CodeCode Available	0
On Reasoning Strength Planning in Large Reasoning Models	Jun 10, 2025		CodeCode Available	1
Paths to Causality: Finding Informative Subgraphs Within Knowledge Graphs for Knowledge-Based Causal Discovery	Jun 10, 2025	Causal DiscoveryCausal Inference	CodeCode Available	0
Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood	Jun 10, 2025	Computational EfficiencyD4RL	CodeCode Available	0
Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving	Jun 10, 2025		CodeCode Available	0
Time-Aware World Model for Adaptive Prediction and Control	Jun 10, 2025		CodeCode Available	0
GFRIEND: Generative Few-shot Reward Inference through EfficieNt DPO	Jun 10, 2025	Data AugmentationModel Optimization	CodeCode Available	0
Tailored Architectures for Time Series Forecasting: Evaluating Deep Learning Models on Gaussian Process-Generated Data	Jun 10, 2025	Gaussian ProcessesTime Series	CodeCode Available	0
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning	Jun 10, 2025	Model SelectionReinforcement Learning (RL)	CodeCode Available	2
ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific Charts	Jun 10, 2025	Fact CheckingFact Verification	CodeCode Available	0
Institutional Books 1.0: A 242B token dataset from Harvard Library's collections, refined for accuracy and usability	Jun 10, 2025	Optical Character Recognition (OCR)	CodeCode Available	2
Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings	Jun 10, 2025	Image Captioning	CodeCode Available	0
Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition	Jun 10, 2025	Emotion RecognitionKnowledge Distillation	CodeCode Available	0
Do MIL Models Transfer?	Jun 10, 2025	Multiple Instance LearningTransfer Learning	CodeCode Available	2
A Privacy-Preserving Federated Learning Framework for Generalizable CBCT to Synthetic CT Translation in Head and Neck	Jun 10, 2025	Federated LearningGenerative Adversarial Network	—Unverified	0
JoFormer (Journey-based Transformer): Theory and Empirical Analysis on the Tiny Shakespeare Dataset	Jun 10, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed	Jun 10, 2025	GPU	—Unverified	0
Network Threat Detection: Addressing Class Imbalanced Data with Deep Forest	Jun 10, 2025	Malware Detection	—Unverified	0
Asymptotic Normality of Infinite Centered Random Forests -Application to Imbalanced Classification	Jun 10, 2025	imbalanced classificationvalid	—Unverified	0
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions	Jun 10, 2025	Math	CodeCode Available	2
MagCache: Fast Video Generation with Magnitude-Aware Cache	Jun 10, 2025	SSIMVideo Generation	CodeCode Available	3
ArrowPose: Segmentation, Detection, and 5 DoF Pose Estimation Network for Colorless Point Clouds	Jun 10, 2025	Pose Estimation	—Unverified	0
LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4RTX 4090s	Jun 10, 2025	Super-ResolutionVideo Super-Resolution	—Unverified	0
SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning	Jun 10, 2025	Knowledge DistillationMath	CodeCode Available	1
PlantBert: An Open Source Language Model for Plant Science	Jun 10, 2025	Domain AdaptationLanguage Modeling	—Unverified	0
CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models	Jun 10, 2025	Language ModelingLanguage Modelling	—Unverified	0
Improved Scaling Laws in Linear Regression via Data Reuse	Jun 10, 2025	regression	—Unverified	0
RAISE: Enhancing Scientific Reasoning in LLMs via Step-by-Step Retrieval	Jun 10, 2025	Problem DecompositionRetrieval	CodeCode Available	0
midr: Learning from Black-Box Models by Maximum Interpretation Decomposition	Jun 10, 2025	Explainable artificial intelligenceExplainable Artificial Intelligence (XAI)	CodeCode Available	0
Dialect Normalization using Large Language Models and Morphological Rules	Jun 10, 2025	Natural Language Understanding	CodeCode Available	0
Learnable Spatial-Temporal Positional Encoding for Link Prediction	Jun 10, 2025	Link PredictionPrediction	CodeCode Available	0
PropMEND: Hypernetworks for Knowledge Propagation in LLMs	Jun 10, 2025	knowledge editingLanguage Modeling	CodeCode Available	0
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better	Jun 10, 2025	Image Generation	CodeCode Available	2
EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models	Jun 10, 2025	Instruction FollowingNavigate	CodeCode Available	0
Olica: Efficient Structured Pruning of Large Language Models without Retraining	Jun 10, 2025	GPU	CodeCode Available	0
The Decoupled Risk Landscape in Performative Prediction	Jun 10, 2025	Prediction	CodeCode Available	0
Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation	Jun 10, 2025	Adversarial RobustnessFairness	CodeCode Available	0
VReST: Enhancing Reasoning in Large Vision-Language Models through Tree Search and Self-Reward Mechanism	Jun 10, 2025	Mathematical ReasoningVisual Reasoning	CodeCode Available	0
Solving excited states for long-range interacting trapped ions with neural networks	Jun 10, 2025	Benchmarking	—Unverified	0
Factors affecting the in-context learning abilities of LLMs for dialogue state tracking	Jun 10, 2025	Dialogue State TrackingIn-Context Learning	—Unverified	0