The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 18501–18550 of 474278 papers

Title	Date	Tasks	Status	Hype
Unified Interference-Aware Water-Filling for QoS-Constrained Communication, Sensing, and JRC	Jun 2, 2025	Joint Radar-Communication	—Unverified	0
Benchmarking Neural Speech Codec Intelligibility with SITool	Jun 2, 2025	BenchmarkingDiagnostic	—Unverified	0
SPACE: Your Genomic Profile Predictor is a Powerful DNA Foundation Model	Jun 2, 2025	Mixture-of-ExpertsUnsupervised Pre-training	CodeCode Available	1
High-gain MIMO Beamforming Antenna System for DSRC and mmwave 5G Integration in Autonomous Vehicles	Jun 2, 2025	Autonomous Vehicles	—Unverified	0
PMNO: A novel physics guided multi-step neural operator predictor for partial differential equations	Jun 2, 2025	Operator learning	—Unverified	0
Life Sequence Transformer: Generative Modelling for Counterfactual Simulation	Jun 2, 2025	counterfactual	—Unverified	0
Stock Market Telepathy: Graph Neural Networks Predicting the Secret Conversations between MINT and G7 Countries	Jun 2, 2025	Graph Neural NetworkMultivariate Time Series Forecasting	—Unverified	0
Pricing the Right to Renege in Search Markets: Evidence from Trucking	Jun 2, 2025	counterfactual	—Unverified	0
Effect of Insecurity on Agricultural Output in Benue State, Nigeria	Jun 2, 2025	Descriptive	—Unverified	0
A combined Machine Learning and Finite Element Modelling tool for the surgical planning of craniosynostosis correction	Jun 2, 2025	Computed Tomography (CT)	—Unverified	0
Sensor Fusion for Track Geometry Monitoring: Integrating On-Board Data and Degradation Models via Kalman Filtering	Jun 2, 2025	Sensor Fusion	—Unverified	0
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models	Jun 2, 2025	Instruction FollowingReinforcement Learning (RL)	CodeCode Available	1
iQUEST: An Iterative Question-Guided Framework for Knowledge Base Question Answering	Jun 2, 2025	Graph Neural NetworkKnowledge Base Question Answering	—Unverified	0
Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning	Jun 2, 2025	Machine UnlearningMath	CodeCode Available	0
LLMs as World Models: Data-Driven and Human-Centered Pre-Event Simulation for Disaster Impact Assessment	Jun 2, 2025	RAG	—Unverified	0
BehaviorBox: Automated Discovery of Fine-Grained Performance Differences Between Language Models	Jun 2, 2025	Language Model Evaluation	—Unverified	0
Why Gradients Rapidly Increase Near the End of Training	Jun 2, 2025	Language ModelingLanguage Modelling	—Unverified	0
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments	Jun 2, 2025	Autonomous NavigationCollision Avoidance	—Unverified	0
KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning	Jun 2, 2025	Knowledge DistillationLarge Language Model	—Unverified	0
LongDWM: Cross-Granularity Distillation for Building a Long-Term Driving World Model	Jun 2, 2025	Video Generation	—Unverified	0
PointT2I: LLM-based text-to-image generation via keypoints	Jun 2, 2025	Image GenerationLarge Language Model	—Unverified	0
Self-Challenging Language Model Agents	Jun 2, 2025	Language ModelingLanguage Modelling	—Unverified	0
Fodor and Pylyshyn's Legacy -- Still No Human-like Systematic Compositionality in Neural Networks	Jun 2, 2025	Meta-Learning	—Unverified	0
Temporal Variational Implicit Neural Representations	Jun 2, 2025	ImputationMeta-Learning	—Unverified	0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists	Jun 2, 2025	BenchmarkingForm	—Unverified	0
WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks	Jun 2, 2025	Large Language ModelMathematical Reasoning	—Unverified	0
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes	Jun 2, 2025	Scene Understanding	—Unverified	0
Implicit Deformable Medical Image Registration with Learnable Kernels	Jun 2, 2025	Deformable Medical Image RegistrationImage Registration	—Unverified	0
Adversarial learning for nonparametric regression: Minimax rate and adaptive estimation	Jun 2, 2025	regression	—Unverified	0
TSRating: Rating Quality of Diverse Time Series Data by Meta-learning from LLM Judgment	Jun 2, 2025	Meta-LearningTime Series	—Unverified	0
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding	Jun 2, 2025	3D GenerationLarge Language Model	CodeCode Available	4
OD3: Optimization-free Dataset Distillation for Object Detection	Jun 2, 2025	Dataset Distillationimage-classification	CodeCode Available	1
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost	Jun 2, 2025	Image SegmentationSemantic Segmentation	CodeCode Available	1
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning	Jun 2, 2025	MathMathematical Reasoning	CodeCode Available	2
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency	Jun 2, 2025	reinforcement-learningReinforcement Learning	CodeCode Available	2
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis	Jun 2, 2025	8kMath	—Unverified	0
Optimization Strategies for Variational Quantum Algorithms in Noisy Landscapes	Jun 2, 2025	NavigateQuantum Machine Learning	—Unverified	0
A 2-Stage Model for Vehicle Class and Orientation Detection with Photo-Realistic Image Generation	Jun 2, 2025	Image Generation	—Unverified	0
Stop Chasing the C-index: This Is How We Should Evaluate Our Survival Models	Jun 2, 2025	Survival Analysis	—Unverified	0
ReconXF: Graph Reconstruction Attack via Public Feature Explanations on Privatized Node Features and Labels	Jun 2, 2025	DenoisingGraph Reconstruction	—Unverified	0
ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code	Jun 2, 2025	BenchmarkingCode Generation	—Unverified	0
Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation	Jun 2, 2025	Image GenerationText to Image Generation	—Unverified	0
Large Language Models for EEG: A Comprehensive Survey and Taxonomy	Jun 2, 2025	DiagnosticEEG	—Unverified	0
Exploring the Potential of LLMs as Personalized Assistants: Dataset, Evaluation, and Analysis	Jun 2, 2025		CodeCode Available	1
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation	Jun 2, 2025	4kDescriptive	CodeCode Available	3
RewardBench 2: Advancing Reward Model Evaluation	Jun 2, 2025	Instruction Followingmodel	CodeCode Available	4
OmniV2V: Versatile Video Generation and Editing via Dynamic Content Manipulation	Jun 2, 2025	Data AugmentationHuman Animation	CodeCode Available	5
Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment	Jun 2, 2025		CodeCode Available	0
Propaganda and Information Dissemination in the Russo-Ukrainian War: Natural Language Processing of Russian and Western Twitter Narratives	Jun 2, 2025	Humanitarian	—Unverified	0
Balancing Beyond Discrete Categories: Continuous Demographic Labels for Fair Face Recognition	Jun 2, 2025	Face Recognition	—Unverified	0