The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 15,501–15,550 of 474,278 papers

Title	Date	Tasks	Status	Hype
Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems	Jun 16, 2025	Decoder, Language Modeling	Unverified	0
Logical Expressiveness of Graph Neural Networks with Hierarchical Node Individualization	Jun 16, 2025	Isomorphism Testing	Code Available	0
Delving Into the Psychology of Machines: Exploring the Structure of Self-Regulated Learning via LLM-Generated Survey Responses	Jun 16, 2025	Survey	Unverified	0
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning	Jun 16, 2025	Action Generation, Autonomous Driving	Code Available	3
Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability	Jun 16, 2025	Classification, Contrastive Learning	Code Available	0
A Survey on World Models Grounded in Acoustic Physical Information	Jun 16, 2025	Autonomous Driving, Survey	Code Available	0
Fake it till You Make it: Reward Modeling as Discriminative Prediction	Jun 16, 2025		Unverified	0
Towards Pervasive Distributed Agentic Generative AI -- A State of The Art	Jun 16, 2025	Natural Language Understanding, Survey	Unverified	0
OPTIMUS: Observing Persistent Transformations in Multi-temporal Unlabeled Satellite-data	Jun 16, 2025	Change Point Detection, Self-Supervised Learning	Unverified	0
GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining	Jun 16, 2025	Denoising, Language Modeling	Unverified	0
Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models	Jun 16, 2025	Math	Unverified	0
Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes	Jun 16, 2025	Reinforcement Learning (RL)	Unverified	0
A Survey on Imitation Learning for Contact-Rich Tasks in Robotics	Jun 16, 2025	Contact-rich Manipulation, Imitation Learning	Unverified	0
From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars	Jun 16, 2025	GPU, Speech Synthesis	Unverified	0
SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists	Jun 16, 2025	Fact Checking, TAG	Unverified	0
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders	Jun 16, 2025		Unverified	0
DoA Estimation using MUSIC with Range/Doppler Multiplexing for MIMO-OFDM Radar	Jun 16, 2025	parameter estimation, Super-Resolution	Unverified	0
Stability Analysis of Physics-Informed Neural Networks via Variational Coercivity, Perturbation Bounds, and Concentration Estimates	Jun 16, 2025	Generalization Bounds	Unverified	0
Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management	Jun 16, 2025	Management, Multi-Objective Reinforcement Learning	Unverified	0
IKDiffuser: A Generative Inverse Kinematics Solver for Multi-arm Robots via Diffusion Model	Jun 16, 2025	Computational Efficiency, Diversity	Unverified	0
ROSA: Harnessing Robot States for Vision-Language and Action Alignment	Jun 16, 2025	State Estimation, Vision-Language-Action	Unverified	0
Agent Capability Negotiation and Binding Protocol (ACNBP)	Jun 16, 2025	Document Translation	Code Available	0
TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting	Jun 16, 2025	GPU, Inverse Rendering	Code Available	0
Polyra Swarms: A Shape-Based Approach to Machine Learning	Jun 16, 2025	Anomaly Detection	Unverified	0
JENGA: Object selection and pose estimation for robotic grasping from a stack	Jun 16, 2025	Benchmarking, Object	Unverified	0
VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models	Jun 16, 2025	Computational Efficiency, Missing Values	Unverified	0
Block-wise Adaptive Caching for Accelerating Diffusion Policy	Jun 16, 2025	Action Generation, Denoising	Unverified	0
FrontendBench: A Benchmark for Evaluating LLMs on Front-End Development via Automatic Evaluation	Jun 16, 2025	Code Generation	Unverified	0
Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models	Jun 16, 2025	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning	Jun 16, 2025	Deep Learning, Graph structure learning	Code Available	1
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model	Jun 16, 2025	Large Language Model, multimodal interaction	Code Available	5
Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble	Jun 16, 2025	Machine Unlearning	Code Available	1
ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching	Jun 16, 2025	Decoder, Speech Synthesis	Code Available	4
SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure	Jun 16, 2025	Simultaneous Localization and Mapping	Code Available	2
Global Convergence of Adjoint-Optimized Neural PDEs	Jun 16, 2025		Code Available	0
EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization	Jun 16, 2025	Mixture-of-Experts, Model Compression	Code Available	0
Probing Deep into Temporal Profile Makes the Infrared Small Target Detector Much Better	Jun 15, 2025	Anomaly Detection	Code Available	1
SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction	Jun 15, 2025	3D Human Reconstruction, 3D Reconstruction	Unverified	0
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies	Jun 15, 2025	Benchmarking	Code Available	1
Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context	Jun 15, 2025	image-classification, Image Classification	Code Available	0
Dynamic Scheduling for Enhanced Performance in RIS-assisted Cooperative Network with Interference	Jun 15, 2025	Management, Scheduling	Unverified	0
Effect Decomposition of Functional-Output Computer Experiments via Orthogonal Additive Gaussian Processes	Jun 15, 2025	Gaussian Processes, Sensitivity	Unverified	0
PDCNet: a benchmark and general deep learning framework for activity prediction of peptide-drug conjugates	Jun 15, 2025	Activity Prediction	Unverified	0
MORIC: CSI Delay-Doppler Decomposition for Robust Wi-Fi-based Human Activity Recognition	Jun 15, 2025	Activity Recognition, Human Activity Recognition	Unverified	0
Improving spliced alignment by modeling splice sites with deep learning	Jun 15, 2025		Code Available	2
Uncovering Social Network Activity Using Joint User and Topic Interaction	Jun 15, 2025	Point Processes	Unverified	0
KCLNet: Physics-Informed Power Flow Prediction via Constraints Projections	Jun 15, 2025	Graph Neural Network, Prediction	Unverified	0
Nonlinear Model Order Reduction of Dynamical Systems in Process Engineering: Review and Comparison	Jun 15, 2025	Chemical Process	Unverified	0
GM-LDM: Latent Diffusion Model for Brain Biomarker Identification through Functional Data-Driven Gray Matter Synthesis	Jun 15, 2025	Decoder, Denoising	Unverified	0
Predicting Genetic Mutations from Single-Cell Bone Marrow Images in Acute Myeloid Leukemia Using Noise-Robust Deep Learning Models	Jun 15, 2025	Diagnostic	Unverified	0