The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 14901–14950 of 474278 papers

Title	Date	Tasks	Status	Hype
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence	Jun 18, 2025		—Unverified	0
MEGC2025: Micro-Expression Grand Challenge on Spot Then Recognize and Visual Question Answering	Jun 18, 2025	Multimodal ReasoningQuestion Answering	—Unverified	0
MSNeRV: Neural Video Representation with Multi-Scale Feature Fusion	Jun 18, 2025	DecoderVideo Compression	—Unverified	0
PRISM-Loc: a Lightweight Long-range LiDAR Localization in Urban Environments with Topological Maps	Jun 18, 2025	Pose Estimation	—Unverified	0
Context-Aware Deep Lagrangian Networks for Model Predictive Control	Jun 18, 2025	Model Predictive Control	—Unverified	0
Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation	Jun 18, 2025	Multi-Object TrackingObject Tracking	—Unverified	0
Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models	Jun 18, 2025	Music GenerationText-to-Music Generation	—Unverified	0
Factorized RVQ-GAN For Disentangled Speech Tokenization	Jun 18, 2025	DisentanglementKnowledge Distillation	—Unverified	0
Uncovering Intention through LLM-Driven Code Snippet Description Generation	Jun 18, 2025	Descriptive	—Unverified	0
Code Rate Optimization via Neural Polar Decoders	Jun 18, 2025	Capacity Estimation	—Unverified	0
One-shot Face Sketch Synthesis in the Wild via Generative Diffusion Prior and Instruction Tuning	Jun 18, 2025	Face Sketch Synthesis	CodeCode Available	0
ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression	Jun 18, 2025	Image Compression	CodeCode Available	0
Fair Contracts in Principal-Agent Games with Heterogeneous Types	Jun 18, 2025	Fairness	—Unverified	0
MAARTA:Multi-Agentic Adaptive Radiology Teaching Assistant	Jun 18, 2025	Diagnostic	—Unverified	0
Centroid Approximation for Byzantine-Tolerant Federated Learning	Jun 18, 2025	Distributed ComputingFederated Learning	—Unverified	0
RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments	Jun 18, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
All is Not Lost: LLM Recovery without Checkpoints	Jun 18, 2025	AllScheduling	CodeCode Available	1
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning	Jun 18, 2025	Caption GenerationDescriptive	CodeCode Available	2
Evaluation Pipeline for systematically searching for Anomaly Detection Systems	Jun 18, 2025	Anomaly Detection	—Unverified	0
Multi-Agent Reinforcement Learning for Autonomous Multi-Satellite Earth Observation: A Realistic Case Study	Jun 18, 2025	Earth ObservationManagement	—Unverified	0
PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction	Jun 18, 2025	Sentencetext-to-speech	—Unverified	0
video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models	Jun 18, 2025	Audio captioningLarge Language Model	CodeCode Available	2
Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach	Jun 18, 2025	Prompt EngineeringRetrieval	—Unverified	0
Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles	Jun 18, 2025	Autonomous DrivingAutonomous Vehicles	—Unverified	0
deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses	Jun 18, 2025	Large Language Model	—Unverified	0
PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning	Jun 18, 2025	LLM-generated Text DetectionMisinformation	—Unverified	0
Transit for All: Mapping Equitable Bike2Subway Connection using Region Representation Learning	Jun 18, 2025	AllRepresentation Learning	—Unverified	0
Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers	Jun 18, 2025	Chatbot	—Unverified	0
Accessible Gesture-Driven Augmented Reality Interaction System	Jun 18, 2025	Federated LearningGesture Recognition	—Unverified	0
An Empirical Study of Bugs in Data Visualization Libraries	Jun 18, 2025	Data VisualizationDecision Making	—Unverified	0
Steering Your Diffusion Policy with Latent Space Reinforcement Learning	Jun 18, 2025	reinforcement-learningReinforcement Learning	—Unverified	0
Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos	Jun 18, 2025	Object	—Unverified	0
RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation	Jun 18, 2025	Depth EstimationDepth Prediction	—Unverified	0
Model Predictive Path-Following Control for a Quadrotor	Jun 18, 2025	Model Predictive Control	—Unverified	0
MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System	Jun 18, 2025	ObjectObject SLAM	—Unverified	0
Correspondence-Free Multiview Point Cloud Registration via Depth-Guided Joint Optimisation	Jun 18, 2025	Point Cloud Registration	—Unverified	0
HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models	Jun 18, 2025	Hallucination	—Unverified	0
An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW	Jun 18, 2025	Optical Character RecognitionOptical Character Recognition (OCR)	—Unverified	0
PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning	Jun 18, 2025	Federated Learning	—Unverified	0
Veracity: An Open-Source AI Fact-Checking System	Jun 18, 2025	Fact CheckingMisinformation	—Unverified	0
In-Context Learning for Gradient-Free Receiver Adaptation: Principles, Applications, and Theory	Jun 18, 2025	In-Context LearningMeta-Learning	—Unverified	0
cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree	Jun 18, 2025	ChunkingCode Generation	CodeCode Available	2
I Know Which LLM Wrote Your Code Last Summer: LLM generated Code Stylometry for Authorship Attribution	Jun 18, 2025	Authorship AttributionBinary Classification	—Unverified	0
4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation	Jun 18, 2025	3D Reconstruction4D reconstruction	—Unverified	0
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding	Jun 18, 2025	GPUStreaming video understanding	—Unverified	0
LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning	Jun 18, 2025	Attribute	CodeCode Available	0
Show-o2: Improved Native Unified Multimodal Models	Jun 18, 2025	Language ModelingLanguage Modelling	CodeCode Available	5
Mix-of-Language-Experts Architecture for Multilingual Programming	Jun 18, 2025		CodeCode Available	0
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges	Jun 18, 2025	Combinatorial Optimization	CodeCode Available	2
Finance Language Model Evaluation (FLaME)	Jun 18, 2025	BenchmarkingLanguage Model Evaluation	—Unverified	0