The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 18051–18100 of 474278 papers

Title	Date	Tasks	Status	Hype
Neurosymbolic Artificial Intelligence for Robust Network Intrusion Detection: From Scratch to Transfer Learning	Jun 4, 2025	ClusteringIntrusion Detection	—Unverified	0
OpenThoughts: Data Recipes for Reasoning Models	Jun 4, 2025	Math	CodeCode Available	7
Training-free AI for Earth Observation Change Detection using Physics Aware Neuromorphic Networks	Jun 4, 2025	Change DetectionEarth Observation	—Unverified	0
Multiscale guidance of AlphaFold3 with heterogeneous cryo-EM data	Jun 4, 2025	DiversityPrediction	—Unverified	0
Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order	Jun 4, 2025	parameter-efficient fine-tuning	CodeCode Available	0
Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons	Jun 4, 2025	Machine Translation	—Unverified	0
SF^2Bench: Evaluating Data-Driven Models for Compound Flood Forecasting in South Florida	Jun 4, 2025	Computational Efficiency	—Unverified	0
Softlog-Softmax Layers and Divergences Contribute to a Computationally Dependable Ensemble Learning	Jun 4, 2025	DiversityEnsemble Learning	—Unverified	0
A Statistical Physics of Language Model Reasoning	Jun 4, 2025	Language ModelingLanguage Modelling	—Unverified	0
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models	Jun 4, 2025	BenchmarkingGeneral Knowledge	CodeCode Available	0
Behavioural vs. Representational Systematicity in End-to-End Models: An Opinionated Survey	Jun 4, 2025	Systematic Generalization	—Unverified	0
An AI-Based Public Health Data Monitoring System	Jun 4, 2025	Anomaly DetectionDecision Making	—Unverified	0
Even Faster Hyperbolic Random Forests: A Beltrami-Klein Wrapper Approach	Jun 4, 2025		CodeCode Available	1
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection	Jun 4, 2025	MambaNovel Object Detection	—Unverified	0
FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers	Jun 4, 2025	Video EditingVideo Generation	—Unverified	0
HUMOF: Human Motion Forecasting in Interactive Social Scenes	Jun 4, 2025	Motion Forecastingmotion prediction	—Unverified	0
Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model	Jun 4, 2025	Language ModelingLanguage Modelling	—Unverified	0
Rectified Sparse Attention	Jun 4, 2025	Language ModelingLanguage Modelling	—Unverified	0
From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation	Jun 4, 2025	NavigatePharmacovigilance	—Unverified	0
Pseudo-Simulation for Autonomous Driving	Jun 4, 2025	Autonomous DrivingAutonomous Vehicles	CodeCode Available	4
"Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation	Jun 4, 2025	Language ModelingLanguage Modelling	—Unverified	0
Finding signatures of low-dimensional geometric landscapes in high-dimensional cell fate transitions	Jun 4, 2025	Decision Making	CodeCode Available	0
MFLA: Monotonic Finite Look-ahead Attention for Streaming Speech Recognition	Jun 4, 2025	speech-recognitionSpeech Recognition	—Unverified	0
Understanding Mental Models of Generative Conversational Search and The Effect of Interface Transparency	Jun 4, 2025	Conversational Search	—Unverified	0
Uniqueness of phase retrieval from offset linear canonical transform	Jun 4, 2025	Retrieval	—Unverified	0
Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems	Jun 4, 2025	Deep Reinforcement LearningFairness	—Unverified	0
Autonomous Collaborative Scheduling of Time-dependent UAVs, Workers and Vehicles for Crowdsensing in Disaster Response	Jun 4, 2025	Dimensionality ReductionDisaster Response	—Unverified	0
From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context	Jun 4, 2025	Code Generation	—Unverified	0
Sounding that Object: Interactive Object-Aware Image to Audio Generation	Jun 4, 2025	Audio GenerationImage Segmentation	—Unverified	0
IntLevPy: A Python library to classify and model intermittent and Lévy processes	Jun 4, 2025	parameter estimation	—Unverified	0
Solving engineering eigenvalue problems with neural networks using the Rayleigh quotient	Jun 4, 2025	Physics-informed machine learning	—Unverified	0
CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking	Jun 4, 2025	BenchmarkingCode Generation	—Unverified	0
Unsupervised Meta-Testing with Conditional Neural Processes for Hybrid Meta-Reinforcement Learning	Jun 4, 2025	continuous-controlContinuous Control	—Unverified	0
Object-centric 3D Motion Field for Robot Learning from Human Videos	Jun 4, 2025	DenoisingMotion Estimation	—Unverified	0
SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL	Jun 4, 2025	DisentanglementIndustrial Robots	—Unverified	0
Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration	Jun 4, 2025	Autonomous DrivingDeep Reinforcement Learning	—Unverified	0
Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation	Jun 4, 2025	Collision AvoidanceVisual Navigation	—Unverified	0
Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions	Jun 4, 2025	Motion PlanningTask and Motion Planning	—Unverified	0
SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models	Jun 4, 2025	Object	—Unverified	0
Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion	Jun 4, 2025	DisentanglementStyle Transfer	—Unverified	0
Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR	Jun 4, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions	Jun 4, 2025	Data AugmentationDiversity	—Unverified	0
Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering	Jun 4, 2025	DecoderDomain Adaptation	—Unverified	0
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing	Jun 4, 2025	Quantizationtext-to-speech	—Unverified	0
Generating Automotive Code: Large Language Models for Software Development and Verification in Safety-Critical Systems	Jun 4, 2025	BenchmarkingCode Generation	—Unverified	0
VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation	Jun 4, 2025	Code Generation	—Unverified	0
An Improved Finite Element Modeling Method for Triply Periodic Minimal Surface Structures Based on Element Size and Minimum Jacobian	Jun 4, 2025	Computational Efficiency	—Unverified	0
Discrete Element Parameter Calibration of Livestock Salt Based on Particle Scaling	Jun 4, 2025	Friction	—Unverified	0
Topology-Aware Graph Neural Network-based State Estimation for PMU-Unobservable Power Systems	Jun 4, 2025	Graph AttentionGraph Neural Network	—Unverified	0
BridgeNet: A Hybrid, Physics-Informed Machine Learning Framework for Solving High-Dimensional Fokker-Planck Equations	Jun 4, 2025	Physics-informed machine learning	—Unverified	0