The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 14751–14800 of 474278 papers

Title	Date	Tasks	Status	Hype
Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases	Jun 19, 2025	Privacy Preserving	—Unverified	0
Automatic Speech Recognition Biases in Newcastle English: an Error Analysis	Jun 19, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Relational Deep Learning: Challenges, Foundations and Next-Generation Architectures	Jun 19, 2025	Deep LearningFeature Engineering	—Unverified	0
Bayesian Epistemology with Weighted Authority: A Formal Architecture for Truth-Promoting Autonomous Scientific Reasoning	Jun 19, 2025	Bayesian Inference	—Unverified	0
FlatCAD: Fast Curvature Regularization of Neural SDFs for CAD Models	Jun 19, 2025	GPU	—Unverified	0
Solving Zero-Sum Convex Markov Games	Jun 19, 2025	Policy Gradient Methods	—Unverified	0
Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues	Jun 19, 2025	AI AgentCausal Discovery	—Unverified	0
The Role of Explanation Styles and Perceived Accuracy on Decision Making in Predictive Process Monitoring	Jun 19, 2025	counterfactualDecision Making	—Unverified	0
Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support	Jun 19, 2025	Large Language ModelSentence Embeddings	—Unverified	0
SEP-GCN: Leveraging Similar Edge Pairs with Temporal and Spatial Contexts for Location-Based Recommender Systems	Jun 19, 2025	Recommendation Systems	—Unverified	0
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View	Jun 19, 2025	Multimodal Reasoning	—Unverified	0
Adaptive Social Metaverse Streaming based on Federated Multi-Agent Deep Reinforcement Learning	Jun 19, 2025	Deep Reinforcement LearningFederated Learning	—Unverified	0
Fine-grained Image Retrieval via Dual-Vision Adaptation	Jun 19, 2025	Image RetrievalKnowledge Distillation	—Unverified	0
Reimagination with Test-time Observation Interventions: Distractor-Robust World Model Predictions for Visual Model Predictive Control	Jun 19, 2025	modelModel Predictive Control	—Unverified	0
CapsDT: Diffusion-Transformer for Capsule Robot Manipulation	Jun 19, 2025	DiagnosticRobot Manipulation	—Unverified	0
Spatially-Aware Evaluation of Segmentation Uncertainty	Jun 19, 2025	Segmentation	—Unverified	0
Multi-use LLM Watermarking and the False Detection Problem	Jun 19, 2025	User Identification	—Unverified	0
Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement	Jun 19, 2025	text-to-speechText to Speech	—Unverified	0
Reproducible Evaluation of Camera Auto-Exposure Methods in the Field: Platform, Benchmark and Lessons Learned	Jun 19, 2025	Pose Estimation	CodeCode Available	0
Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation	Jun 19, 2025	Adversarial AttackRobot Navigation	CodeCode Available	1
Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering	Jun 19, 2025	Segmentation	CodeCode Available	0
Beyond Audio and Pose: A General-Purpose Framework for Video Synchronization	Jun 19, 2025	Pose EstimationVideo Synchronization	CodeCode Available	0
Dense 3D Displacement Estimation for Landslide Monitoring via Fusion of TLS Point Clouds and Embedded RGB Images	Jun 19, 2025	3D geometry	CodeCode Available	1
TrainVerify: Equivalence-Based Verification for Distributed LLM Training	Jun 19, 2025	GPU	—Unverified	0
Double Entendre: Robust Audio-Based AI-Generated Lyrics Detection via Multi-View Fusion	Jun 19, 2025	Music Generation	CodeCode Available	0
On using AI for EEG-based BCI applications: problems, current challenges and future trends	Jun 19, 2025	EEG	CodeCode Available	1
PBFT-Backed Semantic Voting for Multi-Agent Memory Pruning	Jun 19, 2025		CodeCode Available	0
LLMs in Coding and their Impact on the Commercial Software Engineering Landscape	Jun 19, 2025	Language ModelingLanguage Modelling	—Unverified	0
Floating-Point Neural Networks Are Provably Robust Universal Approximators	Jun 19, 2025		CodeCode Available	0
Beyond Prediction -- Structuring Epistemic Integrity in Artificial Reasoning Systems	Jun 19, 2025	Knowledge Graphs	—Unverified	0
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation	Jun 19, 2025	Computational EfficiencyDenoising	—Unverified	0
Noise Fusion-based Distillation Learning for Anomaly Detection in Complex Industrial Environments	Jun 19, 2025	Anomaly Detection	—Unverified	0
Improved Intelligibility of Dysarthric Speech using Conditional Flow Matching	Jun 19, 2025	Self-Supervised Learning	—Unverified	0
BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios	Jun 19, 2025	Autonomous VehiclesDecision Making	—Unverified	0
Knee-Deep in C-RASP: A Transformer Depth Hierarchy	Jun 19, 2025		CodeCode Available	0
Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse	Jun 19, 2025	Sentiment Analysis	—Unverified	0
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models	Jun 19, 2025	Image GenerationQuantization	—Unverified	0
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems	Jun 19, 2025	BenchmarkingDescriptive	CodeCode Available	1
Malware Classification Leveraging NLP & Machine Learning for Enhanced Accuracy	Jun 19, 2025	Classificationfeature selection	CodeCode Available	0
Enhanced Dermatology Image Quality Assessment via Cross-Domain Training	Jun 19, 2025	Image Quality Assessment	—Unverified	0
Spotting tell-tale visual artifacts in face swapping videos: strengths and pitfalls of CNN detectors	Jun 19, 2025	BenchmarkingFace Swapping	—Unverified	0
On the Performance of Cyber-Biomedical Features for Intrusion Detection in Healthcare 5.0	Jun 19, 2025	Intrusion Detection	—Unverified	0
CodeDiffuser: Attention-Enhanced Diffusion Policy via VLM-Generated Code for Instruction Ambiguity	Jun 19, 2025	Action GenerationContact-rich Manipulation	—Unverified	0
Weight Factorization and Centralization for Continual Learning in Speech Recognition	Jun 19, 2025	Continual Learningspeech-recognition	—Unverified	0
EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training	Jun 19, 2025	Bandwidth ExtensionDenoising	—Unverified	0
Probing the Robustness of Large Language Models Safety to Latent Perturbations	Jun 19, 2025	DiagnosticSafety Alignment	CodeCode Available	1
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research	Jun 19, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
TrajSceneLLM: A Multimodal Perspective on Semantic GPS Trajectory Analysis	Jun 19, 2025	Temporal Sequences	CodeCode Available	0
Data-Agnostic Cardinality Learning from Imperfect Workloads	Jun 19, 2025		CodeCode Available	0
Empowering Graph-based Approximate Nearest Neighbor Search with Adaptive Awareness Capabilities	Jun 19, 2025	Contrastive LearningInformation Retrieval	—Unverified	0