The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 15151–15200 of 474278 papers

Title	Date	Tasks	Status	Hype
Excessive Reasoning Attack on Reasoning LLMs	Jun 17, 2025	GSM8K	—Unverified	0
LLM-Powered Intent-Based Categorization of Phishing Emails	Jun 17, 2025	Binary Classification	—Unverified	0
AIn't Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation	Jun 17, 2025	Survey	—Unverified	0
One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification	Jun 17, 2025	Domain AdaptationEdge-computing	—Unverified	0
Computational Studies in Influencer Marketing: A Systematic Literature Review	Jun 17, 2025	FairnessMarketing	—Unverified	0
One Size Fits None: Rethinking Fairness in Medical AI	Jun 17, 2025	Decision MakingFairness	—Unverified	0
A multi-stage augmented multimodal interaction network for fish feeding intensity quantification	Jun 17, 2025	Decision Makingmultimodal interaction	—Unverified	0
ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies	Jun 17, 2025	Scene GenerationSpatial Reasoning	—Unverified	0
ELLIS Alicante at CQs-Gen 2025: Winning the critical thinking questions shared task: LLM-based question generation and selection	Jun 17, 2025	Argument MiningQuestion Generation	—Unverified	0
ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations	Jun 17, 2025	Language ModelingLanguage Modelling	—Unverified	0
StorySage: Conversational Autobiography Writing Powered by a Multi-Agent Framework	Jun 17, 2025	Navigate	—Unverified	0
Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription	Jun 17, 2025	DecoderInformation Retrieval	—Unverified	0
CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion	Jun 17, 2025	Object Localization	—Unverified	0
Markov Regime-Switching Intelligent Driver Model for Interpretable Car-Following Behavior	Jun 17, 2025	Bayesian Inference	—Unverified	0
DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning	Jun 17, 2025	Autonomous NavigationDepth Estimation	—Unverified	0
SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning	Jun 17, 2025	Density EstimationRobot Manipulation	—Unverified	0
VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy	Jun 17, 2025	Decision MakingSemantic Segmentation	—Unverified	0
Adaptive Reinforcement Learning for Unobservable Random Delays	Jun 17, 2025	reinforcement-learningReinforcement Learning	—Unverified	0
AMPLIFY: Actionless Motion Priors for Robot Learning from Videos	Jun 17, 2025	motion predictionVideo Prediction	—Unverified	0
Hard Contacts with Soft Gradients: Refining Differentiable Simulators for Learning and Control	Jun 17, 2025	MuJoCo	—Unverified	0
GAF: Gaussian Action Field as a Dvnamic World Model for Robotic Mlanipulation	Jun 17, 2025	3DGS	—Unverified	0
Unifying Streaming and Non-streaming Zipformer-based ASR	Jun 17, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
InsertRank: LLMs can reason over BM25 scores to Improve Listwise Reranking	Jun 17, 2025	Information RetrievalReranking	—Unverified	0
RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition	Jun 17, 2025	Answer GenerationRAG	—Unverified	0
Similarity = Value? Consultation Value Assessment and Alignment for Personalized Search	Jun 17, 2025	Semantic SimilaritySemantic Textual Similarity	—Unverified	0
ImpReSS: Implicit Recommender System for Support Conversations	Jun 17, 2025	Recommendation Systems	—Unverified	0
A Vision for Geo-Temporal Deep Research Systems: Towards Comprehensive, Transparent, and Reproducible Geo-Temporal Information Synthesis	Jun 17, 2025	Retrieval	—Unverified	0
FADPNet: Frequency-Aware Dual-Path Network for Face Super-Resolution	Jun 17, 2025	MambaSuper-Resolution	—Unverified	0
Meta-SurDiff: Classification Diffusion Model Optimized by Meta Learning is Reliable for Online Surgical Phase Recognition	Jun 17, 2025	Meta-LearningOnline surgical phase recognition	—Unverified	0
HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction	Jun 17, 2025	3DGS3D Reconstruction	—Unverified	0
Unified Representation Space for 3D Visual Grounding	Jun 17, 2025	3D visual groundingContrastive Learning	—Unverified	0
Exploring Non-contrastive Self-supervised Representation Learning for Image-based Profiling	Jun 17, 2025	Data AugmentationDrug Discovery	—Unverified	0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment	Jun 17, 2025	Autonomous DrivingInstance Segmentation	—Unverified	0
Discrete JEPA: Learning Discrete Token Representations without Reconstruction	Jun 17, 2025	Logical Reasoning	—Unverified	0
DepthSeg: Depth prompting in remote sensing semantic segmentation	Jun 17, 2025	SegmentationSemantic Segmentation	—Unverified	0
GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion	Jun 17, 2025	Infrared And Visible Image FusionSemantic Similarity	—Unverified	0
Compositional Attribute Imbalance in Vision Datasets	Jun 17, 2025	AttributeData Augmentation	—Unverified	0
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models	Jun 17, 2025	Mixture-of-ExpertsQuantization	—Unverified	0
AsyncSwitch: Asynchronous Text-Speech Adaptation for Code-Switched ASR	Jun 17, 2025	Decoder	—Unverified	0
Enhancing Symbolic Machine Learning by Subsymbolic Representations	Jun 17, 2025	Tensor Networks	—Unverified	0
SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling	Jun 17, 2025	Music CaptioningMusic Modeling	—Unverified	0
Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios	Jun 17, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Comparison of Two Methods for Stationary Incident Detection Based on Background Image	Jun 17, 2025	object-detectionObject Detection	—Unverified	0
Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models	Jun 17, 2025		CodeCode Available	0
Busting the Paper Ballot: Voting Meets Adversarial Machine Learning	Jun 17, 2025		CodeCode Available	0
hyperFA*IR: A hypergeometric approach to fair rankings with finite candidate pool	Jun 17, 2025	Fairness	CodeCode Available	0
Déjà Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse	Jun 17, 2025		CodeCode Available	1
EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization	Jun 17, 2025	Multi-Instance RetrievalRetrieval	CodeCode Available	0
HydroChronos: Forecasting Decades of Surface Water Change	Jun 17, 2025	Change Detection	CodeCode Available	0
23 Ways to Contact How Do I Talk to Someone at Expedia®: A Step-by-Step Guide	Jun 17, 2025	NavigateTAG	—Unverified	0