The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers


Showing 15,101–15,150 of 474,278 papers

Title	Date	Tasks	Status	Hype
The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products	Jun 16, 2025	Benchmarking	Code Available	1
Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images	Jun 16, 2025	Depth Estimation, Self-Supervised Learning	Code Available	1
Tady: A Neural Disassembler without Structural Constraint Violations	Jun 16, 2025		Code Available	1
Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers	Jun 16, 2025	Fact Checking, Fact Verification	Code Available	1
Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs	Jun 16, 2025	Machine Unlearning	Code Available	1
Probing Deep into Temporal Profile Makes the Infrared Small Target Detector Much Better	Jun 15, 2025	Anomaly Detection	Code Available	1
TCANet: A Temporal Convolutional Attention Network for Motor Imagery EEG Decoding	Jun 15, 2025	Brain Computer Interface, EEG	Code Available	1
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies	Jun 15, 2025	Benchmarking	Code Available	1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation	Jun 15, 2025	Object, Semantic Segmentation	Code Available	1
SmartHome-Bench: A Comprehensive Benchmark for Video Anomaly Detection in Smart Homes Using Multi-Modal Large Language Models	Jun 15, 2025	Anomaly Detection, Video Anomaly Detection	Code Available	1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks	Jun 14, 2025	Language Modeling, Language Modelling	Code Available	1
BSA: Ball Sparse Attention for Large-scale Geometries	Jun 14, 2025		Code Available	1
Domain Generalization for Person Re-identification: A Survey Towards Domain-Agnostic Person Matching	Jun 14, 2025	Domain Generalization, Person Re-Identification	Code Available	1
Real-Time Per-Garment Virtual Try-On with Temporal Consistency for Loose-Fitting Garments	Jun 14, 2025	Dataset Generation, Virtual Try-on	Code Available	1
Vectorized Sparse Second-Order Forward Automatic Differentiation for Optimal Control Direct Methods	Jun 13, 2025	Computational Efficiency	Code Available	1
Schema-R1: A reasoning training approach for schema linking in Text-to-SQL Task	Jun 13, 2025	reinforcement-learning, Reinforcement Learning	Code Available	1
Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution	Jun 13, 2025	Image Super-Resolution, Mixture-of-Experts	Code Available	1
Dynamic Grid Trading Strategy: From Zero Expectation to Market Outperformance	Jun 13, 2025		Code Available	1
Self-supervised Learning of Echocardiographic Video Representations via Online Cluster Distillation	Jun 13, 2025	Anomaly Detection, Clustering	Code Available	1
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models	Jun 13, 2025	All, Hallucination	Code Available	1
Recursive KalmanNet: Deep Learning-Augmented Kalman Filtering for State Estimation with Consistent Uncertainty Quantification	Jun 13, 2025	State Estimation, Uncertainty Quantification	Code Available	1
DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents	Jun 13, 2025		Code Available	1
SIMSHIFT: A Benchmark for Adapting Neural Surrogates to Distribution Shifts	Jun 13, 2025	Domain Adaptation	Code Available	1
PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification	Jun 13, 2025	Code Generation, In-Context Learning	Code Available	1
ICME 2025 Grand Challenge on Video Super-Resolution for Video Conferencing	Jun 13, 2025	Super-Resolution, Video Super-Resolution	Code Available	1
Visual Pre-Training on Unlabeled Images using Reinforcement Learning	Jun 13, 2025	reinforcement-learning, Reinforcement Learning	Code Available	1
Diffusion-Based Electrocardiography Noise Quantification via Anomaly Detection	Jun 13, 2025	Anomaly Detection, Decision Making	Code Available	1
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation	Jun 13, 2025	Model Compression, Quantization	Code Available	1
Dual‑detector Re‑optimization for Federated Weakly Supervised Video Anomaly Detection Via Adaptive Dynamic Recursive Mapping	Jun 13, 2025	Anomaly Detection, Anomaly Detection In Surveillance Videos	Code Available	1
SoK: Evaluating Jailbreak Guardrails for Large Language Models	Jun 12, 2025		Code Available	1
Probably Approximately Correct Labels	Jun 12, 2025	Protein Folding, text annotation	Code Available	1
Low-Barrier Dataset Collection with Real Human Body for Interactive Per-Garment Virtual Try-On	Jun 12, 2025	Virtual Try-on	Code Available	1
PyLO: Towards Accessible Learned Optimizers in PyTorch	Jun 12, 2025		Code Available	1
A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon	Jun 12, 2025	Large Language Model, Starcraft	Code Available	1
ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change Queries	Jun 12, 2025	scientific discovery	Code Available	1
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection	Jun 12, 2025	object-detection, Object Detection	Code Available	1
TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora	Jun 12, 2025	General Knowledge	Code Available	1
SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks	Jun 12, 2025		Code Available	1
Hessian Geometry of Latent Space in Generative Models	Jun 12, 2025		Code Available	1
BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP	Jun 12, 2025	Decoder, Domain Adaptation	Code Available	1
Principled Approaches for Extending Neural Architectures to Function Spaces for Operator Learning	Jun 12, 2025	Operator learning	Code Available	1
Towards Robust Multimodal Emotion Recognition under Missing Modalities and Distribution Shifts	Jun 12, 2025	Causal Inference, counterfactual	Code Available	1
DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Transformer and Mamba	Jun 12, 2025	Mamba	Code Available	1
It's Not the Target, It's the Background: Rethinking Infrared Small Target Detection via Deep Patch-Free Low-Rank Representations	Jun 12, 2025	Computational Efficiency	Code Available	1
Farseer: A Refined Scaling Law in Large Language Models	Jun 12, 2025	GPU	Code Available	1
Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles	Jun 12, 2025		Code Available	1
NoLoCo: No-all-reduce Low Communication Training Method for Large Models	Jun 12, 2025	All, Blocking	Code Available	1
Anti-Aliased 2D Gaussian Splatting	Jun 12, 2025	3DGS, Novel View Synthesis	Code Available	1
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier	Jun 12, 2025	Natural Questions, RAG	Code Available	1
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization	Jun 12, 2025	Dictionary Learning	Code Available	1