The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Recently Added | Most Hyped | Most Active | Needs Verification | Most Verified

Showing 19,951–20,000 of 474,278 papers

Title	Date	Tasks	Status	Hype
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models	Oct 31, 2024		Code Available	1
AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and Validation	Oct 31, 2024	Code Translation, Translation	Code Available	1
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction	Oct 31, 2024	Disaster Response, Language Modeling	Code Available	1
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation	Oct 31, 2024	Image Segmentation, Mamba	Code Available	1
Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking	Oct 31, 2024	Benchmarking, Imputation	Code Available	1
Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?	Oct 31, 2024	Denoising, In-Context Learning	Code Available	1
Prospective Learning: Learning for a Dynamic Future	Oct 31, 2024	PAC learning	Code Available	1
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages	Oct 31, 2024	Language Identification	Code Available	1
Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning	Oct 30, 2024	In-Context Learning, Language Modeling	Code Available	1
A Walsh Hadamard Derived Linear Vector Symbolic Architecture	Oct 30, 2024	Computational Efficiency	Code Available	1
DataRec: A Python Library for Standardized and Reproducible Data Management in Recommender Systems	Oct 30, 2024	Benchmarking, Management	Code Available	1
LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM	Oct 30, 2024	Simultaneous Localization and Mapping, Visual Odometry	Code Available	1
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models	Oct 30, 2024	Video Understanding	Code Available	1
DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch Inference	Oct 30, 2024	Data Augmentation	Code Available	1
Diceplot: A package for high dimensional categorical data visualization	Oct 30, 2024	Data Visualization	Code Available	1
EchoFM: Foundation Model for Generalizable Echocardiogram Analysis	Oct 30, 2024	Contrastive Learning, model	Code Available	1
FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images	Oct 30, 2024	Face Swapping	Code Available	1
Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting	Oct 30, 2024		Code Available	1
Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists	Oct 30, 2024	Feature Engineering	Code Available	1
Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning	Oct 30, 2024	Computational Efficiency, In-Context Learning	Code Available	1
CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense	Oct 30, 2024	Adversarial Defense, Disentanglement	Code Available	1
Effective and Efficient Adversarial Detection for Vision-Language Models via A Single Vector	Oct 30, 2024		Code Available	1
DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET	Oct 30, 2024		Code Available	1
SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion	Oct 30, 2024	Infrared And Visible Image Fusion	Code Available	1
When can classical neural networks represent quantum states?	Oct 30, 2024		Code Available	1
Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval	Oct 30, 2024	RAG, Response Generation	Code Available	1
TPP-Gaze: Modelling Gaze Dynamics in Space and Time with Neural Temporal Point Processes	Oct 30, 2024	Point Processes	Code Available	1
Simulation-Free Training of Neural ODEs on Paired Data	Oct 30, 2024		Code Available	1
DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data	Oct 30, 2024	Decision Making, Imputation	Code Available	1
DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of Plasticity	Oct 30, 2024	Memorization	Code Available	1
bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction	Oct 30, 2024	Denoising, Video Reconstruction	Code Available	1
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback	Oct 30, 2024	Decision Making, Language Modeling	Code Available	1
Is Function Similarity Over-Engineered? Building a Benchmark	Oct 30, 2024	Malware Analysis, Vulnerability Detection	Code Available	1
FlexTSF: A Universal Forecasting Model for Time Series with Variable Regularities	Oct 30, 2024	Irregular Time Series, Missing Values	Code Available	1
High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer	Oct 30, 2024	Document Enhancement, Feature Importance	Code Available	1
Survey of Cultural Awareness in Language Models: Text and Beyond	Oct 30, 2024	Benchmarking	Code Available	1
WaveRoRA: Wavelet Rotary Route Attention for Multivariate Time Series Forecasting	Oct 30, 2024	Multivariate Time Series Forecasting, Time Series	Code Available	1
SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark	Oct 30, 2024	6D Pose Estimation, Pose Estimation	Code Available	1
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation	Oct 29, 2024	Cross-Domain Few-Shot, Few-Shot Semantic Segmentation	Code Available	1
Solving Epistemic Logic Programs using Generate-and-Test with Propagation	Oct 29, 2024		Code Available	1
Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models	Oct 29, 2024	Adversarial Robustness, All	Code Available	1
Embedding-based classifiers can detect prompt injection attacks	Oct 29, 2024		Code Available	1
Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images	Oct 29, 2024	Image Generation, Super-Resolution	Code Available	1
SimRec: Mitigating the Cold-Start Problem in Sequential Recommendation by Integrating Item Similarity	Oct 29, 2024	Recommendation Systems, Sequential Recommendation	Code Available	1
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices	Oct 29, 2024	Object, object-detection	Code Available	1
SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection	Oct 29, 2024	Diagnostic, Lesion Segmentation	Code Available	1
f-PO: Generalizing Preference Optimization with f-divergence Minimization	Oct 29, 2024	Language Modeling, Language Modelling	Code Available	1
Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention	Oct 29, 2024	Object	Code Available	1
An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion	Oct 29, 2024	Autonomous Driving, Autonomous Vehicles	Code Available	1
EconoJax: A Fast & Scalable Economic Simulation in Jax	Oct 29, 2024	reinforcement-learning, Reinforcement Learning	Code Available	1