The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 20651–20700 of 474278 papers

Title	Date	Tasks	Status	Hype
Diffusion Auto-regressive Transformer for Effective Self-supervised Time Series Forecasting	Oct 8, 2024	DecoderDenoising	CodeCode Available	1
Batched Bayesian optimization by maximizing the probability of including the optimum	Oct 8, 2024	Bayesian OptimizationDiversity	CodeCode Available	1
Evaluating Performance and Bias of Negative Sampling in Large-Scale Sequential Recommendation Models	Oct 8, 2024	Hyperparameter OptimizationSequential Recommendation	CodeCode Available	1
Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling	Oct 8, 2024	Inductive Bias	CodeCode Available	1
Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration	Oct 8, 2024	Graph Neural NetworkPoint Cloud Registration	CodeCode Available	1
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing	Oct 8, 2024	Image Manipulation	CodeCode Available	1
Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes	Oct 8, 2024	ArticlesClassification	CodeCode Available	1
GlucoBench: Curated List of Continuous Glucose Monitoring Datasets with Prediction Benchmarks	Oct 8, 2024	ManagementTrajectory Prediction	CodeCode Available	1
FACMIC: Federated Adaptative CLIP Model for Medical Image Classification	Oct 8, 2024	Domain AdaptationFederated Learning	CodeCode Available	1
Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series	Oct 8, 2024	Computational EfficiencyIrregular Time Series	CodeCode Available	1
Feature Selection Gates with Gradient Routing for Endoscopic Image Computing	Oct 7, 2024	Binary ClassificationColorectal Polyps Characterization	CodeCode Available	1
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification	Oct 7, 2024	image-classificationImage Classification	CodeCode Available	1
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise	Oct 7, 2024	AttributeDenoising	CodeCode Available	1
SePPO: Semi-Policy Preference Optimization for Diffusion Alignment	Oct 7, 2024	Model Selection	CodeCode Available	1
Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild	Oct 7, 2024	BenchmarkingMixture-of-Experts	CodeCode Available	1
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency	Oct 7, 2024	Attribute	CodeCode Available	1
ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering	Oct 7, 2024	Question AnsweringRetrieval	CodeCode Available	1
ImProver: Agent-Based Automated Proof Optimization	Oct 7, 2024	Language ModellingLarge Language Model	CodeCode Available	1
Collaboration! Towards Robust Neural Methods for Routing Problems	Oct 7, 2024	Out-of-Distribution Generalization	CodeCode Available	1
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality	Oct 7, 2024		CodeCode Available	1
Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors	Oct 7, 2024	Object	CodeCode Available	1
Can LLMs Understand Time Series Anomalies?	Oct 7, 2024	Anomaly DetectionTime Series	CodeCode Available	1
Neural Fourier Modelling: A Highly Compact Approach to Time-Series Analysis	Oct 7, 2024	16kAnomaly Detection	CodeCode Available	1
Fast Training of Sinusoidal Neural Fields via Scaling Initialization	Oct 7, 2024		CodeCode Available	1
Continuous Ensemble Weather Forecasting with Diffusion models	Oct 7, 2024	Weather Forecasting	CodeCode Available	1
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models	Oct 7, 2024	GSM8KLogical Reasoning	CodeCode Available	1
Fine-Tuning CLIP's Last Visual Projector: A Few-Shot Cornucopia	Oct 7, 2024	Domain GeneralizationLanguage Modeling	CodeCode Available	1
RoWeeder: Unsupervised Weed Mapping through Crop-Row Detection	Oct 7, 2024	Deep LearningManagement	CodeCode Available	1
What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs	Oct 7, 2024		CodeCode Available	1
PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud Fusion	Oct 7, 2024	Autonomous Driving	CodeCode Available	1
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation	Oct 7, 2024		CodeCode Available	1
MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain	Oct 7, 2024	AttributeMetric Learning	CodeCode Available	1
DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration	Oct 7, 2024	DenoisingImage Denoising	CodeCode Available	1
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models	Oct 7, 2024	Question AnsweringVisual Question Answering	CodeCode Available	1
Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality	Oct 7, 2024	Video Generation	CodeCode Available	1
Refining Counterfactual Explanations With Joint-Distribution-Informed Shapley Towards Actionable Minimality	Oct 7, 2024	counterfactual	CodeCode Available	1
MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks	Oct 7, 2024		CodeCode Available	1
Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context Models	Oct 7, 2024	Memorization	CodeCode Available	1
A Recipe For Building a Compliant Real Estate Chatbot	Oct 7, 2024	ChatbotInstruction Following	CodeCode Available	1
Enhanced Super-Resolution Training via Mimicked Alignment for Real-World Scenes	Oct 7, 2024	Image Super-ResolutionSuper-Resolution	CodeCode Available	1
NeuroBOLT: Resting-state EEG-to-fMRI Synthesis with Multi-dimensional Feature Mapping	Oct 7, 2024	Brain DecodingEEG	CodeCode Available	1
R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?	Oct 7, 2024		CodeCode Available	1
PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing	Oct 7, 2024	GPU	CodeCode Available	1
D-PoSE: Depth as an Intermediate Representation for 3D Human Pose and Shape Estimation	Oct 7, 2024	3D human pose and shape estimation	CodeCode Available	1
Hyper-Representations: Learning from Populations of Neural Networks	Oct 7, 2024	Representation LearningTransfer Learning	CodeCode Available	1
Unsupervised Representation Learning from Sparse Transformation Analysis	Oct 7, 2024	Representation Learning	CodeCode Available	1
Spatio-Temporal 3D Point Clouds from WiFi-CSI Data via Transformer Networks	Oct 7, 2024	multimodal interaction	CodeCode Available	1
CogDevelop2K: Reversed Cognitive Development in Multimodal Large Language Models	Oct 6, 2024		CodeCode Available	1
Towards Secure Tuning: Mitigating Security Risks Arising from Benign Instruction Fine-Tuning	Oct 6, 2024		CodeCode Available	1
Algorithmic Capabilities of Random Transformers	Oct 6, 2024	Text Generation	CodeCode Available	1