The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 20,801–20,850 of 474,278 papers

Title	Date	Tasks	Status	Hype
EvoMesh: Adaptive Physical Simulation with Hierarchical Graph Evolutions	Oct 3, 2024		Code Available	1
RDEIC: Accelerating Diffusion-Based Extreme Image Compression with Relay Residual Diffusion	Oct 3, 2024	Denoising, Image Compression	Code Available	1
Encryption-Friendly LLM Architecture	Oct 3, 2024	Privacy Preserving	Code Available	1
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos	Oct 3, 2024	counterfactual	Code Available	1
Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo Matching	Oct 3, 2024	Stereo Matching	Code Available	1
Erasing Conceptual Knowledge from Language Models	Oct 3, 2024	Specificity, Text Generation	Code Available	1
Response Estimation and System Identification of Dynamical Systems via Physics-Informed Neural Networks	Oct 2, 2024	parameter estimation, State Estimation	Code Available	1
TorchSISSO: A PyTorch-Based Implementation of the Sure Independence Screening and Sparsifying Operator for Efficient and Interpretable Model Discovery	Oct 2, 2024	GPU, Model Discovery	Code Available	1
CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment	Oct 2, 2024	Astronomy, Image Quality Assessment	Code Available	1
Revisiting Hierarchical Text Classification: Inference and Metrics	Oct 2, 2024	Classification, text-classification	Code Available	1
OmniSR: Shadow Removal under Direct and Indirect Lighting	Oct 2, 2024	Image Shadow Removal, Shadow Removal	Code Available	1
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models	Oct 2, 2024	Image Generation	Code Available	1
PASS:Test-Time Prompting to Adapt Styles and Semantic Shapes in Medical Image Segmentation	Oct 2, 2024	Image Segmentation, Medical Image Segmentation	Code Available	1
Saliency-Guided DETR for Moment Retrieval and Highlight Detection	Oct 2, 2024	Highlight Detection, Moment Retrieval	Code Available	1
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices	Oct 2, 2024	GPU, Language Modeling	Code Available	1
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment	Oct 2, 2024	Self-Supervised Learning, zero-shot-classification	Code Available	1
Imaging foundation model for universal enhancement of non-ideal measurement CT	Oct 2, 2024	Medical Image Enhancement, parameter-efficient fine-tuning	Code Available	1
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits	Oct 2, 2024	Instruction Following, Math	Code Available	1
Explainable Earth Surface Forecasting under Extreme Events	Oct 2, 2024	counterfactual, Earth Observation	Code Available	1
Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models	Oct 2, 2024	feature selection	Code Available	1
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration	Oct 2, 2024	2k, Denoising	Code Available	1
Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling	Oct 2, 2024	Language Modeling, Language Modelling	Code Available	1
Question-guided Knowledge Graph Re-scoring and Injection for Knowledge Graph Question Answering	Oct 2, 2024	Graph Question Answering, Question Answering	Code Available	1
DeepProtein: Deep Learning Library and Benchmark for Protein Sequence Learning	Oct 2, 2024	Deep Learning, Drug Discovery	Code Available	1
Were RNNs All We Needed?	Oct 2, 2024	All, Mamba	Code Available	1
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition	Oct 2, 2024	Language Modeling, Language Modelling	Code Available	1
Text2PDE: Latent Diffusion Models for Accessible Physics Simulation	Oct 2, 2024		Code Available	1
VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models	Oct 2, 2024		Code Available	1
Positional Attention: Expressivity and Learnability of Algorithmic Computation	Oct 2, 2024	Out-of-Distribution Generalization	Code Available	1
A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles	Oct 2, 2024	object-detection, Object Detection	Code Available	1
ANTIPASTI: interpretable prediction of antibody binding affinity exploiting Normal Modes and Deep Learning	Oct 2, 2024		Code Available	1
UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction	Oct 2, 2024	3DGS	Code Available	1
TPP-LLM: Modeling Temporal Point Processes by Efficiently Fine-Tuning Large Language Models	Oct 2, 2024	Computational Efficiency, parameter-efficient fine-tuning	Code Available	1
Multi-Scale Fusion for Object Representation	Oct 2, 2024	Object	Code Available	1
Edge-preserving noise for diffusion models	Oct 2, 2024	Denoising, Image Generation	Code Available	1
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning	Oct 2, 2024	Self-Learning	Code Available	1
FlexLMM: a Nextflow linear mixed model framework for GWAS	Oct 2, 2024		Code Available	1
FactAlign: Long-form Factuality Alignment of Large Language Models	Oct 2, 2024	Form, Hallucination	Code Available	1
Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions	Oct 2, 2024	Classification, intent-classification	Code Available	1
Integrative Decoding: Improve Factuality via Implicit Self-consistency	Oct 2, 2024	TruthfulQA	Code Available	1
MONICA: Benchmarking on Long-tailed Medical Image Classification	Oct 2, 2024	Benchmarking, Classification	Code Available	1
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models	Oct 2, 2024	Data Augmentation, Knowledge Distillation	Code Available	1
High-quality Animatable Eyelid Shapes from Lightweight Captures	Oct 2, 2024		Code Available	1
EMMA: Efficient Visual Alignment in Multi-Modal LLMs	Oct 2, 2024	Language Modeling, Language Modelling	Code Available	1
Integrating Visual and Textual Inputs for Searching Large-Scale Map Collections with CLIP	Oct 2, 2024	Image Retrieval	Code Available	1
MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation	Oct 2, 2024	Video Generation	Code Available	1
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression	Oct 2, 2024	Language Modeling, Language Modelling	Code Available	1
MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE Framework	Oct 2, 2024	Benchmarking, Instruction Following	Code Available	1
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking	Oct 2, 2024	3D Multi-Object Tracking, Autonomous Driving	Code Available	1
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data	Oct 2, 2024	Audio Classification, Caption Generation	Code Available	1