The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5051–5100 of 661570 papers

Title	Date	Tasks	Status	Hype
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses	Oct 9, 2024	scientific discoveryvalid	CodeCode Available	2
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model	Nov 27, 2023	Domain AdaptationPrompt Learning	CodeCode Available	2
CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers	Jan 3, 2024	Point Cloud Completion	CodeCode Available	2
TimeLMs: Diachronic Language Models from Twitter	Feb 8, 2022	Continual LearningLanguage Modeling	CodeCode Available	2
string2string: A Modern Python Library for String-to-String Algorithms	Apr 27, 2023		CodeCode Available	2
Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite	Sep 15, 2023	Question Answering	CodeCode Available	2
Spectrally Pruned Gaussian Fields with Neural Compensation	May 1, 2024		CodeCode Available	2
BIG-Bench Extra Hard	Feb 26, 2025		CodeCode Available	2
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective	Oct 31, 2024		CodeCode Available	2
Chain of Hindsight Aligns Language Models with Feedback	Feb 6, 2023		CodeCode Available	2
MiraGe: Editable 2D Images using Gaussian Splatting	Oct 2, 2024		CodeCode Available	2
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends	Jul 31, 2024	coreference-resolutionCoreference Resolution	CodeCode Available	2
Vision-aided UAV navigation and dynamic obstacle avoidance using gradient-based B-spline trajectory optimization	Sep 15, 2022	Navigate	CodeCode Available	2
Deep learning-driven pulmonary artery and vein segmentation reveals demography-associated vasculature anatomical differences	Apr 11, 2024	AnatomySegmentation	CodeCode Available	2
A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion	Apr 14, 2024	MambaPansharpening	CodeCode Available	2
The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models	Jul 25, 2024		CodeCode Available	2
Spiking Diffusion Models	Aug 29, 2024	Image Generation	CodeCode Available	2
Putting People in their Place: Monocular Regression of 3D People in Depth	Dec 15, 2021	3D Depth Estimationregression	CodeCode Available	2
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance	May 28, 2024		CodeCode Available	2
PnLCalib: Sports Field Registration via Points and Lines Optimization	Apr 12, 2024	Camera CalibrationHomography Estimation	CodeCode Available	2
XHand: Real-time Expressive Hand Avatar	Jul 30, 2024		CodeCode Available	2
FedGraph: A Research Library and Benchmark for Federated Graph Learning	Oct 8, 2024	BenchmarkingFederated Learning	CodeCode Available	2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation	Aug 15, 2023	3D Object DetectionAutonomous Driving	CodeCode Available	2
ZenSVI: An Open-Source Software for the Integrated Acquisition, Processing and Analysis of Street View Imagery Towards Scalable Urban Science	Dec 24, 2024		CodeCode Available	2
ProteinGym: Large-Scale Benchmarks for Protein Fitness Prediction and Design	Sep 26, 2023	Mutational/Variant Effect Prediction	CodeCode Available	2
Editing Models with Task Arithmetic	Dec 8, 2022	NegationTask Arithmetic	CodeCode Available	2
Learning Video Representations from Large Language Models	Dec 8, 2022	Action ClassificationAction Recognition	CodeCode Available	2
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives	Nov 30, 2023	Video Understanding	CodeCode Available	2
Model-free quantification of completeness, uncertainties, and outliers in atomistic machine learning using information theory	Apr 18, 2024	Active LearningUncertainty Quantification	CodeCode Available	2
Masked Face Recognition Dataset and Application	Mar 20, 2020	Face DetectionFace Recognition	CodeCode Available	2
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models	Oct 12, 2023	Natural Language UnderstandingQuantization	CodeCode Available	2
Semantic Image Synthesis via Diffusion Models	Jun 30, 2022	DecoderDenoising	CodeCode Available	2
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing	Apr 4, 2023	Multimodal fashion image editing	CodeCode Available	2
Generating 3D Molecules for Target Protein Binding	Apr 19, 2022	Drug DiscoveryGraph Neural Network	CodeCode Available	2
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation	May 22, 2023	Imitation LearningMotion Planning	CodeCode Available	2
Isotropic Correlation Models for the Cross-Section of Equity Returns	Nov 13, 2024		CodeCode Available	2
Large Language Model with Region-guided Referring and Grounding for CT Report Generation	Nov 23, 2024	Computed Tomography (CT)Diagnostic	CodeCode Available	2
QAEncoder: Towards Aligned Representation Learning in Question Answering System	Sep 30, 2024	Document EmbeddingQuestion Answering	CodeCode Available	2
Neural-Driven Image Editing	Jul 7, 2025	Contrastive LearningMultimodel-guided image editing	CodeCode Available	2
Rethinking Negative Instances for Generative Named Entity Recognition	Feb 26, 2024	named-entity-recognitionNamed Entity Recognition	CodeCode Available	2
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation	Jun 30, 2023	Action DetectionPose Prediction	CodeCode Available	2
Space Group Informed Transformer for Crystalline Materials Generation	Mar 23, 2024		CodeCode Available	2
SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation	May 3, 2024	feature selection	CodeCode Available	2
Fourier Neural Operator with Learned Deformations for PDEs on General Geometries	Jul 11, 2022	valid	CodeCode Available	2
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider	Jun 3, 2025		CodeCode Available	2
Deep Video Prior for Video Consistency and Propagation	Jan 27, 2022	Optical Flow EstimationSemantic Segmentation	CodeCode Available	2
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity	Jan 11, 2021	Language ModellingMixture-of-Experts	CodeCode Available	2
Towards Large-Scale Training of Pathology Foundation Models	Mar 24, 2024	Nuclear SegmentationSelf-Supervised Learning	CodeCode Available	2
Explicit Differentiable Slicing and Global Deformation for Cardiac Mesh Reconstruction	Sep 3, 2024	Anatomy	CodeCode Available	2
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training	Aug 3, 2022	Instance SegmentationSegmentation	CodeCode Available	2