The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 6,076–6,100 of 474,278 papers

Title	Date	Tasks	Status	Hype
ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation	Feb 20, 2025	3D Molecule Generation, Protein Design	Code Available	2
GiGL: Large-Scale Graph Neural Networks at Snapchat	Feb 20, 2025	Graph Learning	Code Available	2
Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics	Feb 19, 2025		Code Available	2
MoM: Linear Sequence Modeling with Mixture-of-Memories	Feb 19, 2025		Code Available	2
Smaller But Better: Unifying Layout Generation with Smaller Large Language Models	Feb 19, 2025	Layout Generation	Code Available	2
DataSciBench: An LLM Agent Benchmark for Data Science	Feb 19, 2025	Code Generation, Large Language Model	Code Available	2
Calibration and Option Pricing with Stochastic Volatility and Double Exponential Jumps	Feb 19, 2025	Articles, Econometrics	Code Available	2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Feb 19, 2025	GPU, Quantization	Code Available	2
Repo2Run: Automated Building Executable Environment for Code Repository at Scale	Feb 19, 2025		Code Available	2
Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention	Feb 19, 2025	image-classification, Image Classification	Code Available	2
Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework	Feb 19, 2025		Code Available	2
JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework	Feb 19, 2025	Change Detection, Earth Observation	Code Available	2
Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion Fields	Feb 19, 2025	Video Frame Interpolation	Code Available	2
SIFT: Grounding LLM Reasoning in Contexts via Stickers	Feb 19, 2025	GSM8K, Math	Code Available	2
TESS 2: A Large-Scale Generalist Diffusion Language Model	Feb 19, 2025	Instruction Following, Language Modeling	Code Available	2
Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models	Feb 19, 2025	Contrastive Learning, Sentence	Code Available	2
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation	Feb 18, 2025	3D Generation, 3D Molecule Generation	Code Available	2
DAMamba: Vision State Space Model with Dynamic Adaptive Scan	Feb 18, 2025	image-classification, Image Classification	Code Available	2
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors	Feb 18, 2025	Code Generation, Knowledge Tracing	Code Available	2
Rethinking Diverse Human Preference Learning through Principal Component Analysis	Feb 18, 2025		Code Available	2
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning	Feb 18, 2025	Math	Code Available	2
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation	Feb 18, 2025	Decoder, GPU	Code Available	2
WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects	Feb 18, 2025	Machine Translation	Code Available	2
UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design	Feb 18, 2025	Language Modeling, Language Modelling	Code Available	2
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization	Feb 18, 2025	Image Retrieval, Question Answering	Code Available	2