The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5801–5850 of 661,570 papers

Title	Date	Tasks	Status	Hype
SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese	Jan 22, 2024	Diversity, GSM8K	Code Available	2
ChainerCV: a Library for Deep Learning in Computer Vision	Aug 28, 2017	Deep Learning, object-detection	Code Available	2
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse	Sep 17, 2024	In-Context Learning, RAG	Code Available	2
CenterNet++ for Object Detection	Apr 18, 2022	Object, object-detection	Code Available	2
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models	Mar 30, 2023	Video Alignment, Video Editing	Code Available	2
Conformal Symplectic Optimization for Stable Reinforcement Learning	Dec 3, 2024	Atari Games, Deep Reinforcement Learning	Code Available	2
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models	Mar 4, 2022	Decoder, GPU	Code Available	2
STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion	Jan 3, 2024	3D Human Pose Estimation, Human Mesh Recovery	Code Available	2
LongReward: Improving Long-context Large Language Models with AI Feedback	Oct 28, 2024	Offline RL, Reinforcement Learning (RL)	Code Available	2
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey	Feb 8, 2025	Fairness, RAG	Code Available	2
Deformable One-shot Face Stylization via DINO Semantic Guidance	Mar 1, 2024	One-Shot Face Stylization	Code Available	2
ProcessPainter: Learn Painting Process from Sequence Data	Jun 10, 2024	Denoising, Image Generation	Code Available	2
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation	Aug 24, 2023	Image-to-Image Translation	Code Available	2
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models	Oct 6, 2023	Code Generation, Decision Making	Code Available	2
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion	Feb 22, 2024	Music Generation	Code Available	2
Learning to Compress Prompts with Gist Tokens	Apr 17, 2023	Decoder	Code Available	2
TRADES: Generating Realistic Market Simulations with Diffusion Models	Jan 31, 2025	Denoising	Code Available	2
SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals	May 28, 2024	Contrastive Learning, Representation Learning	Code Available	2
MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features	Sep 30, 2022	Image Classification	Code Available	2
PPSURF: Combining Patches and Point Convolutions for Detailed Surface Reconstruction	Jan 16, 2024	Surface Reconstruction	Code Available	2
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference	Feb 28, 2025		Code Available	2
Heterogeneous Multi-Robot Reinforcement Learning	Jan 17, 2023	Graph Neural Network, Multi-agent Reinforcement Learning	Code Available	2
DETR Doesn't Need Multi-Scale or Locality Design	Aug 3, 2023	Decoder	Code Available	2
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation	Apr 25, 2024	Decoder, Semantic Segmentation	Code Available	2
Segment and Caption Anything	Dec 1, 2023	Caption Generation, object-detection	Code Available	2
Attention as a Hypernetwork	Jun 9, 2024		Code Available	2
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions	Mar 25, 2024	Attribute	Code Available	2
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data	Mar 27, 2025	Text to 3D	Code Available	2
Ontology Embedding: A Survey of Methods, Applications and Resources	Jun 16, 2024	Logical Reasoning, Ontology Embedding	Code Available	2
3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification	Aug 25, 2024	Computational Efficiency, Hyperspectral Image Classification	Code Available	2
Scaling Diffusion Transformers Efficiently via μP	May 21, 2025	Image Generation, Text to Image Generation	Code Available	2
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection	Jul 23, 2024	3D Object Detection, Autonomous Driving	Code Available	2
Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and Beyond	Oct 9, 2024	Benchmarking	Code Available	2
ViTs for SITS: Vision Transformers for Satellite Image Time Series	Jan 12, 2023	Semantic Segmentation, Time Series	Code Available	2
RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction	Mar 8, 2024	Audio Generation, Computational Efficiency	Code Available	2
LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings	Oct 1, 2022	Graph Representation Learning, Knowledge Graph Completion	Code Available	2
Optimal Flow Matching: Learning Straight Trajectories in Just One Step	Mar 19, 2024		Code Available	2
DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences	Mar 8, 2024		Code Available	2
Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework	Apr 19, 2024	Earth Observation, Segmentation	Code Available	2
g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin	Mar 20, 2022	Part-Of-Speech Tagging, Polyphone disambiguation	Code Available	2
LangProp: A code optimization framework using Large Language Models applied to driving	Jan 18, 2024	Autonomous Driving, Code Generation	Code Available	2
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding	Feb 28, 2022	Document Image Classification, document understanding	Code Available	2
GrootVL: Tree Topology is All You Need in State Space Model	Jun 4, 2024	All, image-classification	Code Available	2
Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning	Jun 6, 2024	Multi-Task Learning, Vulnerability Detection	Code Available	2
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors	May 29, 2023	Contrastive Learning, Image Reconstruction	Code Available	2
CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models	Nov 28, 2023	Dialogue Generation	Code Available	2
Towards Evaluating and Building Versatile Large Language Models for Medicine	Aug 22, 2024	Multiple-choice, named-entity-recognition	Code Available	2
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM	Jan 8, 2024	3D Object Detection, Autonomous Driving	Code Available	2
Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis	Mar 24, 2022	Denoising, Image Denoising	Code Available	2
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms	Feb 21, 2025	Scheduling	Code Available	2