The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 451–475 of 659983 papers

Title	Date	Tasks	Status	Hype
MambaOut: Do We Really Need Mamba for Vision?	May 13, 2024	image-classificationImage Classification	CodeCode Available	7
AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents	May 11, 2024		CodeCode Available	7
Mirage: A Multi-Level Superoptimizer for Tensor Programs	May 9, 2024	GPUNavigate	CodeCode Available	7
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers	May 9, 2024		CodeCode Available	7
xLSTM: Extended Long Short-Term Memory	May 7, 2024	Language ModelingLanguage Modelling	CodeCode Available	7
Labeling supervised fine-tuning data with the scaling law	May 5, 2024	coreference-resolutionCoreference Resolution	CodeCode Available	7
PuLID: Pure and Lightning ID Customization via Contrastive Alignment	Apr 24, 2024	Image GenerationText to Image Generation	CodeCode Available	7
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration	Apr 24, 2024	ManagementPrompt Engineering	CodeCode Available	7
Better Synthetic Data by Retrieving and Transforming Existing Datasets	Apr 22, 2024	Dataset GenerationDiversity	CodeCode Available	7
CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models	Apr 19, 2024		CodeCode Available	7
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents	Apr 16, 2024	Fact CheckingRetrieval-augmented Generation	CodeCode Available	7
Long-form music generation with latent diffusion	Apr 16, 2024	Audio GenerationForm	CodeCode Available	7
Interactive Prompt Debugging with Sequence Salience	Apr 11, 2024	Sentencetext-classification	CodeCode Available	7
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments	Apr 11, 2024	Benchmarking	CodeCode Available	7
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models	Apr 10, 2024	Image to 3D	CodeCode Available	7
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models	Apr 8, 2024		CodeCode Available	7
AutoCodeRover: Autonomous Program Improvement	Apr 8, 2024	Bug fixingCode Search	CodeCode Available	7
Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization Approach	Apr 7, 2024	Efficient ExplorationHyperparameter Optimization	CodeCode Available	7
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation	Apr 3, 2024	Image GenerationText to Image Generation	CodeCode Available	7
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models	Mar 27, 2024	Image ClassificationImage Comprehension	CodeCode Available	7
2D Gaussian Splatting for Geometrically Accurate Radiance Fields	Mar 26, 2024	3DGSNovel View Synthesis	CodeCode Available	7
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation	Mar 22, 2024	Depth EstimationSurface Normal Estimation	CodeCode Available	7
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding	Mar 22, 2024	Action ClassificationAction Recognition	CodeCode Available	7
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy	Mar 21, 2024	Contrastive LearningDescriptive	CodeCode Available	7
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance	Mar 21, 2024	Animated GIF GenerationImage Animation	CodeCode Available	7