The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers


Showing 2126–2150 of 661,570 papers

Title	Date	Tasks	Status	Hype
TinyLLaVA: A Framework of Small-scale Large Multimodal Models	Feb 22, 2024	Visual Question Answering	Code Available	4
Building reliable sim driving agents by scaling self-play	Feb 20, 2025	Autonomous Vehicles, Benchmarking	Code Available	4
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts	Mar 13, 2024	Image Animation, Image to Video Generation	Code Available	4
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN	May 27, 2022	Image Classification, Instance Segmentation	Code Available	4
SkyReels-A2: Compose Anything in Video Diffusion Transformers	Apr 3, 2025	Human-Domain Subject-to-Video, Open-Domain Subject-to-Video	Code Available	4
Croissant: A Metadata Format for ML-Ready Datasets	Mar 28, 2024	Friction, Management	Code Available	4
DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning	Feb 28, 2025	Information Retrieval, reinforcement-learning	Code Available	4
LLMMapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources	Apr 8, 2025	Articles, Form	Code Available	4
KISS-Matcher: Fast and Robust Point Cloud Registration Revisited	Sep 23, 2024	Point Cloud Registration	Code Available	4
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control	Mar 18, 2025		Code Available	4
Prototypical Verbalizer for Prompt-based Few-shot Tuning	Mar 18, 2022	Contrastive Learning, Entity Typing	Code Available	4
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning	May 2, 2024	Autonomous Driving, counterfactual	Code Available	4
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis	Jul 20, 2022	Image Outpainting, Text-to-Image Generation	Code Available	4
Autoregressive Video Generation without Vector Quantization	Dec 18, 2024	Image Generation, Prediction	Code Available	4
Best-of-N Jailbreaking	Dec 4, 2024		Code Available	4
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems	Oct 21, 2024	Automated Theorem Proving, CPU	Code Available	4
Continual Learning of Large Language Models: A Comprehensive Survey	Apr 25, 2024	Continual Learning, Survey	Code Available	4
KTO: Model Alignment as Prospect Theoretic Optimization	Feb 2, 2024	Attribute, model	Code Available	4
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval	Sep 14, 2024	Contrastive Learning, Image Retrieval	Code Available	4
Text2SQL is Not Enough: Unifying AI and Databases with TAG	Aug 27, 2024	RAG, Retrieval-augmented Generation	Code Available	4
Towards No.1 in CLUE Semantic Matching Challenge: Pre-trained Language Model Erlangshen with Propensity-Corrected Loss	Aug 5, 2022	Language Modeling, Language Modelling	Code Available	4
Convolutional Differentiable Logic Gate Networks	Nov 7, 2024		Code Available	4
Billion-scale similarity search with GPUs	Feb 28, 2017	GPU, Image Similarity Search	Code Available	4
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers	Sep 30, 2024		Code Available	4
Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level	Mar 7, 2024		Code Available	4