The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 10,026–10,050 of 474,278 papers

Title	Date	Tasks	Status	Hype
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models	Feb 7, 2024	Diversity, Multiple-choice	Code Available	2
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space	Feb 7, 2024	Concept Alignment, GPU	Code Available	2
Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers	Feb 7, 2024	Drug Discovery, Graph Learning	Code Available	2
Universal Neural Functionals	Feb 7, 2024		Code Available	2
ScreenAI: A Vision-Language Model for UI and Infographics Understanding	Feb 7, 2024	Chart Question Answering, Language Modeling	Code Available	2
Pedagogical Alignment of Large Language Models	Feb 7, 2024	Synthetic Data Generation	Code Available	2
ConvLoRA and AdaBN based Domain Adaptation via Self-Training	Feb 7, 2024	Domain Adaptation, Multi-target Domain Adaptation	Code Available	2
Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning	Feb 7, 2024	Contrastive Learning, Prediction	Code Available	2
BEBLID: Boosted efficient binary local image descriptor	Feb 7, 2024	Computational Efficiency, Retrieval	Code Available	2
Blue noise for diffusion models	Feb 7, 2024	Denoising	Code Available	2
A Survey on Domain Generalization for Medical Image Analysis	Feb 7, 2024	Domain Generalization, Medical Image Analysis	Code Available	2
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark	Feb 7, 2024		Code Available	2
Edu-ConvoKit: An Open-Source Library for Education Conversation Data	Feb 7, 2024		Code Available	2
Data-efficient Large Vision Models through Sequential Autoregression	Feb 7, 2024		Code Available	2
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints	Feb 7, 2024	Layout Design, Layout Generation	Code Available	2
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models	Feb 7, 2024	Instance Segmentation, Object	Code Available	2
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding	Feb 7, 2024		Code Available	2
YOLOPoint Joint Keypoint and Object Detection	Feb 6, 2024	Object, object-detection	Code Available	2
Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models	Feb 6, 2024	Stock Prediction	Code Available	2
U-shaped Vision Mamba for Single Image Dehazing	Feb 6, 2024	Image Dehazing, Image Restoration	Code Available	2
LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology	Feb 6, 2024	All, Benchmarking	Code Available	2
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model	Feb 6, 2024	Decoder, Image Segmentation	Code Available	2
Learning a Decision Tree Algorithm with Transformers	Feb 6, 2024	Meta-Learning	Code Available	2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies	Feb 6, 2024	Decision Making, Diversity	Code Available	2
Large Language Models to Enhance Bayesian Optimization	Feb 6, 2024	Bayesian Optimization, Few-Shot Learning	Code Available	2