SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 10026–10050 of 474278 papers

Title | Status | Hype
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models | Code | 2
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space | Code | 2
Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers | Code | 2
Universal Neural Functionals | Code | 2
ScreenAI: A Vision-Language Model for UI and Infographics Understanding | Code | 2
Pedagogical Alignment of Large Language Models | Code | 2
ConvLoRA and AdaBN based Domain Adaptation via Self-Training | Code | 2
Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning | Code | 2
BEBLID: Boosted efficient binary local image descriptor | Code | 2
Blue noise for diffusion models | Code | 2
A Survey on Domain Generalization for Medical Image Analysis | Code | 2
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark | Code | 2
Edu-ConvoKit: An Open-Source Library for Education Conversation Data | Code | 2
Data-efficient Large Vision Models through Sequential Autoregression | Code | 2
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints | Code | 2
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Code | 2
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding | Code | 2
YOLOPoint Joint Keypoint and Object Detection | Code | 2
Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models | Code | 2
U-shaped Vision Mamba for Single Image Dehazing | Code | 2
LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology | Code | 2
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Code | 2
Learning a Decision Tree Algorithm with Transformers | Code | 2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies | Code | 2
Large Language Models to Enhance Bayesian Optimization | Code | 2
Page 402 of 18972