SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9551-9575 of 177340 papers

Title | Status | Hype
An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records | Code | 2
CHGNet: Pretrained universal neural network potential for charge-informed atomistic modeling | Code | 2
LogAI: A Library for Log Analytics and Intelligence | Code | 2
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model | Code | 2
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints | Code | 2
Geometric Latent Diffusion Models for 3D Molecule Generation | Code | 2
Accelerating Self-Play Learning in Go | Code | 2
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models | Code | 2
MoEUT: Mixture-of-Experts Universal Transformers | Code | 2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms | Code | 2
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Code | 2
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs | Code | 2
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization | Code | 2
Cross-Image Relational Knowledge Distillation for Semantic Segmentation | Code | 2
MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation | Code | 2
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection | Code | 2
R3LIVE: A Robust, Real-time, RGB-colored, LiDAR-Inertial-Visual tightly-coupled state Estimation and mapping package | Code | 2
ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents | Code | 2
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval | Code | 2
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization | Code | 2
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device | Code | 2
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | Code | 2
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment | Code | 2
FEC: Fast Euclidean Clustering for Point Cloud Segmentation | Code | 2
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging | Code | 2
Page 383 of 7094