SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 87768800 of 474278 papers

TitleStatusHype
MapCoder: Multi-Agent Code Generation for Competitive Problem SolvingCode2
Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series ForecastingCode2
TexPainter: Generative Mesh Texturing with Multi-view ConsistencyCode2
Improving Point-based Crowd Counting and Localization Based on Auxiliary Point GuidanceCode2
Observational Scaling Laws and the Predictability of Language Model PerformanceCode2
Layer-Condensed KV Cache for Efficient Inference of Large Language ModelsCode2
Identifying Functionally Important Features with End-to-End Sparse Dictionary LearningCode2
IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation ModelCode2
PyTorch-IE: Fast and Reproducible Prototyping for Information ExtractionCode2
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific DiscoveryCode2
Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary DataCode2
Libra: Building Decoupled Vision System on Large Language ModelsCode2
Many-Shot In-Context Learning in Multimodal Foundation ModelsCode2
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object DetectionCode2
Grounded 3D-LLM with Referent TokensCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion ModelsCode2
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image InterpretationCode2
SpecDETR: A Transformer-based Hyperspectral Point Object Detection NetworkCode2
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase RecognitionCode2
DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy ProtectionCode2
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language ModelsCode2
From NeRFs to Gaussian Splats, and BackCode2
Xmodel-VLM: A Simple Baseline for Multimodal Vision Language ModelCode2
Enhancing Blind Video Quality Assessment with Rich Quality-aware FeaturesCode2
Show:102550
← PrevPage 352 of 18972Next →