SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7701-7725 of 177340 papers

Title | Status | Hype
FedPara: Low-Rank Hadamard Product for Communication-Efficient Federated Learning | Code | 2
LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications | Code | 2
Unveiling COVID-19 from Chest X-ray with deep learning: a hurdles race with small data | Code | 2
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Code | 2
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models | Code | 2
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations | Code | 2
BioCLIP: A Vision Foundation Model for the Tree of Life | Code | 2
VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module | Code | 2
Fusing finetuned models for better pretraining | Code | 2
Flow Matching in Latent Space | Code | 2
Evaluating Explainability for Graph Neural Networks | Code | 2
Efficient Quality Diversity Optimization of 3D Buildings through 2D Pre-optimization | Code | 2
Certified Human Trajectory Prediction | Code | 2
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | Code | 2
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations | Code | 2
Unveiling Deep Shadows: A Survey and Benchmark on Image and Video Shadow Detection, Removal, and Generation in the Deep Learning Era | Code | 2
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification | Code | 2
Provable Robust Watermarking for AI-Generated Text | Code | 2
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Code | 2
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation | Code | 2
ToolGen: Unified Tool Retrieval and Calling via Generation | Code | 2
MoCha-Stereo: Motif Channel Attention Network for Stereo Matching | Code | 2
Equivariant Energy-Guided SDE for Inverse Molecular Design | Code | 2
LinVT: Empower Your Image-level Large Language Model to Understand Videos | Code | 2
BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics | Code | 2