SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 86518700 of 661570 papers

TitleStatusHype
MoEUT: Mixture-of-Experts Universal TransformersCode2
Underwater Image Enhancement by Diffusion Model with Customized CLIP-ClassifierCode2
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous controlCode2
Analytic Federated LearningCode2
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph GenerationCode2
Accelerating Transformers with Spectrum-Preserving Token MergingCode2
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy OptimizationCode2
Optimizing Large Language Models for OpenAPI Code CompletionCode2
PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud AnalysisCode2
iVideoGPT: Interactive VideoGPTs are Scalable World ModelsCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in CodeCode2
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision ModelsCode2
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-ImprovementCode2
Fast-PGM: Fast Probabilistic Graphical Model Learning and InferenceCode2
Diffusion Actor-Critic with Entropy RegulatorCode2
LM4LV: A Frozen Large Language Model for Low-level Vision TasksCode2
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal ModelsCode2
Sparse maximal update parameterization: A holistic approach to sparse training dynamicsCode2
Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series ClassificationCode2
Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared ImageCode2
Composed Image Retrieval for Remote SensingCode2
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion ModelsCode2
What is a Goldilocks Face Verification Test Set?Code2
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map GenerationCode2
Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL GenerationCode2
Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2Code2
MambaVC: Learned Visual Compression with Selective State SpacesCode2
Diffusion Bridge Implicit ModelsCode2
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation ModelsCode2
AnalogCoder: Analog Circuit Design via Training-Free Code GenerationCode2
AnomalyDINO: Boosting Patch-based Few-shot Anomaly Detection with DINOv2Code2
Agent Planning with World Knowledge ModelCode2
EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health RecordsCode2
Extracting Prompts by Inverting LLM OutputsCode2
S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language ModelsCode2
Efficient Visual State Space Model for Image DeblurringCode2
RoGs: Large Scale Road Surface Reconstruction with Meshgrid GaussianCode2
Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern GeneratorsCode2
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention NetworksCode2
Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image GenerationCode2
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer ModelsCode2
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language ModelsCode2
RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic ReportsCode2
Flatten Anything: Unsupervised Neural Surface ParameterizationCode2
Metric Flow Matching for Smooth Interpolations on the Data ManifoldCode2
Calibrated Self-Rewarding Vision Language ModelsCode2
Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text RecognitionCode2
Mamba-R: Vision Mamba ALSO Needs RegistersCode2
PipeFusion: Patch-level Pipeline Parallelism for Diffusion Transformers InferenceCode2
Show:102550
← PrevPage 174 of 13232Next →