SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 61516200 of 177340 papers

TitleStatusHype
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy PredictionCode2
ZAPBench: A Benchmark for Whole-Brain Activity Prediction in ZebrafishCode2
Effective Diffusion Transformer Architecture for Image Super-ResolutionCode2
Towards Diverse Binary Segmentation via A Simple yet General Gated NetworkCode2
LLaVA-KD: A Framework of Distilling Multimodal Large Language ModelsCode2
MolCRAFT: Structure-Based Drug Design in Continuous Parameter SpaceCode2
GlyphControl: Glyph Conditional Control for Visual Text GenerationCode2
UnIVAL: Unified Model for Image, Video, Audio and Language TasksCode2
LongVideoBench: A Benchmark for Long-context Interleaved Video-Language UnderstandingCode2
Varformer: Adapting VAR's Generative Prior for Image RestorationCode2
Guiding Generative Protein Language Models with Reinforcement LearningCode2
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal ResearchCode2
On Discrete Prompt Optimization for Diffusion ModelsCode2
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic SegmentationCode2
CrystalFormer-RL: Reinforcement Fine-Tuning for Materials DesignCode2
HISTAI: An Open-Source, Large-Scale Whole Slide Image Dataset for Computational PathologyCode2
D-Bot: Database Diagnosis System using Large Language ModelsCode2
Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge EnhancementCode2
XPSR: Cross-modal Priors for Diffusion-based Image Super-ResolutionCode2
Graph-Based Multimodal and Multi-view Alignment for Keystep RecognitionCode2
Frequency Adaptive Normalization For Non-stationary Time Series ForecastingCode2
Deep Reinforcement Learning for Multi-Agent InteractionCode2
Distributional Soft Actor-Critic with Three RefinementsCode2
Distilling Diffusion Models to Efficient 3D LiDAR Scene CompletionCode2
Navigation Variable-based Multi-objective Particle Swarm Optimization for UAV Path Planning with Kinematic ConstraintsCode2
SRAI: Towards Standardization of Geospatial AICode2
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language ModelsCode2
PyTorch FSDP: Experiences on Scaling Fully Sharded Data ParallelCode2
MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language ModelsCode2
Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI ExpertsCode2
Contrastive Search Is What You Need For Neural Text GenerationCode2
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object DetectorsCode2
Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior KnowledgeCode2
Open World Scene Graph Generation using Vision Language ModelsCode2
Exposure Bracketing Is All You Need For A High-Quality ImageCode2
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary SegmentationCode2
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation DataCode2
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language BenchmarkCode2
An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language ModelsCode2
zkLLM: Zero Knowledge Proofs for Large Language ModelsCode2
FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing ModelCode2
X^2-VLM: All-In-One Pre-trained Model For Vision-Language TasksCode2
Git-Theta: A Git Extension for Collaborative Development of Machine Learning ModelsCode2
Starting From Non-Parametric Networks for 3D Point Cloud AnalysisCode2
Foundational Large Language Models for Materials ResearchCode2
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer VisionCode2
AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling EngineCode2
Re3: Generating Longer Stories With Recursive Reprompting and RevisionCode2
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point CloudsCode2
CNMBERT: A Model for Converting Hanyu Pinyin Abbreviations to Chinese CharactersCode2
Show:102550
← PrevPage 124 of 3547Next →