SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4901–4950 of 177340 papers

Title | Status | Hype
Knowledge Graph-Guided Retrieval Augmented Generation | Code | 2
σ-GPTs: A New Approach to Autoregressive Models | Code | 2
Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation | Code | 2
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers | Code | 2
Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer | Code | 2
CodeJudge: Evaluating Code Generation with Large Language Models | Code | 2
RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors | Code | 2
Towards Generative Ray Path Sampling for Faster Point-to-Point Ray Tracing | Code | 2
Temporally Efficient Vision Transformer for Video Instance Segmentation | Code | 2
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Code | 2
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs | Code | 2
Pre-training Music Classification Models via Music Source Separation | Code | 2
Smooth Exploration for Robotic Reinforcement Learning | Code | 2
Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment | Code | 2
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis | Code | 2
RRHF: Rank Responses to Align Language Models with Human Feedback | Code | 2
Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | Code | 2
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory | Code | 2
Scene Text Recognition with Permuted Autoregressive Sequence Models | Code | 2
U-shaped Vision Mamba for Single Image Dehazing | Code | 2
FastMAC: Stochastic Spectral Sampling of Correspondence Graph | Code | 2
LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image Segmentation | Code | 2
Learning A Spiking Neural Network for Efficient Image Deraining | Code | 2
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE | Code | 2
Exploring the Benefit of Activation Sparsity in Pre-training | Code | 2
AST: Audio Spectrogram Transformer | Code | 2
Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models | Code | 2
Motion Mamba: Efficient and Long Sequence Motion Generation | Code | 2
A Graph-Based Approach for Category-Agnostic Pose Estimation | Code | 2
Agent Lumos: Unified and Modular Training for Open-Source Language Agents | Code | 2
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation | Code | 2
Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis | Code | 2
CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models | Code | 2
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding | Code | 2
Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Code | 2
Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and Beyond | Code | 2
3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Code | 2
Attention as a Hypernetwork | Code | 2
DETR Doesn't Need Multi-Scale or Locality Design | Code | 2
SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals | Code | 2
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Code | 2
Deformable One-shot Face Stylization via DINO Semantic Guidance | Code | 2
STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion | Code | 2
SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese | Code | 2
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models | Code | 2
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Code | 2
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving | Code | 2
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering | Code | 2
Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation | Code | 2
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding Transformer | Code | 2