SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 4951–5000 of 177340 papers

Title | Status | Hype
YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images | Code | 2
Ecco: An Open Source Library for the Explainability of Transformer Language Models | Code | 2
DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer | Code | 2
Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model | Code | 2
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data | Code | 2
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Code | 2
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer | Code | 2
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes | Code | 2
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems | Code | 2
Unlocking the Potential of Classic GNNs for Graph-level Tasks: Simple Architectures Meet Excellence | Code | 2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Code | 2
Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV | Code | 2
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention | Code | 2
DiffiT: Diffusion Vision Transformers for Image Generation | Code | 2
Benchmarking Large Language Models in Retrieval-Augmented Generation | Code | 2
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt | Code | 2
Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior | Code | 2
Visual Adversarial Examples Jailbreak Aligned Large Language Models | Code | 2
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning | Code | 2
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training | Code | 2
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers | Code | 2
OmniMAE: Single Model Masked Pretraining on Images and Videos | Code | 2
TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers | Code | 2
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation | Code | 2
SCNet: Sparse Compression Network for Music Source Separation | Code | 2
Large Language Models Can Self-Improve in Long-context Reasoning | Code | 2
StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces | Code | 2
LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On | Code | 2
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding | Code | 2
A Simple Framework for Contrastive Learning of Visual Representations | Code | 2
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences | Code | 2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Code | 2
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection | Code | 2
The 1st-place Solution for ECCV 2022 Multiple People Tracking in Group Dance Challenge | Code | 2
KST-GCN: A Knowledge-Driven Spatial-Temporal Graph Convolutional Network for Traffic Forecasting | Code | 2
MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation | Code | 2
UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks via Adapter Tuning | Code | 2
PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis | Code | 2
LumberChunker: Long-Form Narrative Document Segmentation | Code | 2
EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation | Code | 2
PokerBench: Training Large Language Models to become Professional Poker Players | Code | 2
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding | Code | 2
Geodesic Diffusion Models for Medical Image-to-Image Generation | Code | 2
Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark | Code | 2
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation | Code | 2
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses | Code | 2
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data | Code | 2
rPPG-Toolbox: Deep Remote PPG Toolbox | Code | 2
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents | Code | 2
Explaining Explanations: Axiomatic Feature Interactions for Deep Networks | Code | 2