SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5251-5300 of 661,570 papers

Title | Status | Hype
Causal Diffusion Transformers for Generative Modeling | Code | 2
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference | Code | 2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement | Code | 2
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation | Code | 2
When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning | Code | 2
When Do LLMs Help With Node Classification? A Comprehensive Analysis | Code | 2
GhostNetV2: Enhance Cheap Operation with Long-Range Attention | Code | 2
A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection | Code | 2
Atlas: End-to-End 3D Scene Reconstruction from Posed Images | Code | 2
Federated Learning in Mobile Networks: A Comprehensive Case Study on Traffic Forecasting | Code | 2
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization | Code | 2
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models | Code | 2
SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network | Code | 2
Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation | Code | 2
Are Self-Attentions Effective for Time Series Forecasting? | Code | 2
Autoformalizing Euclidean Geometry | Code | 2
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation | Code | 2
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction | Code | 2
HybridNets: End-to-End Perception Network | Code | 2
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training | Code | 2
Recommendation as Language Processing (RLP): A Unified Pretrain, Personalized Prompt & Predict Paradigm (P5) | Code | 2
D-Flow: Differentiating through Flows for Controlled Generation | Code | 2
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation | Code | 2
Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image Analysis | Code | 2
MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning | Code | 2
Cross-video Identity Correlating for Person Re-identification Pre-training | Code | 2
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion | Code | 2
FCN: Fusing Exponential and Linear Cross Network for Click-Through Rate Prediction | Code | 2
SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation | Code | 2
Wavelet-based Mamba with Fourier Adjustment for Low-light Image Enhancement | Code | 2
Learning Vision from Models Rivals Learning Vision from Data | Code | 2
Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Code | 2
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems | Code | 2
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Code | 2
MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Code | 2
Correlation Matching Transformation Transformers for UHD Image Restoration | Code | 2
Me LLaMA: Foundation Large Language Models for Medical Applications | Code | 2
Mixed Diffusion for 3D Indoor Scene Synthesis | Code | 2
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering | Code | 2
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields | Code | 2
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents | Code | 2
rPPG-Toolbox: Deep Remote PPG Toolbox | Code | 2
Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision | Code | 2
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data | Code | 2
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses | Code | 2
ControlVideo: Training-free Controllable Text-to-Video Generation | Code | 2
Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator | Code | 2
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation | Code | 2
Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark | Code | 2
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation | Code | 2
Page 106 of 13,232