SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7601 to 7650 of 661,570 papers

Title | Status | Hype
MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning | Code | 2
Cross-video Identity Correlating for Person Re-identification Pre-training | Code | 2
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion | Code | 2
FCN: Fusing Exponential and Linear Cross Network for Click-Through Rate Prediction | Code | 2
SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation | Code | 2
Wavelet-based Mamba with Fourier Adjustment for Low-light Image Enhancement | Code | 2
Learning Vision from Models Rivals Learning Vision from Data | Code | 2
Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Code | 2
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems | Code | 2
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Code | 2
MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Code | 2
Correlation Matching Transformation Transformers for UHD Image Restoration | Code | 2
Me LLaMA: Foundation Large Language Models for Medical Applications | Code | 2
Mixed Diffusion for 3D Indoor Scene Synthesis | Code | 2
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Code | 2
CoSeR: Bridging Image and Language for Cognitive Super-Resolution | Code | 2
FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation | Code | 2
Improved Canonicalization for Model Agnostic Equivariance | Code | 2
PENCIL: Long Thoughts with Short Memory | Code | 2
GenN2N: Generative NeRF2NeRF Translation | Code | 2
Translating Images to Road Network: A Sequence-to-Sequence Perspective | Code | 2
BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Code | 2
Unsupervised Semantic Segmentation by Distilling Feature Correspondences | Code | 2
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models | Code | 2
Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery | Code | 2
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels | Code | 2
PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views | Code | 2
How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis | Code | 2
Blockwise Parallel Transformers for Large Context Models | Code | 2
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework | Code | 2
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models | Code | 2
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism | Code | 2
Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming | Code | 2
ETSformer: Exponential Smoothing Transformers for Time-series Forecasting | Code | 2
YOLOMG: Vision-based Drone-to-Drone Detection with Appearance and Pixel-Level Motion Fusion | Code | 2
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | Code | 2
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer | Code | 2
Preference Leakage: A Contamination Problem in LLM-as-a-judge | Code | 2
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression | Code | 2
Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution | Code | 2
Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge | Code | 2
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers | Code | 2
ParCo: Part-Coordinating Text-to-Motion Synthesis | Code | 2
A Comprehensive Survey on Self-Supervised Learning for Recommendation | Code | 2
Self-Supervised Visual Preference Alignment | Code | 2
Kandinsky 3.0 Technical Report | Code | 2
SparseLLM: Towards Global Pruning for Pre-trained Language Models | Code | 2
AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery | Code | 2
Diffusion Time-step Curriculum for One Image to 3D Generation | Code | 2
Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs | Code | 2
Page 153 of 13,232