SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 6751-6800 of 661,570 papers

Title | Status | Hype
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation | Code | 2
Liquid Structural State-Space Models | Code | 2
STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting | Code | 2
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description | Code | 2
One Transformer Can Understand Both 2D & 3D Molecular Data | Code | 2
SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video | Code | 2
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Code | 2
FedPara: Low-Rank Hadamard Product for Communication-Efficient Federated Learning | Code | 2
LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications | Code | 2
Unveiling COVID-19 from Chest X-ray with deep learning: a hurdles race with small data | Code | 2
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Code | 2
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models | Code | 2
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations | Code | 2
BioCLIP: A Vision Foundation Model for the Tree of Life | Code | 2
VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module | Code | 2
Fusing finetuned models for better pretraining | Code | 2
Flow Matching in Latent Space | Code | 2
Evaluating Explainability for Graph Neural Networks | Code | 2
Efficient Quality Diversity Optimization of 3D Buildings through 2D Pre-optimization | Code | 2
Certified Human Trajectory Prediction | Code | 2
Rethinking Mobile Block for Efficient Attention-based Models | Code | 2
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations | Code | 2
Unveiling Deep Shadows: A Survey and Benchmark on Image and Video Shadow Detection, Removal, and Generation in the Deep Learning Era | Code | 2
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification | Code | 2
Provable Robust Watermarking for AI-Generated Text | Code | 2
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Code | 2
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation | Code | 2
ToolGen: Unified Tool Retrieval and Calling via Generation | Code | 2
MoCha-Stereo: Motif Channel Attention Network for Stereo Matching | Code | 2
Equivariant Energy-Guided SDE for Inverse Molecular Design | Code | 2
LinVT: Empower Your Image-level Large Language Model to Understand Videos | Code | 2
BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics | Code | 2
Blue noise for diffusion models | Code | 2
Recurrent Memory Transformer | Code | 2
Artificial Kuramoto Oscillatory Neurons | Code | 2
AvatarGen: A 3D Generative Model for Animatable Human Avatars | Code | 2
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas | Code | 2
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation | Code | 2
SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning | Code | 2
Self-Supervised Multimodal Learning: A Survey | Code | 2
Unified Multimodal Discrete Diffusion | Code | 2
RepairAgent: An Autonomous, LLM-Based Agent for Program Repair | Code | 2
RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration | Code | 2
Accurate 3D Body Shape Regression using Metric and Semantic Attributes | Code | 2
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models | Code | 2
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts | Code | 2
Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More | Code | 2
RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Code | 2
AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception | Code | 2
Multimodal Analogical Reasoning over Knowledge Graphs | Code | 2