SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 38513900 of 177340 papers

TitleStatusHype
MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series ForecastingCode3
TOTEM: TOkenized Time Series EMbeddings for General Time Series AnalysisCode3
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click LabelsCode3
Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative RepresentationsCode3
GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented UnderstandingCode3
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language ModelsCode3
MetaAgents: Simulating Interactions of Human Behaviors for LLM-based Task-oriented Coordination via Collaborative Generative AgentsCode3
MobileNetV4 -- Universal Models for the Mobile EcosystemCode3
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic ManipulationCode3
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image EditingCode3
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song GenerationCode3
Open-Source Web Service with Morphological Dictionary-Supplemented Deep Learning for Morphosyntactic Analysis of CzechCode3
Model Inversion Attacks: A Survey of Approaches and CountermeasuresCode3
GIFT-Eval: A Benchmark For General Time Series Forecasting Model EvaluationCode3
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning AlgorithmsCode3
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any CameraCode3
Leveraging Self-Supervised Learning for Speaker DiarizationCode3
ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic ManipulationCode3
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation ModelsCode3
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real WebsitesCode3
VideoTetris: Towards Compositional Text-to-Video GenerationCode3
FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution RenderingCode3
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic BudgetCode3
EasyVolcap: Accelerating Neural Volumetric Video ResearchCode3
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing ImagesCode3
High-Speed Stereo Visual SLAM for Low-Powered Computing DevicesCode3
Swin-UMamba: Mamba-based UNet with ImageNet-based pretrainingCode3
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single VideoCode3
Automated Movie Generation via Multi-Agent CoT PlanningCode3
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization DataCode3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for CodingCode3
EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular VideoCode3
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMsCode3
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language ModelsCode3
Thinkless: LLM Learns When to ThinkCode3
Rethinking Vision Transformers for MobileNet Size and SpeedCode3
Sentiment Reasoning for HealthcareCode3
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature SynchronizerCode3
DICEPTION: A Generalist Diffusion Model for Visual Perceptual TasksCode3
MuSc: Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled ImagesCode3
HtFLlib: A Comprehensive Heterogeneous Federated Learning Library and BenchmarkCode3
Motion Anything: Any to Motion GenerationCode3
RAP-SAM: Towards Real-Time All-Purpose Segment AnythingCode3
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive ReasoningCode3
EnvGS: Modeling View-Dependent Appearance with Environment GaussianCode3
A Survey on Data Selection for Language ModelsCode3
MagicLens: Self-Supervised Image Retrieval with Open-Ended InstructionsCode3
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video GenerationCode3
A Survey on Deep Learning for Theorem ProvingCode3
APOLLO: SGD-like Memory, AdamW-level PerformanceCode3
Show:102550
← PrevPage 78 of 3547Next →