SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 8751–8800 of 661,570 papers

Title | Status | Hype
Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification | Code | 2
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | Code | 2
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices | Code | 2
A Simulation Tool for V2G Enabled Demand Response Based on Model Predictive Control | Code | 2
Diff-BGM: A Diffusion Model for Video Background Music Generation | Code | 2
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography | Code | 2
xFinder: Robust and Pinpoint Answer Extraction for Large Language Models | Code | 2
Imp: Highly Capable Large Multimodal Models for Mobile Devices | Code | 2
AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field | Code | 2
End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music | Code | 2
MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise | Code | 2
CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization | Code | 2
SEMv3: A Fast and Robust Approach to Table Separation Line Detection | Code | 2
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark | Code | 2
AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements | Code | 2
NetMamba: Efficient Network Traffic Classification via Pre-training Unidirectional Mamba | Code | 2
Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | Code | 2
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Code | 2
Your Transformer is Secretly Linear | Code | 2
Transcriptomics-guided Slide Representation Learning in Computational Pathology | Code | 2
MAMCA -- Optimal on Accuracy and Efficiency for Automatic Modulation Classification with Extended Signal Length | Code | 2
MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Code | 2
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving | Code | 2
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | Code | 2
MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly Detection | Code | 2
GinAR: An End-To-End Multivariate Time Series Forecasting Model Suitable for Variable Missing | Code | 2
Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting | Code | 2
Layer-Condensed KV Cache for Efficient Inference of Large Language Models | Code | 2
TexPainter: Generative Mesh Texturing with Multi-view Consistency | Code | 2
Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance | Code | 2
Observational Scaling Laws and the Predictability of Language Model Performance | Code | 2
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning | Code | 2
Many-Shot In-Context Learning in Multimodal Foundation Models | Code | 2
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation | Code | 2
SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network | Code | 2
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery | Code | 2
Libra: Building Decoupled Vision System on Large Language Models | Code | 2
Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models | Code | 2
IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | Code | 2
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition | Code | 2
PyTorch-IE: Fast and Reproducible Prototyping for Information Extraction | Code | 2
Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary Data | Code | 2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | Code | 2
DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Code | 2
Grounded 3D-LLM with Referent Tokens | Code | 2
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection | Code | 2
Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model | Code | 2
From NeRFs to Gaussian Splats, and Back | Code | 2
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models | Code | 2
EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | Code | 2
Page 176 of 13232