SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 2201-2225 of 177,340 papers

Title | Status | Hype
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding | Code | 4
Deep-TEMPEST: Using Deep Learning to Eavesdrop on HDMI from its Unintended Electromagnetic Emanations | Code | 4
Adversarial Diffusion Compression for Real-World Image Super-Resolution | Code | 4
Learning Multiple Stock Trading Patterns with Temporal Routing Adaptor and Optimal Transport | Code | 4
Mathematical Supplement for the gsplat Library | Code | 4
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures | Code | 4
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models | Code | 4
Desiderata for next generation of ML model serving | Code | 4
Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians | Code | 4
MARS: Unleashing the Power of Variance Reduction for Training Large Models | Code | 4
Trackastra: Transformer-based cell tracking for live-cell microscopy | Code | 4
The Whole Is Greater than the Sum of Its Parts: Improving Music Source Separation by Bridging Network | Code | 4
Hallucination of Multimodal Large Language Models: A Survey | Code | 4
Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI | Code | 4
Pseudo-Simulation for Autonomous Driving | Code | 4
UniK3D: Universal Camera Monocular 3D Estimation | Code | 4
TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks | Code | 4
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions | Code | 4
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | Code | 4
State Space Model for New-Generation Network Alternative to Transformers: A Survey | Code | 4
Predicting Subjective Features of Questions of QA Websites using BERT | Code | 4
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard | Code | 4
FuseChat: Knowledge Fusion of Chat Models | Code | 4
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | Code | 4
Vidur: A Large-Scale Simulation Framework For LLM Inference | Code | 4
Page 89 of 7094