SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 33263350 of 661570 papers

TitleStatusHype
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective FusionCode3
EfficientQAT: Efficient Quantization-Aware Training for Large Language ModelsCode3
Neural Localizer Fields for Continuous 3D Human Pose and Shape EstimationCode3
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation BenchmarkCode3
Inference Performance Optimization for Large Language Models on CPUsCode3
Robust Neural Information Retrieval: An Adversarial and Out-of-distribution PerspectiveCode3
Revisiting, Benchmarking and Understanding Unsupervised Graph Domain AdaptationCode3
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation ModelsCode3
Scaling Retrieval-Based Language Models with a Trillion-Token DatastoreCode3
Chat-Edit-3D: Interactive 3D Scene Editing via Text PromptsCode3
A Survey on LoRA of Large Language ModelsCode3
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work TasksCode3
Unified Approach for Hedging Impermanent Loss of Liquidity ProvisionCode3
LoRA-GA: Low-Rank Adaptation with Gradient ApproximationCode3
Better by Default: Strong Pre-Tuned MLPs and Boosted Trees on Tabular DataCode3
CountGD: Multi-Modal Open-World CountingCode3
OneRestore: A Universal Restoration Framework for Composite DegradationCode3
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem AugmentationCode3
LaRa: Efficient Large-Baseline Radiance FieldsCode3
Simplifying Deep Temporal Difference LearningCode3
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model LeaderboardsCode3
Consistency Flow Matching: Defining Straight Flows with Velocity ConsistencyCode3
What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language ModelsCode3
TokenPacker: Efficient Visual Projector for Multimodal LLMCode3
A Practical Review of Mechanistic Interpretability for Transformer-Based Language ModelsCode3
Show:102550
← PrevPage 134 of 26463Next →