SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 1626–1650 of 661,570 papers

Title | Status | Hype
Distilling Tiny and Ultra-fast Deep Neural Networks for Autonomous Navigation on Nano-UAVs | Code | 4
Halu-J: Critique-Based Hallucination Judge | Code | 4
Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference | Code | 4
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments | Code | 4
Deep-TEMPEST: Using Deep Learning to Eavesdrop on HDMI from its Unintended Electromagnetic Emanations | Code | 4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine | Code | 4
SEED-Story: Multimodal Long Story Generation with Large Language Model | Code | 4
A Survey on Deep Stereo Matching in the Twenties | Code | 4
A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends | Code | 4
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training | Code | 4
The GeometricKernels Package: Heat and Matérn Kernels for Geometric Learning on Manifolds, Meshes, and Graphs | Code | 4
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Code | 4
Wavelet Convolutions for Large Receptive Fields | Code | 4
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Code | 4
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Code | 4
MUSE: Machine Unlearning Six-Way Evaluation for Language Models | Code | 4
TALENT: A Tabular Analytics and Learning Toolbox | Code | 4
Modern Neighborhood Components Analysis: A Deep Tabular Baseline Two Decades Later | Code | 4
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis | Code | 4
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models | Code | 4
Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-Drones | Code | 4
Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies | Code | 4
fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Code | 4
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds | Code | 4
A Closer Look at Deep Learning Methods on Tabular Datasets | Code | 4