SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5576-5600 of 177340 papers

Title | Status | Hype
Linearizing Large Language Models | Code | 2
Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM models | Code | 2
SpA-Former: Transformer image shadow detection and removal via spatial attention | Code | 2
Layer-Condensed KV Cache for Efficient Inference of Large Language Models | Code | 2
DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting | Code | 2
Kernel Neural Optimal Transport | Code | 2
Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction | Code | 2
WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings | Code | 2
Actuarial Applications of Natural Language Processing Using Transformers: Case Studies for Using Text Features in an Actuarial Context | Code | 2
Graph-based Neural Weather Prediction for Limited Area Modeling | Code | 2
MOROCCO: Model Resource Comparison Framework | Code | 2
LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging | Code | 2
Vakyansh: ASR Toolkit for Low Resource Indic languages | Code | 2
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration | Code | 2
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model | Code | 2
CodeS: Towards Building Open-source Language Models for Text-to-SQL | Code | 2
TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer | Code | 2
DiffDock-PP: Rigid Protein-Protein Docking with Diffusion Models | Code | 2
Number it: Temporal Grounding Videos like Flipping Manga | Code | 2
Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning | Code | 2
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Code | 2
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation | Code | 2
Photoreal Scene Reconstruction from an Egocentric Device | Code | 2
How Much are Large Language Models Contaminated? A Comprehensive Survey and the LLMSanitize Library | Code | 2
TabLLM: Few-shot Classification of Tabular Data with Large Language Models | Code | 2
Page 224 of 7094