SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4076-4100 of 177340 papers

Title | Status | Hype
Efficient Large Language Models: A Survey | Code | 3
Navigating Eukaryotic Genome Annotation Pipelines: A Route Map to BRAKER, Galba, and TSEBRA | Code | 3
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World | Code | 3
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series | Code | 3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Code | 3
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL | Code | 3
DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input | Code | 3
TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Similarity Tree | Code | 3
VidTwin: Video VAE with Decoupled Structure and Dynamics | Code | 3
Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks | Code | 3
Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions | Code | 3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks? | Code | 3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Code | 3
OctoPack: Instruction Tuning Code Large Language Models | Code | 3
Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies | Code | 3
Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization Landscape | Code | 3
MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | Code | 3
On the use of deep learning for phase recovery | Code | 3
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models | Code | 3
NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer | Code | 3
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Code | 3
MAPIE: an open-source library for distribution-free uncertainty quantification | Code | 3
PhysX: Physical-Grounded 3D Asset Generation | Code | 3
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Code | 3
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale | Code | 3
Page 164 of 7094