SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3951–3975 of 661,570 papers

Title | Status | Hype
EscherNet: A Generative Model for Scalable View Synthesis | Code | 3
DistiLLM: Towards Streamlined Distillation for Large Language Models | Code | 3
Does confidence calibration improve conformal prediction? | Code | 3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Code | 3
Neural networks for abstraction and reasoning: Towards broad generalization in machines | Code | 3
SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM | Code | 3
Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining | Code | 3
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache | Code | 3
V-IRL: Grounding Virtual Intelligence in Real Life | Code | 3
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning | Code | 3
AutoTimes: Autoregressive Time Series Forecasters via Large Language Models | Code | 3
A Survey of Large Language Models in Finance (FinLLMs) | Code | 3
TopoX: A Suite of Python Packages for Machine Learning on Topological Domains | Code | 3
SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Code | 3
Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting | Code | 3
Transolver: A Fast Transformer Solver for PDEs on General Geometries | Code | 3
Position: Graph Foundation Models are Already Here | Code | 3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Code | 3
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution | Code | 3
cmaes : A Simple yet Practical Python Library for CMA-ES | Code | 3
A Survey on Self-Supervised Learning for Non-Sequential Tabular Data | Code | 3
TravelPlanner: A Benchmark for Real-World Planning with Language Agents | Code | 3
GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting | Code | 3
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces | Code | 3
Repeat After Me: Transformers are Better than State Space Models at Copying | Code | 3
Page 159 of 26463