SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 21762200 of 177340 papers

TitleStatusHype
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision ModelsCode4
fastai: A Layered API for Deep LearningCode4
Learning Important Features Through Propagating Activation DifferencesCode4
DeepResearch Bench: A Comprehensive Benchmark for Deep Research AgentsCode4
Orion-14B: Open-source Multilingual Large Language ModelsCode4
iText2KG: Incremental Knowledge Graphs Construction Using Large Language ModelsCode4
Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge SystemCode4
KernelBench: Can LLMs Write Efficient GPU Kernels?Code4
Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in KerasCode4
MaskNet: Introducing Feature-Wise Multiplication to CTR Ranking Models by Instance-Guided MaskCode4
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement LearningCode4
Windows Agent Arena: Evaluating Multi-Modal OS Agents at ScaleCode4
A Framework For Contrastive Self-Supervised Learning And Designing A New ApproachCode4
Brain-inspired Multilayer Perceptron with Spiking NeuronsCode4
V3D: Video Diffusion Models are Effective 3D GeneratorsCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free CountingCode4
An Extended Sequence Tagging Vocabulary for Grammatical Error CorrectionCode4
GPUTreeShap: Massively Parallel Exact Calculation of SHAP Scores for Tree EnsemblesCode4
TransPixeler: Advancing Text-to-Video Generation with TransparencyCode4
BlazePose: On-device Real-time Body Pose trackingCode4
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-onCode4
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary ComputationCode4
Amortized Planning with Large-Scale Transformers: A Case Study on ChessCode4
LISA: Reasoning Segmentation via Large Language ModelCode4
Show:102550
← PrevPage 88 of 7094Next →