SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 21512200 of 177339 papers

TitleStatusHype
Navigation World ModelsCode4
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language ModelsCode4
Diffusion-Based Planning for Autonomous Driving with Flexible GuidanceCode4
Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like SpeedCode4
VideoChat: Chat-Centric Video UnderstandingCode4
HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture RecognitionCode4
Contextual Multilingual Spellchecker for User QueriesCode4
Panoptic Feature Pyramid NetworksCode4
Evolution Transformer: In-Context Evolutionary OptimizationCode4
Segment and Track AnythingCode4
SmoothGrad: removing noise by adding noiseCode4
A Comprehensive Survey on 3D Content GenerationCode4
Autoregressive Models in Vision: A SurveyCode4
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse ViewpointsCode4
Ray: A Distributed Framework for Emerging AI ApplicationsCode4
RegNet: Self-Regulated Network for Image ClassificationCode4
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View StereoCode4
CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue DatasetCode4
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages with Fewer Resources than EnglishCode4
Dive into Deep LearningCode4
RLlib Flow: Distributed Reinforcement Learning is a Dataflow ProblemCode4
Kolmogorov-Arnold TransformerCode4
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific DiscoveryCode4
Sonata: Self-Supervised Learning of Reliable Point RepresentationsCode4
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision ModelsCode4
fastai: A Layered API for Deep LearningCode4
Learning Important Features Through Propagating Activation DifferencesCode4
DeepResearch Bench: A Comprehensive Benchmark for Deep Research AgentsCode4
Orion-14B: Open-source Multilingual Large Language ModelsCode4
iText2KG: Incremental Knowledge Graphs Construction Using Large Language ModelsCode4
Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge SystemCode4
KernelBench: Can LLMs Write Efficient GPU Kernels?Code4
Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in KerasCode4
MaskNet: Introducing Feature-Wise Multiplication to CTR Ranking Models by Instance-Guided MaskCode4
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement LearningCode4
Windows Agent Arena: Evaluating Multi-Modal OS Agents at ScaleCode4
A Framework For Contrastive Self-Supervised Learning And Designing A New ApproachCode4
Brain-inspired Multilayer Perceptron with Spiking NeuronsCode4
V3D: Video Diffusion Models are Effective 3D GeneratorsCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free CountingCode4
An Extended Sequence Tagging Vocabulary for Grammatical Error CorrectionCode4
GPUTreeShap: Massively Parallel Exact Calculation of SHAP Scores for Tree EnsemblesCode4
TransPixeler: Advancing Text-to-Video Generation with TransparencyCode4
BlazePose: On-device Real-time Body Pose trackingCode4
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-onCode4
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary ComputationCode4
Amortized Planning with Large-Scale Transformers: A Case Study on ChessCode4
LISA: Reasoning Segmentation via Large Language ModelCode4
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video UnderstandingCode4
Show:102550
← PrevPage 44 of 3547Next →