SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 626650 of 659983 papers

TitleStatusHype
ART: Automatic multi-step reasoning and tool-use for large language modelsCode6
Distributed Inference and Fine-tuning of Large Language Models Over The InternetCode6
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
Simple and Controllable Music GenerationCode6
RAGAS: Automated Evaluation of Retrieval Augmented GenerationCode6
MusicLM: Generating Music From TextCode6
Long Document Summarization with Top-down and Bottom-up InferenceCode6
Training Compute-Optimal Large Language ModelsCode6
Nerfstudio: A Modular Framework for Neural Radiance Field DevelopmentCode6
Extending Context Window of Large Language Models via Positional InterpolationCode6
Seamless: Multilingual Expressive and Streaming Speech TranslationCode6
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion ModelsCode6
SegRNN: Segment Recurrent Neural Network for Long-Term Time Series ForecastingCode6
Gorilla: Large Language Model Connected with Massive APIsCode6
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging FaceCode6
U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image SegmentationCode6
FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement LearningCode6
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
OxfordVGG Submission to the EGO4D AV Transcription ChallengeCode6
Efficient and Effective Text Encoding for Chinese LLaMA and AlpacaCode6
Training language models to follow instructions with human feedbackCode6
LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer5
DeepEyesV2: Toward Agentic Multimodal Model5
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters5
OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data5
Show:102550
← PrevPage 26 of 26400Next →