SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 551575 of 659983 papers

TitleStatusHype
TaskBench: Benchmarking Large Language Models for Task AutomationCode6
U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image SegmentationCode6
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion ModelsCode6
Adversarial Diffusion DistillationCode6
TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML ApplicationsCode6
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from BackboneCode6
H2O Open Ecosystem for State-of-the-art Large Language ModelsCode6
A decoder-only foundation model for time-series forecastingCode6
MemGPT: Towards LLMs as Operating SystemsCode6
Mistral 7BCode6
iTransformer: Inverted Transformers Are Effective for Time Series ForecastingCode6
NEFTune: Noisy Embeddings Improve Instruction FinetuningCode6
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language ModelsCode6
Improved Baselines with Visual Instruction TuningCode6
Qwen Technical ReportCode6
Vision Transformers Need RegistersCode6
RAGAS: Automated Evaluation of Retrieval Augmented GenerationCode6
LongLoRA: Efficient Fine-tuning of Long-Context Large Language ModelsCode6
Data Formulator: AI-powered Concept-driven Visualization AuthoringCode6
An Empirical Study of Scaling Instruct-Tuned Large Multimodal ModelsCode6
Efficient Memory Management for Large Language Model Serving with PagedAttentionCode6
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language ModelsCode6
YaRN: Efficient Context Window Extension of Large Language ModelsCode6
Code Llama: Open Foundation Models for CodeCode6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across LanguagesCode6
Show:102550
← PrevPage 23 of 26400Next →