SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 626-650 of 177,339 papers

Title | Status | Hype
Gorilla: Large Language Model Connected with Massive APIs | Code | 6
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face | Code | 6
U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation | Code | 6
FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning | Code | 6
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Code | 6
OxfordVGG Submission to the EGO4D AV Transcription Challenge | Code | 6
Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca | Code | 6
Training language models to follow instructions with human feedback | Code | 6
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Code | 5
Unified Training of Universal Time Series Forecasting Transformers | Code | 5
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework | Code | 5
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis | Code | 5
Learning Flow Fields in Attention for Controllable Person Image Generation | Code | 5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Code | 5
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively | Code | 5
Common 7B Language Models Already Possess Strong Math Capabilities | Code | 5
Fast On-device LLM Inference with NPUs | Code | 5
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation | Code | 5
Efficient Multimodal Learning from Data-centric Perspective | Code | 5
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation | Code | 5
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference | Code | 5
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning | Code | 5
A ConvNet for the 2020s | Code | 5
A Time Series is Worth 64 Words: Long-term Forecasting with Transformers | Code | 5
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization | Code | 5