SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers247,172 code links4,818 tasks

Papers

Showing 551600 of 658356 papers

TitleStatusHype
TaskBench: Benchmarking Large Language Models for Task AutomationCode6
U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image SegmentationCode6
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion ModelsCode6
Adversarial Diffusion DistillationCode6
TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML ApplicationsCode6
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from BackboneCode6
H2O Open Ecosystem for State-of-the-art Large Language ModelsCode6
A decoder-only foundation model for time-series forecastingCode6
MemGPT: Towards LLMs as Operating SystemsCode6
Mistral 7BCode6
iTransformer: Inverted Transformers Are Effective for Time Series ForecastingCode6
NEFTune: Noisy Embeddings Improve Instruction FinetuningCode6
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language ModelsCode6
Improved Baselines with Visual Instruction TuningCode6
Qwen Technical ReportCode6
Vision Transformers Need RegistersCode6
RAGAS: Automated Evaluation of Retrieval Augmented GenerationCode6
LongLoRA: Efficient Fine-tuning of Long-Context Large Language ModelsCode6
Data Formulator: AI-powered Concept-driven Visualization AuthoringCode6
An Empirical Study of Scaling Instruct-Tuned Large Multimodal ModelsCode6
Efficient Memory Management for Large Language Model Serving with PagedAttentionCode6
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language ModelsCode6
YaRN: Efficient Context Window Extension of Large Language ModelsCode6
Code Llama: Open Foundation Models for CodeCode6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across LanguagesCode6
SegRNN: Segment Recurrent Neural Network for Long-Term Time Series ForecastingCode6
AutoGluon-TimeSeries: AutoML for Probabilistic Time Series ForecastingCode6
Continual Pre-Training of Large Language Models: How to (re)warm your model?Code6
3D Gaussian Splatting for Real-Time Radiance Field RenderingCode6
L-Eval: Instituting Standardized Evaluation for Long Context Language ModelsCode6
Efficient Guided Generation for Large Language ModelsCode6
OxfordVGG Submission to the EGO4D AV Transcription ChallengeCode6
FlashAttention-2: Faster Attention with Better Parallelism and Work PartitioningCode6
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
Extending Context Window of Large Language Models via Positional InterpolationCode6
SqueezeLLM: Dense-and-Sparse QuantizationCode6
h2oGPT: Democratizing Large Language ModelsCode6
FinGPT: Open-Source Financial Large Language ModelsCode6
Simple and Controllable Music GenerationCode6
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
Direct Preference Optimization: Your Language Model is Secretly a Reward ModelCode6
Gorilla: Large Language Model Connected with Massive APIsCode6
RET-LLM: Towards a General Read-Write Memory for Large Language ModelsCode6
QLoRA: Efficient Finetuning of Quantized LLMsCode6
RWKV: Reinventing RNNs for the Transformer EraCode6
SoundStorm: Efficient Parallel Audio GenerationCode6
Better speech synthesis through scalingCode6
Shap-E: Generating Conditional 3D Implicit FunctionsCode6
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and BeyondCode6
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking HeadCode6
Show:102550
← PrevPage 12 of 13168Next →