SOTAVerified

Sentence Completion

Papers

Showing 1-10 of 91 papers

Title | Status | Hype
Llama 2: Open Foundation and Fine-Tuned Chat Models | Code | 8
LLaMA: Open and Efficient Foundation Language Models | Code | 7
GPT-4 Technical Report | Code | 6
Mistral 7B | Code | 6
Training Compute-Optimal Large Language Models | Code | 6
Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6
Factuality Enhanced Language Models for Open-Ended Text Generation | Code | 5
Finetuned Language Models Are Zero-Shot Learners | Code | 3
Language Models are Few-Shot Learners | Code | 3
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Code | 3

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CompassMTL 567M with Tailor | Accuracy | 96.1 | | Unverified
2 | CompassMTL 567M | Accuracy | 95.6 | | Unverified
3 | DeBERTa-Large 304M (classification-based) | Accuracy | 95.6 | | Unverified
4 | GPT-4 (10-shot) | Accuracy | 95.3 | | Unverified
5 | LLaMA3+MoSLoRA | Accuracy | 95 | | Unverified
6 | LLaMA-2 13B + MixLoRA | Accuracy | 94.7 | | Unverified
7 | DeBERTa-Large 304M | Accuracy | 94.7 | | Unverified
8 | Unicorn 11B (fine-tuned) | Accuracy | 93.9 | | Unverified
9 | LLaMA-3 8B + MixLoRA | Accuracy | 93.3 | | Unverified
10 | LLaMA-2 7B + MixLoRA | Accuracy | 93.1 | | Unverified