SOTAVerified

Sentence Completion

Papers

Showing 2650 of 91 papers

TitleStatusHype
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale InstructionsCode2
BloombergGPT: A Large Language Model for FinanceCode0
GPT-4 Technical ReportCode6
LLaMA: Open and Efficient Foundation Language ModelsCode7
Exploring the Benefits of Training Expert Language Models over Instruction TuningCode1
Numeracy from Literacy: Data Science as an Emergent Skill from Large Language Models0
POIBERT: A Transformer-based Model for the Tour Recommendation Problem0
Implicit causality in GPT-2: a case study0
Crosslingual Generalization through Multitask FinetuningCode2
Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question AnsweringCode1
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models0
DiscoSense: Commonsense Reasoning with Discourse ConnectivesCode0
Task Compass: Scaling Multi-task Pre-training with Task PrefixCode1
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot LearnersCode1
Effidit: Your AI Writing Assistant0
SC-Ques: A Sentence Completion Question Dataset for English as a Second Language LearnersCode0
Factuality Enhanced Language Models for Open-Ended Text GenerationCode5
Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ IndividualsCode1
PaLM: Scaling Language Modeling with PathwaysCode2
Training Compute-Optimal Large Language ModelsCode6
Efficient Language Modeling with Sparse all-MLP0
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language ModelCode3
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
SeqPATE: Differentially Private Text Generation via Knowledge Distillation0
Language Models as a Knowledge Source for Cognitive Agents0
Show:102550
← PrevPage 2 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CompassMTL 567M with TailorAccuracy96.1Unverified
2CompassMTL 567MAccuracy95.6Unverified
3DeBERTa-Large 304M (classification-based)Accuracy95.6Unverified
4GPT-4 (10-shot)Accuracy95.3Unverified
5LLaMA3+MoSLoRAAccuracy95Unverified
6LLaMA-2 13B + MixLoRAAccuracy94.7Unverified
7DeBERTa-Large 304MAccuracy94.7Unverified
8Unicorn 11B (fine-tuned)Accuracy93.9Unverified
9LLaMA-3 8B + MixLoRAAccuracy93.3Unverified
10LLaMA-2 7B + MixLoRAAccuracy93.1Unverified