SOTAVerified

Sentence Completion

Papers

Showing 1-50 of 91 papers

| Title | Status | Hype |
| --- | --- | --- |
| Llama 2: Open Foundation and Fine-Tuned Chat Models | Code | 8 |
| LLaMA: Open and Efficient Foundation Language Models | Code | 7 |
| GPT-4 Technical Report | Code | 6 |
| Training Compute-Optimal Large Language Models | Code | 6 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6 |
| Mistral 7B | Code | 6 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Code | 5 |
| Finetuned Language Models Are Zero-Shot Learners | Code | 3 |
| Language Models are Few-Shot Learners | Code | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Code | 3 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Code | 3 |
| Crosslingual Generalization through Multitask Finetuning | Code | 2 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | Code | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Code | 2 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Code | 2 |
| PaLM: Scaling Language Modeling with Pathways | Code | 2 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Code | 2 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Code | 2 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Code | 1 |
| UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Code | 1 |
| Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering | Code | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Code | 1 |
| GePpeTto Carves Italian into a Language Model | Code | 1 |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Code | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Code | 1 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | Code | 1 |
| Task Compass: Scaling Multi-task Pre-training with Task Prefix | Code | 1 |
| LLM in a flash: Efficient Large Language Model Inference with Limited Memory | | 0 |
| Ranking LLMs by compression | | 0 |
| A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks | | 0 |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | | 0 |
| Clause Final Verb Prediction in Hindi: Evidence for Noisy Channel Model of Communication | | 0 |
| Computational Approaches to Sentence Completion | | 0 |
| Contextual LSTM (CLSTM) models for Large scale NLP tasks | | 0 |
| Defining and Evaluating Fair Natural Language Generation | | 0 |
| Dependency Language Models for Sentence Completion | | 0 |
| Differentially Private n-gram Extraction | | 0 |
| Efficient Language Modeling with Sparse all-MLP | | 0 |
| Effidit: Your AI Writing Assistant | | 0 |
| Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language | | 0 |
| Evaluating Gender Bias in Large Language Models | | 0 |
| Expect the unexpected: Harnessing Sentence Completion for Sarcasm Detection | | 0 |
| Exploiting Language Models as a Source of Knowledge for Cognitive Agents | | 0 |
| Exploiting Linguistic Features for Sentence Completion | | 0 |
| Filling Conversation Ellipsis for Better Social Dialog Understanding | | 0 |
| Hybrid Model For Word Prediction Using Naive Bayes and Latent Information | | 0 |
| iCap: Interactive Image Captioning with Predictive Text | | 0 |
| Illuminating the Black Box: A Psychometric Investigation into the Multifaceted Nature of Large Language Models | | 0 |
| Implicit causality in GPT-2: a case study | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CompassMTL 567M with Tailor | Accuracy | 96.1 | | Unverified |
| 2 | CompassMTL 567M | Accuracy | 95.6 | | Unverified |
| 3 | DeBERTa-Large 304M (classification-based) | Accuracy | 95.6 | | Unverified |
| 4 | GPT-4 (10-shot) | Accuracy | 95.3 | | Unverified |
| 5 | LLaMA3+MoSLoRA | Accuracy | 95 | | Unverified |
| 6 | LLaMA-2 13B + MixLoRA | Accuracy | 94.7 | | Unverified |
| 7 | DeBERTa-Large 304M | Accuracy | 94.7 | | Unverified |
| 8 | Unicorn 11B (fine-tuned) | Accuracy | 93.9 | | Unverified |
| 9 | LLaMA-3 8B + MixLoRA | Accuracy | 93.3 | | Unverified |
| 10 | LLaMA-2 7B + MixLoRA | Accuracy | 93.1 | | Unverified |