SOTAVerified

Sentence Completion

Papers

Showing 150 of 91 papers

TitleStatusHype
Llama 2: Open Foundation and Fine-Tuned Chat ModelsCode8
LLaMA: Open and Efficient Foundation Language ModelsCode7
Training Compute-Optimal Large Language ModelsCode6
Mamba: Linear-Time Sequence Modeling with Selective State SpacesCode6
GPT-4 Technical ReportCode6
Mistral 7BCode6
Factuality Enhanced Language Models for Open-Ended Text GenerationCode5
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of ExpertsCode3
Finetuned Language Models Are Zero-Shot LearnersCode3
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language ModelCode3
Language Models are Few-Shot LearnersCode3
PaLM: Scaling Language Modeling with PathwaysCode2
DeBERTa: Decoding-enhanced BERT with Disentangled AttentionCode2
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningCode2
Sheared LLaMA: Accelerating Language Model Pre-training via Structured PruningCode2
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale InstructionsCode2
Crosslingual Generalization through Multitask FinetuningCode2
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General TasksCode2
GePpeTto Carves Italian into a Language ModelCode1
RoBERTa: A Robustly Optimized BERT Pretraining ApproachCode1
Exploring the Benefits of Training Expert Language Models over Instruction TuningCode1
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot LearnersCode1
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask BenchmarkCode1
Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ IndividualsCode1
Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question AnsweringCode1
HONEST: Measuring Hurtful Sentence Completion in Language ModelsCode1
Task Compass: Scaling Multi-task Pre-training with Task PrefixCode1
Learning Word Representations with Hierarchical Sparse Coding0
LLM in a flash: Efficient Large Language Model Inference with Limited Memory0
mahaNLP: A Marathi Natural Language Processing Library0
Ranking LLMs by compression0
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks0
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs0
BloombergGPT: A Large Language Model for Finance0
Clause Final Verb Prediction in Hindi: Evidence for Noisy Channel Model of Communication0
Computational Approaches to Sentence Completion0
Contextual LSTM (CLSTM) models for Large scale NLP tasks0
Defining and Evaluating Fair Natural Language Generation0
Dependency Language Models for Sentence Completion0
Differentially Private n-gram Extraction0
Efficient Language Modeling with Sparse all-MLP0
Effidit: Your AI Writing Assistant0
Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language0
Evaluating Gender Bias in Large Language Models0
Expect the unexpected: Harnessing Sentence Completion for Sarcasm Detection0
Exploiting Language Models as a Source of Knowledge for Cognitive Agents0
Exploiting Linguistic Features for Sentence Completion0
Filling Conversation Ellipsis for Better Social Dialog Understanding0
Hybrid Model For Word Prediction Using Naive Bayes and Latent Information0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CompassMTL 567M with TailorAccuracy96.1Unverified
2CompassMTL 567MAccuracy95.6Unverified
3DeBERTa-Large 304M (classification-based)Accuracy95.6Unverified
4GPT-4 (10-shot)Accuracy95.3Unverified
5LLaMA3+MoSLoRAAccuracy95Unverified
6LLaMA-2 13B + MixLoRAAccuracy94.7Unverified
7DeBERTa-Large 304MAccuracy94.7Unverified
8Unicorn 11B (fine-tuned)Accuracy93.9Unverified
9LLaMA-3 8B + MixLoRAAccuracy93.3Unverified
10LLaMA-2 7B + MixLoRAAccuracy93.1Unverified