SOTAVerified

Sentence Completion

Papers

Showing 1–50 of 91 papers

| Title | Status | Hype |
| --- | --- | --- |
| Llama 2: Open Foundation and Fine-Tuned Chat Models | Code | 8 |
| LLaMA: Open and Efficient Foundation Language Models | Code | 7 |
| Training Compute-Optimal Large Language Models | Code | 6 |
| GPT-4 Technical Report | Code | 6 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6 |
| Mistral 7B | Code | 6 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Code | 5 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Code | 3 |
| Finetuned Language Models Are Zero-Shot Learners | Code | 3 |
| Language Models are Few-Shot Learners | Code | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Code | 3 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Code | 2 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Code | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Code | 2 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Code | 2 |
| PaLM: Scaling Language Modeling with Pathways | Code | 2 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | Code | 2 |
| Crosslingual Generalization through Multitask Finetuning | Code | 2 |
| Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering | Code | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Code | 1 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Code | 1 |
| UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Code | 1 |
| Task Compass: Scaling Multi-task Pre-training with Task Prefix | Code | 1 |
| GePpeTto Carves Italian into a Language Model | Code | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Code | 1 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | Code | 1 |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Code | 1 |
| A Deep Architecture for Semantic Matching with Multiple Positional Sentence Representations | Code | 0 |
| CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense | Code | 0 |
| BTRec: BERT-Based Trajectory Recommendation for Personalized Tours | Code | 0 |
| CODAH: An Adversarially-Authored Question Answering Dataset for Common Sense | Code | 0 |
| Dependency Recurrent Neural Language Models for Sentence Completion | Code | 0 |
| DiscoSense: Commonsense Reasoning with Discourse Connectives | Code | 0 |
| HellaSwag: Can a Machine Really Finish Your Sentence? | Code | 0 |
| Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models | Code | 0 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Code | 0 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Code | 0 |
| Learning Semantically and Additively Compositional Distributional Representations | Code | 0 |
| Mixture-of-Subspaces in Low-Rank Adaptation | Code | 0 |
| Muppet: Massive Multi-task Representations with Pre-Finetuning | Code | 0 |
| Recurrent Memory Networks for Language Modeling | Code | 0 |
| ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | Code | 0 |
| SC-Ques: A Sentence Completion Question Dataset for English as a Second Language Learners | Code | 0 |
| Solving ESL Sentence Completion Questions via Pre-trained Neural Language Models | Code | 0 |
| Top-down Tree Long Short-Term Memory Networks | Code | 0 |
| Contextual LSTM (CLSTM) models for Large scale NLP tasks | | 0 |
| Learning Better Sentence Representation with Syntax Information | | 0 |
| Computational Approaches to Sentence Completion | | 0 |
| Learning Word Representations with Hierarchical Sparse Coding | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CompassMTL 567M with Tailor | Accuracy | 96.1 | | Unverified |
| 2 | CompassMTL 567M | Accuracy | 95.6 | | Unverified |
| 3 | DeBERTa-Large 304M (classification-based) | Accuracy | 95.6 | | Unverified |
| 4 | GPT-4 (10-shot) | Accuracy | 95.3 | | Unverified |
| 5 | LLaMA3+MoSLoRA | Accuracy | 95 | | Unverified |
| 6 | LLaMA-2 13B + MixLoRA | Accuracy | 94.7 | | Unverified |
| 7 | DeBERTa-Large 304M | Accuracy | 94.7 | | Unverified |
| 8 | Unicorn 11B (fine-tuned) | Accuracy | 93.9 | | Unverified |
| 9 | LLaMA-3 8B + MixLoRA | Accuracy | 93.3 | | Unverified |
| 10 | LLaMA-2 7B + MixLoRA | Accuracy | 93.1 | | Unverified |