SOTAVerified

Sentence Completion

Papers

Showing 1-50 of 91 papers

| Title | Status | Hype |
| --- | --- | --- |
| Llama 2: Open Foundation and Fine-Tuned Chat Models | Code | 8 |
| LLaMA: Open and Efficient Foundation Language Models | Code | 7 |
| GPT-4 Technical Report | Code | 6 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6 |
| Training Compute-Optimal Large Language Models | Code | 6 |
| Mistral 7B | Code | 6 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Code | 5 |
| Language Models are Few-Shot Learners | Code | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Code | 3 |
| Finetuned Language Models Are Zero-Shot Learners | Code | 3 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Code | 3 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Code | 2 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Code | 2 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | Code | 2 |
| Crosslingual Generalization through Multitask Finetuning | Code | 2 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Code | 2 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Code | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2 |
| PaLM: Scaling Language Modeling with Pathways | Code | 2 |
| UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Code | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Code | 1 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Code | 1 |
| Task Compass: Scaling Multi-task Pre-training with Task Prefix | Code | 1 |
| GePpeTto Carves Italian into a Language Model | Code | 1 |
| Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering | Code | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Code | 1 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | Code | 1 |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Code | 1 |
| A Deep Architecture for Semantic Matching with Multiple Positional Sentence Representations | Code | 0 |
| CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense | Code | 0 |
| BloombergGPT: A Large Language Model for Finance | Code | 0 |
| BTRec: BERT-Based Trajectory Recommendation for Personalized Tours | Code | 0 |
| CODAH: An Adversarially-Authored Question Answering Dataset for Common Sense | Code | 0 |
| Dependency Recurrent Neural Language Models for Sentence Completion | Code | 0 |
| DiscoSense: Commonsense Reasoning with Discourse Connectives | Code | 0 |
| HellaSwag: Can a Machine Really Finish Your Sentence? | Code | 0 |
| Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models | Code | 0 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Code | 0 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Code | 0 |
| Learning Semantically and Additively Compositional Distributional Representations | Code | 0 |
| mahaNLP: A Marathi Natural Language Processing Library | Code | 0 |
| Mixture-of-Subspaces in Low-Rank Adaptation | Code | 0 |
| Muppet: Massive Multi-task Representations with Pre-Finetuning | Code | 0 |
| PaLM 2 Technical Report | Code | 0 |
| Recurrent Memory Networks for Language Modeling | Code | 0 |
| ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | Code | 0 |
| SC-Ques: A Sentence Completion Question Dataset for English as a Second Language Learners | Code | 0 |
| Solving ESL Sentence Completion Questions via Pre-trained Neural Language Models | Code | 0 |
| Top-down Tree Long Short-Term Memory Networks | Code | 0 |
| Learning Word Representations with Hierarchical Sparse Coding |  | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CompassMTL 567M with Tailor | Accuracy | 96.1 |  | Unverified |
| 2 | CompassMTL 567M | Accuracy | 95.6 |  | Unverified |
| 3 | DeBERTa-Large 304M (classification-based) | Accuracy | 95.6 |  | Unverified |
| 4 | GPT-4 (10-shot) | Accuracy | 95.3 |  | Unverified |
| 5 | LLaMA3+MoSLoRA | Accuracy | 95 |  | Unverified |
| 6 | LLaMA-2 13B + MixLoRA | Accuracy | 94.7 |  | Unverified |
| 7 | DeBERTa-Large 304M | Accuracy | 94.7 |  | Unverified |
| 8 | Unicorn 11B (fine-tuned) | Accuracy | 93.9 |  | Unverified |
| 9 | LLaMA-3 8B + MixLoRA | Accuracy | 93.3 |  | Unverified |
| 10 | LLaMA-2 7B + MixLoRA | Accuracy | 93.1 |  | Unverified |