SOTAVerified

Sentence Completion

Papers

Showing 26-50 of 91 papers

| Title | Status | Hype |
| --- | --- | --- |
| UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Code | 1 |
| GePpeTto Carves Italian into a Language Model | Code | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Code | 1 |
| Evaluating Gender Bias in Large Language Models | | 0 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Code | 0 |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | | 0 |
| Ranking LLMs by compression | | 0 |
| Mixture-of-Subspaces in Low-Rank Adaptation | Code | 0 |
| Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language | | 0 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Code | 0 |
| Illuminating the Black Box: A Psychometric Investigation into the Multifaceted Nature of Large Language Models | | 0 |
| LLM in a flash: Efficient Large Language Model Inference with Limited Memory | | 0 |
| The Falcon Series of Open Language Models | | 0 |
| mahaNLP: A Marathi Natural Language Processing Library | Code | 0 |
| BTRec: BERT-Based Trajectory Recommendation for Personalized Tours | Code | 0 |
| Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models | Code | 0 |
| Exploiting Language Models as a Source of Knowledge for Cognitive Agents | | 0 |
| I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection | | 0 |
| Stay on topic with Classifier-Free Guidance | | 0 |
| ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | Code | 0 |
| PaLM 2 Technical Report | Code | 0 |
| BloombergGPT: A Large Language Model for Finance | Code | 0 |
| Numeracy from Literacy: Data Science as an Emergent Skill from Large Language Models | | 0 |
| POIBERT: A Transformer-based Model for the Tour Recommendation Problem | | 0 |
| Implicit causality in GPT-2: a case study | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CompassMTL 567M with Tailor | Accuracy | 96.1 | | Unverified |
| 2 | CompassMTL 567M | Accuracy | 95.6 | | Unverified |
| 3 | DeBERTa-Large 304M (classification-based) | Accuracy | 95.6 | | Unverified |
| 4 | GPT-4 (10-shot) | Accuracy | 95.3 | | Unverified |
| 5 | LLaMA3+MoSLoRA | Accuracy | 95 | | Unverified |
| 6 | LLaMA-2 13B + MixLoRA | Accuracy | 94.7 | | Unverified |
| 7 | DeBERTa-Large 304M | Accuracy | 94.7 | | Unverified |
| 8 | Unicorn 11B (fine-tuned) | Accuracy | 93.9 | | Unverified |
| 9 | LLaMA-3 8B + MixLoRA | Accuracy | 93.3 | | Unverified |
| 10 | LLaMA-2 7B + MixLoRA | Accuracy | 93.1 | | Unverified |