SOTAVerified

Language Modeling

Papers

Showing 10651-10700 of 14182 papers

Title | Status | Hype
Relating Neural Text Degeneration to Exposure Bias | - | 0
Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy | - | 0
SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis | - | 0
Primer: Searching for Efficient Transformers for Language Modeling | Code | 0
Exploring Multitask Learning for Low-Resource Abstractive Summarization | - | 0
Distilling Linguistic Context for Language Model Compression | Code | 1
Does Commonsense help in detecting Sarcasm? | Code | 0
A Bag of Tricks for Dialogue Summarization | - | 0
Regularized Training of Nearest Neighbor Language Models | - | 0
MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection | Code | 0
The Language Model Understood the Prompt was Ambiguous: Probing Syntactic Uncertainty Through Generation | - | 0
KnowMAN: Weakly Supervised Multinomial Adversarial Networks | Code | 1
Let the CAT out of the bag: Contrastive Attributed explanations for Text | - | 0
Do Language Models Know the Way to Rome? | - | 0
Improving Text Auto-Completion with Next Phrase Prediction | - | 0
Beyond Glass-Box Features: Uncertainty Quantification Enhanced Quality Estimation for Neural Machine Translation | - | 0
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Code | 1
Comparing Text Representations: A Theory-Driven Approach | Code | 0
Dialogue State Tracking with a Language Model using Schema-Driven Prompting | Code | 1
"It doesn't look good for a date": Transforming Critiques into Preferences for Conversational Recommendation Systems | Code | 0
RankNAS: Efficient Neural Architecture Search by Pairwise Ranking | - | 0
Tied & Reduced RNN-T Decoder | - | 0
SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations | Code | 1
On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation | - | 0
Rationales for Sequential Predictions | Code | 1
Types of Out-of-Distribution Texts and How to Detect Them | Code | 1
LM-Critic: Language Models for Unsupervised Grammatical Error Correction | Code | 1
MDAPT: Multilingual Domain Adaptive Pretraining in a Single Model | Code | 0
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models | Code | 1
Connecting degree and polarity: An artificial language learning study | Code | 0
xGQA: Cross-Lingual Visual Question Answering | Code | 1
KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation | - | 0
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Code | 1
Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning | Code | 1
TEASEL: A Transformer-Based Speech-Prefixed Language Model | Code | 1
Single-Read Reconstruction for DNA Data Storage Using Transformers | - | 0
Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement of Language Models | Code | 0
Studying word order through iterative shuffling | Code | 0
Euphemistic Phrase Detection by Masked Language Model | Code | 1
EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling | - | 0
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training | Code | 1
Dual-State Capsule Networks for Text Classification | - | 0
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Code | 1
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation | Code | 1
Debiasing Methods in Natural Language Understanding Make Bias More Accessible | Code | 1
AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models | Code | 1
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning | Code | 1
Efficient Nearest Neighbor Language Models | Code | 1
MetaXT: Meta Cross-Task Transfer between Disparate Label Spaces | - | 0
TruthfulQA: Measuring How Models Mimic Human Falsehoods | Code | 1
Page 214 of 284

No leaderboard results yet.