SOTAVerified

Language Modeling

Papers

Showing 26512700 of 14182 papers

TitleStatusHype
Tracing Origins: Coreference-aware Machine Reading ComprehensionCode1
Control Prefixes for Parameter-Efficient Text GenerationCode1
Generated Knowledge Prompting for Commonsense ReasoningCode1
Meta-learning via Language Model In-context TuningCode1
mLUKE: The Power of Entity Representations in Multilingual Pretrained Language ModelsCode1
Composable Sparse Fine-Tuning for Cross-Lingual TransferCode1
UniPELT: A Unified Framework for Parameter-Efficient Language Model TuningCode1
Symbolic Knowledge Distillation: from General Language Models to Commonsense ModelsCode1
Learning Compact Metrics for MTCode1
Time Masking for Temporal Language ModelsCode1
Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot LearningCode1
Long Expressive Memory for Sequence ModelingCode1
Improving Multi-Party Dialogue Discourse Parsing via Domain IntegrationCode1
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic FactorsCode1
Layer-wise Pruning of Transformer Attention Heads for Efficient Language ModelingCode1
Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddingsCode1
JuriBERT: A Masked-Language Model Adaptation for French Legal TextCode1
Revisiting Self-Training for Few-Shot Learning of Language ModelCode1
SlovakBERT: Slovak Masked Language ModelCode1
MatSciBERT: A Materials Domain Language Model for Text Mining and Information ExtractionCode1
BERT got a Date: Introducing Transformers to Temporal TaggingCode1
Factorized Neural Transducer for Efficient Language Model AdaptationCode1
Effective Use of Graph Convolution Network and Contextual Sub-Tree forCommodity News Event ExtractionCode1
XLM-K: Improving Cross-Lingual Language Model Pre-training with Multilingual KnowledgeCode1
Extracting and Inferring Personal Attributes from DialogueCode1
DziriBERT: a Pre-trained Language Model for the Algerian DialectCode1
Zero-Shot Information Extraction as a Unified Text-to-Triple TranslationCode1
Pix2seq: A Language Modeling Framework for Object DetectionCode1
TrOCR: Transformer-based Optical Character Recognition with Pre-trained ModelsCode1
JobBERT: Understanding Job Titles through SkillsCode1
Distilling Linguistic Context for Language Model CompressionCode1
KnowMAN: Weakly Supervised Multinomial Adversarial NetworksCode1
SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence RepresentationsCode1
Dialogue State Tracking with a Language Model using Schema-Driven PromptingCode1
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-trainingCode1
LM-Critic: Language Models for Unsupervised Grammatical Error CorrectionCode1
Types of Out-of-Distribution Texts and How to Detect ThemCode1
Rationales for Sequential PredictionsCode1
xGQA: Cross-Lingual Visual Question AnsweringCode1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and GenerationCode1
Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuningCode1
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained ModelsCode1
TEASEL: A Transformer-Based Speech-Prefixed Language ModelCode1
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary InitializationCode1
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-TrainingCode1
Euphemistic Phrase Detection by Masked Language ModelCode1
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine TranslationCode1
Avoiding Inference Heuristics in Few-shot Prompt-based FinetuningCode1
Efficient Nearest Neighbor Language ModelsCode1
AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language ModelsCode1
Show:102550
← PrevPage 54 of 284Next →

No leaderboard results yet.