SOTAVerified

Language Modeling

Papers

Showing 1095111000 of 14182 papers

TitleStatusHype
Composing Structure-Aware Batches for Pairwise Sentence Classification0
Composable Sparse Fine-Tuning for Cross-Lingual Transfer0
UniSAr: A Unified Structure-Aware Autoregressive Language Model for Text-to-SQL0
Probing BERT’s priors with serial reproduction chains0
Prompt-Learning for Fine-Grained Entity Typing0
Phrase-aware Unsupervised Constituency Parsing0
Tokenization on the Number Line is All You Need0
Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings0
Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks0
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings0
Repetition Facilitates Processing: The Processing Advantage of Construction Repetition in Dialogue0
XLM-E: Cross-lingual Language Model Pre-training via ELECTRACode0
Meeting Summarization with Pre-training and Clustering MethodsCode0
UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning0
Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training Data0
N-grammer: Augmenting Transformers with latent n-grams0
Prix-LM: Pretraining for Multilingual Knowledge Base Construction0
Mix and Match: Learning-free Controllable Text Generationusing Energy Language Models0
TACO: Pre-training of Deep Transformers with Attention Convolution using Disentangled Positional Representation0
NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction0
NSP-NER: A Prompt-based Learner for Few-shot NER Driven by Next Sentence Prediction0
Plug-Tagger: A Pluggable Sequence Labeling Framework with Pre-trained Language Models0
MIMICause: Representation and automatic extraction of causal relation types from clinical notes0
Unsupervised Dependency Graph Network0
Towards a Progression-Aware Autonomous Dialogue Agent0
Pinyin-bert: A new solution to Chinese pinyin to character conversion task0
Sentence-level Privacy for Document Embeddings0
StableMoE: Stable Routing Strategy for Mixture of Experts0
Mukayese: Turkish NLP Strikes Back0
On the Multilingual Capabilities of Very Large-Scale English Language Models0
Predicting Attention Sparsity in Transformers0
Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion0
Meta-learning via Language Model In-context Tuning0
Towards Unified Prompt Tuning for Few-shot Learning0
Your fairness may vary: Pretrained language model fairness in toxic text classification0
Multilingual unsupervised sequence segmentation transfers to extremely low-resource languages0
On the Use of Entity Embeddings from Pre-Trained Language Models for Knowledge Graph Completion0
Using Structured Content Plans for Fine-grained Syntactic Control in Pretrained Language Model Generation0
Temporal Language Modeling for Short Text Document Classification with Transformers0
Prompting as Multimodal Fusing0
Phone-ing it in: Towards Flexible Multi-Modal Language Model Training by Phonetic Representations of Data0
On a Benefit of Masked Language Model Pretraining: Robustness to Simplicity Bias0
MATHion: Solving Math Word Problems with Logically Consistent Problems0
RE: A Study for Restorable Embeddings0
Self-Distilled Pruning of Neural Networks0
Calculating Question Similarity is Enough: A New Method for KBQA Tasks0
Choose Your Programming Copilot: A Comparison of the Program Synthesis Performance of GitHub Copilot and Genetic Programming0
Joint Unsupervised and Supervised Training for Multilingual ASR0
Analysis of Data Augmentation Methods for Low-Resource Maltese ASR0
AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGSCode0
Show:102550
← PrevPage 220 of 284Next →

No leaderboard results yet.