SOTAVerified

Language Modeling

Papers

Showing 91019150 of 14182 papers

TitleStatusHype
Truncation Sampling as Language Model DesmoothingCode1
SAN: a robust end-to-end ASR model architecture0
Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence LabelingCode0
Learning Joint Representation of Human Motion and Language0
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust LearningCode1
Contrastive Decoding: Open-ended Text Generation as OptimizationCode2
Incorporating Pre-training Paradigm for Antibody Sequence-Structure Co-design0
Will we run out of data? Limits of LLM scaling based on human-generated dataCode1
A Robust Bias Mitigation Procedure Based on the Stereotype Content ModelCode0
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuningCode1
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks0
N-gram Is Back: Residual Learning of Neural Text Generation with n-gram Language ModelCode1
How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language ModelingCode0
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR PredictionCode1
Synthetic Text Generation with Differential Privacy: A Simple and Practical RecipeCode1
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition0
Leveraging Open Data and Task Augmentation to Automated Behavioral Coding of Psychotherapy Conversations in Low-Resource Scenarios0
Learning Better Intent Representations for Financial Open Intent Classification0
A single-cell gene expression language modelCode1
Dual Mechanism Priming Effects in Hindi Word Order0
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry WritingCode1
Contrastive Search Is What You Need For Neural Text GenerationCode2
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models0
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence0
Towards Unifying Reference Expression Generation and ComprehensionCode0
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text GenerationCode1
An Empirical Revisiting of Linguistic Knowledge Fusion in Language Understanding TasksCode0
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models0
A BERT-based Deep Learning Approach for Reputation Analysis in Social Media0
Code4Struct: Code Generation for Few-Shot Event Structure PredictionCode1
Do Language Models Understand Measurements?0
Language Model Pre-Training with Sparse Latent TypingCode1
Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification0
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model InfillingCode1
Hard Gate Knowledge Distillation -- Leverage Calibration for Robust and Reliable Language Model0
Generative Prompt Tuning for Relation ClassificationCode1
PENTATRON: PErsonalized coNText-Aware Transformer for Retrieval-based cOnversational uNderstanding0
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data AugmentationCode0
LMPriors: Pre-Trained Language Models as Task-Specific Priors0
Understanding Domain Learning in Language Models Through Subpopulation AnalysisCode0
P^3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training0
SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation0
Z-LaVI: Zero-Shot Language Solver Fueled by Visual ImaginationCode0
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal ProofsCode1
Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?Code0
InforMask: Unsupervised Informative Masking for Language Model PretrainingCode1
Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long SequencesCode1
Graphemic Normalization of the Perso-Arabic ScriptCode0
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer0
Is Encoder-Decoder Redundant for Neural Machine Translation?0
Show:102550
← PrevPage 183 of 284Next →

No leaderboard results yet.