| Truncation Sampling as Language Model Desmoothing | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SAN: a robust end-to-end ASR model architecture | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling | Oct 27, 2022 | Chinese Named Entity RecognitionChinese Word Segmentation | CodeCode Available | 0 |
| Learning Joint Representation of Human Motion and Language | Oct 27, 2022 | Action RecognitionContrastive Learning | —Unverified | 0 |
| COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Contrastive Decoding: Open-ended Text Generation as Optimization | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Incorporating Pre-training Paradigm for Antibody Sequence-Structure Co-design | Oct 26, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Will we run out of data? Limits of LLM scaling based on human-generated data | Oct 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Robust Bias Mitigation Procedure Based on the Stereotype Content Model | Oct 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning | Oct 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks | Oct 26, 2022 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| N-gram Is Back: Residual Learning of Neural Text Generation with n-gram Language Model | Oct 26, 2022 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling | Oct 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR Prediction | Oct 25, 2022 | AllClick-Through Rate Prediction | CodeCode Available | 1 |
| Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe | Oct 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition | Oct 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Open Data and Task Augmentation to Automated Behavioral Coding of Psychotherapy Conversations in Low-Resource Scenarios | Oct 25, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning Better Intent Representations for Financial Open Intent Classification | Oct 25, 2022 | Classificationintent-classification | —Unverified | 0 |
| A single-cell gene expression language model | Oct 25, 2022 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Dual Mechanism Priming Effects in Hindi Word Order | Oct 25, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing | Oct 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Contrastive Search Is What You Need For Neural Text Generation | Oct 25, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models | Oct 25, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence | Oct 25, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Unifying Reference Expression Generation and Comprehension | Oct 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation | Oct 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| An Empirical Revisiting of Linguistic Knowledge Fusion in Language Understanding Tasks | Oct 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models | Oct 24, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A BERT-based Deep Learning Approach for Reputation Analysis in Social Media | Oct 23, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Code4Struct: Code Generation for Few-Shot Event Structure Prediction | Oct 23, 2022 | Code GenerationEvent Argument Extraction | CodeCode Available | 1 |
| Do Language Models Understand Measurements? | Oct 23, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Pre-Training with Sparse Latent Typing | Oct 23, 2022 | Few-shot NERLanguage Modeling | CodeCode Available | 1 |
| Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification | Oct 23, 2022 | Few-Shot Text ClassificationLanguage Modeling | —Unverified | 0 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 |
| Hard Gate Knowledge Distillation -- Leverage Calibration for Robust and Reliable Language Model | Oct 22, 2022 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Generative Prompt Tuning for Relation Classification | Oct 22, 2022 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| PENTATRON: PErsonalized coNText-Aware Transformer for Retrieval-based cOnversational uNderstanding | Oct 22, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation | Oct 22, 2022 | counterfactualData Augmentation | CodeCode Available | 0 |
| LMPriors: Pre-Trained Language Models as Task-Specific Priors | Oct 22, 2022 | Causal InferenceCommon Sense Reasoning | —Unverified | 0 |
| Understanding Domain Learning in Language Models Through Subpopulation Analysis | Oct 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| P^3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training | Oct 22, 2022 | Conversational Question AnsweringDecoder | —Unverified | 0 |
| SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation | Oct 21, 2022 | Entity LinkingEntity Typing | —Unverified | 0 |
| Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination | Oct 21, 2022 | Image GenerationLanguage Modeling | CodeCode Available | 0 |
| Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs | Oct 21, 2022 | Automated Theorem ProvingLanguage Modeling | CodeCode Available | 1 |
| Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies? | Oct 21, 2022 | Image-text matchingLanguage Modeling | CodeCode Available | 0 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long Sequences | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Graphemic Normalization of the Perso-Arabic Script | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer | Oct 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Is Encoder-Decoder Redundant for Neural Machine Translation? | Oct 21, 2022 | DecoderLanguage Modeling | —Unverified | 0 |