| AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Oct 12, 2022 | Language Modeling | Code Available | 1 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Oct 11, 2022 | Contrastive Learning, Image-text matching | Code Available | 1 |
| Understanding the Failure of Batch Normalization for Transformers in NLP | Oct 11, 2022 | Image Classification | Code Available | 1 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Oct 11, 2022 | Computational Efficiency, Language Modeling | Code Available | 1 |
| A Kernel-Based View of Language Model Fine-Tuning | Oct 11, 2022 | Language Modeling | Code Available | 1 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling | Code Available | 1 |
| Controllable Dialogue Simulation with In-Context Learning | Oct 9, 2022 | Data Augmentation, In-Context Learning | Code Available | 1 |
| Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling | Oct 8, 2022 | Language Modeling | Code Available | 1 |
| InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings | Oct 8, 2022 | Contrastive Learning, Language Modeling | Code Available | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense Reasoning, Coreference Resolution | Code Available | 1 |
| Bayesian Prompt Learning for Image-Language Model Generalization | Oct 5, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 |
| CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Oct 5, 2022 | Automatic Speech Recognition (ASR), Clustering | Code Available | 1 |
| Less is More: Task-aware Layer-wise Distillation for Language Model Compression | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| Knowledge Unlearning for Mitigating Privacy Risks in Language Models | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| Towards Improving Faithfulness in Abstractive Summarization | Oct 4, 2022 | Abstractive Text Summarization, Decoder | Code Available | 1 |
| The Surprising Computational Power of Nondeterministic Stack RNNs | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model | Oct 3, 2022 | Language Modeling | Code Available | 1 |
| ContraCLM: Contrastive Learning For Causal Language Model | Oct 3, 2022 | Code Generation, Code Search | Code Available | 1 |
| Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks | Oct 1, 2022 | Language Modeling | Code Available | 1 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality Identification, Language Modeling | Code Available | 1 |
| BECEL: Benchmark for Consistency Evaluation of Language Models | Oct 1, 2022 | Language Modeling | Code Available | 1 |
| DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents | Oct 1, 2022 | Document Understanding, Form | Code Available | 1 |
| polyBERT: A chemical language model to enable fully machine-driven ultrafast polymer informatics | Sep 29, 2022 | Language Modeling | Code Available | 1 |
| A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing | Sep 27, 2022 | Articles, Language Modeling | Code Available | 1 |
| Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models | Sep 20, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 |
| Probabilistic Generative Transformer Language models for Generative Design of Molecules | Sep 20, 2022 | Language Modeling | Code Available | 1 |
| The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction and Constrained Decoding | Sep 16, 2022 | Language Modeling | Code Available | 1 |
| Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach | Sep 15, 2022 | Diversity, Language Modeling | Code Available | 1 |
| TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at Twitter | Sep 15, 2022 | Language Modeling | Code Available | 1 |
| Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM | Sep 8, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| ASR2K: Speech Recognition for Around 2000 Languages without Audio | Sep 6, 2022 | Language Modeling | Code Available | 1 |
| TransPolymer: a Transformer-based language model for polymer property predictions | Sep 3, 2022 | Language Modeling | Code Available | 1 |
| FOLIO: Natural Language Reasoning with First-Order Logic | Sep 2, 2022 | Language Modeling | Code Available | 1 |
| LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval | Aug 31, 2022 | CPU, Decoder | Code Available | 1 |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Aug 24, 2022 | Language Modeling | Code Available | 1 |
| Interpreting Song Lyrics with an Audio-Informed Pre-trained Language Model | Aug 24, 2022 | Language Modeling | Code Available | 1 |
| Prompting as Probing: Using Language Models for Knowledge Base Construction | Aug 23, 2022 | Knowledge Base Construction, Language Modeling | Code Available | 1 |
| Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies | Aug 18, 2022 | Language Modeling | Code Available | 1 |
| CoditT5: Pretraining for Source Code and Natural Language Editing | Aug 10, 2022 | Bug fixing, Language Modeling | Code Available | 1 |
| Controlling Perceived Emotion in Symbolic Music Generation with Monte Carlo Tree Search | Aug 10, 2022 | Language Modeling | Code Available | 1 |
| GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training | Aug 8, 2022 | Image-text matching, Language Modeling | Code Available | 1 |
| Composable Text Controls in Latent Space with ODEs | Aug 1, 2022 | Attribute, Language Modeling | Code Available | 1 |
| Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval | Jul 31, 2022 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| Contextual Information and Commonsense Based Prompt for Emotion Recognition in Conversation | Jul 27, 2022 | Emotion Recognition, Emotion Recognition in Conversation | Code Available | 1 |
| Training Effective Neural Sentence Encoders from Automatically Mined Paraphrases | Jul 26, 2022 | Language Modeling | Code Available | 1 |
| Improving Mandarin Speech Recogntion with Block-augmented Transformer | Jul 24, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Zero-Shot Video Captioning with Evolving Pseudo-Tokens | Jul 22, 2022 | Image Captioning, Image-text matching | Code Available | 1 |
| Unsupervised pre-training of graph transformers on patient population graphs | Jul 21, 2022 | Language Modeling | Code Available | 1 |
| Label2Label: A Language Modeling Framework for Multi-Attribute Learning | Jul 18, 2022 | Attribute, Clothing Attribute Recognition | Code Available | 1 |