| AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Oct 12, 2022 | Language Modeling | Code Available | 1 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Understanding the Failure of Batch Normalization for Transformers in NLP | Oct 11, 2022 | Image Classification | Code Available | 1 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Oct 11, 2022 | Computational Efficiency, Language Modeling | Code Available | 1 |
| MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Oct 11, 2022 | Contrastive Learning, Image-text matching | Code Available | 1 |
| A Kernel-Based View of Language Model Fine-Tuning | Oct 11, 2022 | Language Modeling | Code Available | 1 |
| Controllable Dialogue Simulation with In-Context Learning | Oct 9, 2022 | Data Augmentation, In-Context Learning | Code Available | 1 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling | Code Available | 1 |
| InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings | Oct 8, 2022 | Contrastive Learning, Language Modeling | Code Available | 1 |
| Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling | Oct 8, 2022 | Language Modeling | Code Available | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense Reasoning, Coreference Resolution | Code Available | 1 |
| Bayesian Prompt Learning for Image-Language Model Generalization | Oct 5, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 |
| CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Oct 5, 2022 | Automatic Speech Recognition (ASR), Clustering | Code Available | 1 |
| Less is More: Task-aware Layer-wise Distillation for Language Model Compression | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| Knowledge Unlearning for Mitigating Privacy Risks in Language Models | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| Towards Improving Faithfulness in Abstractive Summarization | Oct 4, 2022 | Abstractive Text Summarization, Decoder | Code Available | 1 |
| The Surprising Computational Power of Nondeterministic Stack RNNs | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model | Oct 3, 2022 | Language Modeling | Code Available | 1 |
| ContraCLM: Contrastive Learning For Causal Language Model | Oct 3, 2022 | Code Generation, Code Search | Code Available | 1 |
| DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents | Oct 1, 2022 | Document Understanding, Form | Code Available | 1 |
| BECEL: Benchmark for Consistency Evaluation of Language Models | Oct 1, 2022 | Language Modeling | Code Available | 1 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality Identification, Language Modeling | Code Available | 1 |
| Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks | Oct 1, 2022 | Language Modeling | Code Available | 1 |
| polyBERT: A chemical language model to enable fully machine-driven ultrafast polymer informatics | Sep 29, 2022 | Language Modeling | Code Available | 1 |
| A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing | Sep 27, 2022 | Articles, Language Modeling | Code Available | 1 |