| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactual, Data Augmentation | Code Available | 1 | 5 |
| Efficient recurrent architectures through activity sparsity and sparse back-propagation through time | Jun 13, 2022 | Gesture Recognition, Language Modeling | Code Available | 1 | 5 |
| Language Models Implement Simple Word2Vec-style Vector Arithmetic | May 25, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model | Mar 15, 2024 | Cloze Test, Distractor Generation | Code Available | 1 | 5 |
| CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data | Apr 28, 2023 | document understanding, Language Modeling | Code Available | 1 | 5 |
| Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method | Jun 11, 2023 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| Language Models Encode the Value of Numbers Linearly | Jan 8, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Oct 5, 2022 | Automatic Speech Recognition (ASR), Clustering | Code Available | 1 | 5 |
| ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation | Oct 24, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 | 5 |