| Long-Short Transformer: Efficient Transformers for Language and Vision | Jul 5, 2021 | Language Modeling | Code Available | 1 |
| Robust End-to-End Offline Chinese Handwriting Text Page Spotter with Text Kernel | Jul 4, 2021 | Language Modeling | Code Available | 1 |
| R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling | Jul 2, 2021 | Language Modeling | Code Available | 1 |
| XLM-E: Cross-lingual Language Model Pre-training via ELECTRA | Jun 30, 2021 | Language Modeling | Code Available | 1 |
| ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information | Jun 30, 2021 | Language Modeling | Code Available | 1 |
| R-Drop: Regularized Dropout for Neural Networks | Jun 28, 2021 | Abstractive Text Summarization, Image Classification | Code Available | 1 |
| Stabilizing Equilibrium Models by Jacobian Regularization | Jun 28, 2021 | Language Modeling | Code Available | 1 |
| SymbolicGPT: A Generative Transformer Model for Symbolic Regression | Jun 27, 2021 | Language Modeling | Code Available | 1 |
| CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | Jun 21, 2021 | Language Modeling | Code Available | 1 |
| Distributed Deep Learning in Open Collaborations | Jun 18, 2021 | Deep Learning, Language Modeling | Code Available | 1 |