| A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder | Nov 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Small Data? No Problem! Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions | Nov 1, 2021 | Keyphrase ExtractionLanguage Modeling | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Nov 1, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| TSDAE: Using Transformer-based Sequential Denoising Auto-Encoderfor Unsupervised Sentence Embedding Learning | Nov 1, 2021 | DenoisingDomain Adaptation | CodeCode Available | 1 |
| Efficiently Modeling Long Sequences with Structured State Spaces | Oct 31, 2021 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| Top1 Solution of QQ Browser 2021 Ai Algorithm Competition Track 1 : Multimodal Video Similarity | Oct 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Scatterbrain: Unifying Sparse and Low-rank Attention Approximation | Oct 28, 2021 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| ÚFAL at MultiLexNorm 2021: Improving Multilingual Lexical Normalization by Fine-tuning ByT5 | Oct 28, 2021 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 |
| Discovering Non-monotonic Autoregressive Orderings with Variational Inference | Oct 27, 2021 | DecoderImage Captioning | CodeCode Available | 1 |
| Deciphering the Language of Nature: A transformer-based language model for deleterious mutations in proteins | Oct 27, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hierarchical Transformers Are More Efficient Language Models | Oct 26, 2021 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain | Oct 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Spanish Legalese Language Model and Corpora | Oct 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ClimateBert: A Pretrained Language Model for Climate-Related Text | Oct 22, 2021 | ArticlesFact Checking | CodeCode Available | 1 |
| LMSOC: An Approach for Socially Sensitive Pretraining | Oct 20, 2021 | Cloze TestGraph Representation Learning | CodeCode Available | 1 |
| Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization | Oct 18, 2021 | BIG-bench Machine Learningimage-classification | CodeCode Available | 1 |
| GNN-LM: Language Modeling based on Global Contexts via GNN | Oct 17, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models | Oct 16, 2021 | counterfactualData Augmentation | CodeCode Available | 1 |
| Improving Transformers with Probabilistic Attention Keys | Oct 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hydra: A System for Large Multi-Model Deep Learning | Oct 16, 2021 | Deep LearningGPU | CodeCode Available | 1 |
| Invariant Language Modeling | Oct 16, 2021 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| Coherence boosting: When your pretrained language model is not paying enough attention | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |