| Toward Interpretable Semantic Textual Similarity via Optimal Transport-based Contrastive Sentence Learning | Feb 26, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Interpreting Language Models with Contrastive Explanations | Feb 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Transformer Quality in Linear Time | Feb 21, 2022 | 8kLanguage Modeling | CodeCode Available | 1 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 |
| LAMP: Extracting Text from Gradients with Language Model Priors | Feb 17, 2022 | Federated LearningLanguage Modeling | CodeCode Available | 1 |
| Should You Mask 15% in Masked Language Modeling? | Feb 16, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings | Feb 14, 2022 | Citation PredictionContrastive Learning | CodeCode Available | 1 |
| Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations | Feb 9, 2022 | ClusteringLanguage Modeling | CodeCode Available | 1 |
| Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling | Feb 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unified Scaling Laws for Routed Language Models | Feb 2, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| What Has Been Enhanced in my Knowledge-Enhanced Language Model? | Feb 2, 2022 | Graph AttentionLanguage Modeling | CodeCode Available | 1 |
| Regression Transformer: Concurrent sequence regression and generation for molecular language modeling | Feb 1, 2022 | Conditional Text GenerationInductive Bias | CodeCode Available | 1 |
| MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning | Jan 29, 2022 | Image-text matchingLanguage Modeling | CodeCode Available | 1 |
| Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models | Jan 26, 2022 | Grapheme-to-Phoneme ConversionLanguage Modeling | CodeCode Available | 1 |
| Korean-Specific Dataset for Table Question Answering | Jan 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer | Jan 14, 2022 | ClassificationContrastive Learning | CodeCode Available | 1 |
| Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model | Jan 6, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Dec 29, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate | Dec 29, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition | Dec 24, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Dec 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Learning To Retrieve Prompts for In-Context Learning | Dec 16, 2021 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Self-Supervised Learning for speech recognition with Intermediate layer supervision | Dec 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AcTune: Uncertainty-aware Active Self-Training for Semi-Supervised Active Learning with Pretrained Language Models | Dec 16, 2021 | Active LearningLanguage Modeling | CodeCode Available | 1 |
| Efficient Hierarchical Domain Adaptation for Pretrained Language Models | Dec 16, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |