| Accelerating Vision-Language Pretraining with Free Language Modeling | Mar 24, 2023 | GPU, Language Modeling | Code Available | 1 |
| Representation Deficiency in Masked Language Modeling | Feb 4, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning | Dec 23, 2022 | Dependency Parsing, Language Modeling | Code Available | 1 |
| Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking | Dec 15, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Nonparametric Masked Language Modeling | Dec 2, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Self-supervised vision-language pretraining for Medical visual question answering | Nov 24, 2022 | Contrastive Learning, Image-text matching | Code Available | 1 |
| Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning | Nov 24, 2022 | cross-modal alignment, Image-text Retrieval | Code Available | 1 |
| Unified Multimodal Model with Unlikelihood Training for Visual Dialog | Nov 23, 2022 | Answer Generation, Chatbot | Code Available | 1 |
| Leveraging Label Correlations in a Multi-label Setting: A Case Study in Emotion | Oct 28, 2022 | Emotion Recognition, Language Modeling | Code Available | 1 |
| Generative Prompt Tuning for Relation Classification | Oct 22, 2022 | Classification, Language Modeling | Code Available | 1 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Oct 11, 2022 | Computational Efficiency, Language Modeling | Code Available | 1 |
| MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Oct 11, 2022 | Contrastive Learning, Image-text matching | Code Available | 1 |
| TransPolymer: a Transformer-based language model for polymer property predictions | Sep 3, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training | Aug 8, 2022 | Image-text matching, Language Modeling | Code Available | 1 |
| Unsupervised pre-training of graph transformers on patient population graphs | Jul 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders | Jun 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction | Jun 20, 2022 | Drug Discovery, Language Modeling | Code Available | 1 |
| Zero-Shot Video Question Answering via Frozen Bidirectional Language Models | Jun 16, 2022 | Fill Mask, Language Modeling | Code Available | 1 |
| LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling | Jun 14, 2022 | Decoder, Language Modeling | Code Available | 1 |
| Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Jun 1, 2022 | Contrastive Learning, Cross-Lingual Transfer | Code Available | 1 |
| Training and Inference on Any-Order Autoregressive Models the Right Way | May 26, 2022 | Image Inpainting, Language Modeling | Code Available | 1 |
| Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling | May 25, 2022 | Causal Language Modeling, Language Modeling | Code Available | 1 |
| Declaration-based Prompt Tuning for Visual Question Answering | May 5, 2022 | Image-text matching, Language Modeling | Code Available | 1 |
| Contrastive Learning for Prompt-Based Few-Shot Language Learners | May 3, 2022 | Contrastive Learning, In-Context Learning | Code Available | 1 |
| Unsupervised Dependency Graph Network | May 1, 2022 | Dependency Parsing, Language Modeling | Code Available | 1 |
| Generative power of a protein language model trained on multiple sequence alignments | Apr 14, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? | Apr 12, 2022 | Decoder, Language Modeling | Code Available | 1 |
| Contextual Representation Learning beyond Masked Language Modeling | Apr 8, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| SecureBERT: A Domain-Specific Language Model for Cybersecurity | Apr 6, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| POS-BERT: Point Cloud One-Stage BERT Pre-Training | Apr 3, 2022 | Contrastive Learning, Language Modeling | Code Available | 1 |
| What to Hide from Your Students: Attention-Guided Masked Image Modeling | Mar 23, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation | Mar 22, 2022 | Decision Making, Language Modeling | Code Available | 1 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Mar 20, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Transformer Quality in Linear Time | Feb 21, 2022 | 8k, Language Modeling | Code Available | 1 |
| Should You Mask 15% in Masked Language Modeling? | Feb 16, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling | Feb 7, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning | Jan 29, 2022 | Image-text matching, Language Modeling | Code Available | 1 |
| Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer | Jan 14, 2022 | Classification, Contrastive Learning | Code Available | 1 |
| EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate | Dec 29, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Causal Distillation for Language Models | Dec 5, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | Nov 15, 2021 | image-classification, Image Classification | Code Available | 1 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image Captioning, Language Modeling | Code Available | 1 |
| Composable Sparse Fine-Tuning for Cross-Lingual Transfer | Oct 14, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations | Sep 15, 2021 | CoLA, Contrastive Learning | Code Available | 1 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | Decoder, Denoising | Code Available | 1 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Frustratingly Simple Pretraining Alternatives to Masked Language Modeling | Sep 4, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations | Sep 1, 2021 | Emotion Classification, Language Modeling | Code Available | 1 |