| RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Nov 16, 2022 | Dimensionality ReductionInformation Retrieval | CodeCode Available | 2 |
| Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline Study | Oct 19, 2022 | Data AugmentationRelation | CodeCode Available | 2 |
| CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers | Oct 1, 2022 | Grammatical Error CorrectionSentence | CodeCode Available | 2 |
| TEACH: Temporal Action Composition for 3D Humans | Sep 9, 2022 | Motion SynthesisSentence | CodeCode Available | 2 |
| Comprehending and Ordering Semantics for Image Captioning | Jun 14, 2022 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 2 |
| Compositional Visual Generation with Composable Diffusion Models | Jun 3, 2022 | Sentence | CodeCode Available | 2 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | DecoderInformation Retrieval | CodeCode Available | 2 |
| "I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset | May 18, 2022 | Sentence | CodeCode Available | 2 |
| NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality | May 9, 2022 | SentenceSpeech Synthesis | CodeCode Available | 2 |
| MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction | Apr 23, 2022 | Grammatical Error CorrectionSentence | CodeCode Available | 2 |
| Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval | Apr 21, 2022 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 2 |
| DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings | Apr 21, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing | Feb 21, 2022 | Few-Shot LearningSentence | CodeCode Available | 2 |
| SGPT: GPT Sentence Embeddings for Semantic Search | Feb 17, 2022 | Argument RetrievalBiomedical Information Retrieval | CodeCode Available | 2 |
| PromptBERT: Improving BERT Sentence Embeddings with Prompts | Jan 12, 2022 | Contrastive LearningDenoising | CodeCode Available | 2 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | SentenceSpeech-to-Speech Translation | CodeCode Available | 2 |
| Deduplicating Training Data Makes Language Models Better | Jul 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SimCSE: Simple Contrastive Learning of Sentence Embeddings | Apr 18, 2021 | Contrastive LearningData Augmentation | CodeCode Available | 2 |
| Pretrained Transformers for Text Ranking: BERT and Beyond | Oct 13, 2020 | Information RetrievalReranking | CodeCode Available | 2 |
| Abstractive Summarization of Spoken andWritten Instructions with BERT | Aug 21, 2020 | Abstractive Text SummarizationArticles | CodeCode Available | 2 |
| Reevaluating Adversarial Examples in Natural Language | Apr 25, 2020 | Sentence | CodeCode Available | 2 |
| MPNet: Masked and Permuted Pre-training for Language Understanding | Apr 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CLUE: A Chinese Language Understanding Evaluation Benchmark | Apr 13, 2020 | General ClassificationMachine Reading Comprehension | CodeCode Available | 2 |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Sep 26, 2019 | Common Sense ReasoningGPU | CodeCode Available | 2 |
| PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Aug 30, 2019 | Paraphrase IdentificationSentence | CodeCode Available | 2 |