| fairseq: A Fast, Extensible Toolkit for Sequence Modeling | Apr 1, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Annotation-Efficient Preference Optimization for Language Model Alignment | May 22, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning | Apr 17, 2020 | CPULanguage Modeling | CodeCode Available | 1 | 5 |
| Improving Neural Machine Translation Models with Monolingual Data | Nov 20, 2015 | Cross-Lingual Bitext MiningDecoder | CodeCode Available | 1 | 5 |
| Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Sep 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model | Jan 6, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Improving Mandarin Speech Recogntion with Block-augmented Transformer | Jul 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Thought for Sentence Encoder Pre-training | Oct 7, 2020 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| Improving Multi-Party Dialogue Discourse Parsing via Domain Integration | Oct 9, 2021 | Discourse ParsingDomain Adaptation | CodeCode Available | 1 | 5 |
| Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning | Dec 2, 2023 | Causal Language ModelingContrastive Learning | CodeCode Available | 1 | 5 |
| An Open Source Data Contamination Report for Large Language Models | Oct 26, 2023 | HellaSwagLanguage Modeling | CodeCode Available | 1 | 5 |
| MMBERT: Multimodal BERT Pretraining for Improved Medical VQA | Apr 3, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions | Nov 1, 2021 | Keyphrase ExtractionLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| Generating Query Focused Summaries from Query-Free Resources | Dec 29, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring | Apr 1, 2022 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 | 5 |
| FedJudge: Federated Legal Large Language Model | Sep 15, 2023 | Continual LearningFederated Learning | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages | Nov 7, 2022 | Active LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |