| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| CMD: a framework for Context-aware Model self-Detoxification | Aug 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| KR-BERT: A Small-Scale Korean-Specific Language Model | Aug 10, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Labrador: Exploring the Limits of Masked Language Modeling for Laboratory Data | Dec 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | DiversityImage Captioning | CodeCode Available | 1 | 5 |
| DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control | Feb 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Aug 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Kosmos-2: Grounding Multimodal Large Language Models to the World | Jun 26, 2023 | Image CaptioningIn-Context Learning | CodeCode Available | 1 | 5 |
| Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Dialogue State Tracking with a Language Model using Schema-Driven Prompting | Sep 15, 2021 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 | 5 |
| Dialogue-oriented Pre-training | Jun 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation | Apr 27, 2022 | DecoderDiversity | CodeCode Available | 1 | 5 |
| Learning to Attribute with Attention | Apr 18, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 | 5 |
| DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation | Apr 19, 2022 | Dialogue GenerationKnowledge Distillation | CodeCode Available | 1 | 5 |
| Learning to engineer protein flexibility | Dec 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency | Sep 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| K-PLUG: KNOWLEDGE-INJECTED PRE-TRAINED LANGUAGE MODEL FOR NATURAL LANGUAGE UNDERSTANDING AND GENERATION | Jan 1, 2021 | ChatbotDecoder | CodeCode Available | 1 | 5 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| Differentiable Language Model Adversarial Attacks on Categorical Sequence Classifiers | Jun 19, 2020 | Adversarial AttackLanguage Modeling | CodeCode Available | 1 | 5 |
| KnowMAN: Weakly Supervised Multinomial Adversarial Networks | Sep 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Citekit: A Modular Toolkit for Large Language Model Citation Generation | Aug 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |