| Kosmos-2: Grounding Multimodal Large Language Models to the World | Jun 26, 2023 | Image CaptioningIn-Context Learning | CodeCode Available | 1 | 5 |
| Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender Systems | Dec 18, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 | 5 |
| Knowledge graph enhanced retrieval-augmented generation for failure mode and effects analysis | Jun 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Knowledge Graph Generation From Text | Nov 18, 2022 | Graph GenerationJoint Entity and Relation Extraction | CodeCode Available | 1 | 5 |
| Knowledge-Grounded Dialogue Generation with Pre-trained Language Models | Oct 17, 2020 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Python Code Generation by Asking Clarification Questions | Dec 19, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| A Batch Normalized Inference Network Keeps the KL Vanishing Away | Apr 27, 2020 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 | 5 |
| Knowledge Perceived Multi-modal Pretraining in E-commerce | Aug 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| K-PLUG: KNOWLEDGE-INJECTED PRE-TRAINED LANGUAGE MODEL FOR NATURAL LANGUAGE UNDERSTANDING AND GENERATION | Jan 1, 2021 | ChatbotDecoder | CodeCode Available | 1 | 5 |
| A single-cell gene expression language model | Oct 25, 2022 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Knowledge Enhanced Masked Language Model for Stance Detection | May 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering | Nov 7, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Apr 15, 2024 | Cross-Modal RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Knowledge Distillation for BERT Unsupervised Domain Adaptation | Oct 22, 2020 | Domain AdaptationGeneral Classification | CodeCode Available | 1 | 5 |
| A Simple Long-Tailed Recognition Baseline via Vision-Language Model | Nov 29, 2021 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification | Aug 5, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Simple Language Model for Task-Oriented Dialogue | May 2, 2020 | Dialogue State TrackingEnd-To-End Dialogue Modelling | CodeCode Available | 1 | 5 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Knowledge-Augmented Language Model Verification | Oct 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration | May 5, 2022 | Contrastive LearningDialogue Generation | CodeCode Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 | 5 |
| 14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon | Jun 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples | Nov 22, 2021 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 | 5 |
| Knowledge Graph-Driven Retrieval-Augmented Generation: Integrating Deepseek-R1 with Weaviate for Advanced Chatbot Applications | Feb 16, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 | 5 |
| K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce | Apr 14, 2021 | DecoderKnowledge Base Completion | CodeCode Available | 1 | 5 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction | Feb 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| KGLM: Integrating Knowledge Graph Structure in Language Models for Link Prediction | Nov 4, 2022 | Fraud DetectionKnowledge Graph Completion | CodeCode Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |
| Matrix Information Theory for Self-Supervised Learning | May 27, 2023 | Contrastive LearningGSM8K | CodeCode Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Copy Is All You Need | Jul 13, 2023 | AllDomain Adaptation | CodeCode Available | 1 | 5 |