| KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation | Jan 2, 2021 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 | 5 |
| KGLM: Integrating Knowledge Graph Structure in Language Models for Link Prediction | Nov 4, 2022 | Fraud DetectionKnowledge Graph Completion | CodeCode Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation RecommendationCoreference Resolution | CodeCode Available | 1 | 5 |
| KinyaBERT: a Morphology-aware Kinyarwanda Language Model | Mar 16, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Aug 4, 2021 | ClassificationFew-Shot Text Classification | CodeCode Available | 1 | 5 |
| A Language Model based Framework for New Concept Placement in Ontologies | Feb 27, 2024 | Contrastive LearningEntity Linking | CodeCode Available | 1 | 5 |
| Matrix Information Theory for Self-Supervised Learning | May 27, 2023 | Contrastive LearningGSM8K | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation | May 20, 2022 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| An In-Context Learning Agent for Formal Theorem-Proving | Oct 6, 2023 | Automated Theorem ProvingIn-Context Learning | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Keep CALM and Explore: Language Models for Action Generation in Text-based Games | Oct 6, 2020 | Action GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative FilteringLanguage Modeling | CodeCode Available | 1 | 5 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| KALA: Knowledge-Augmented Language Model Adaptation | Apr 22, 2022 | Domain AdaptationGeneral Knowledge | CodeCode Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Jump to Conclusions: Short-Cutting Transformers With Linear Transformations | Mar 16, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling | May 14, 2021 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| JuriBERT: A Masked-Language Model Adaptation for French Legal Text | Oct 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness | Jun 16, 2023 | Distributed OptimizationLanguage Modeling | CodeCode Available | 1 | 5 |
| JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding | Jun 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 | 5 |
| JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset | Mar 26, 2024 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| JobBERT: Understanding Job Titles through Skills | Sep 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |
| A Kernel-Based View of Language Model Fine-Tuning | Oct 11, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| ASR2K: Speech Recognition for Around 2000 Languages without Audio | Sep 6, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 | 5 |
| Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks | May 1, 2022 | Joint Entity and Relation ExtractionLanguage Modeling | CodeCode Available | 1 | 5 |
| Knowledge-Augmented Language Model Verification | Oct 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | DiversityImage Captioning | CodeCode Available | 1 | 5 |
| IvyGPT: InteractiVe Chinese pathwaY language model in medical domain | Jul 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| Aspect-Controlled Neural Argument Generation | Apr 30, 2020 | Data AugmentationLanguage Modeling | CodeCode Available | 1 | 5 |
| IterVM: Iterative Vision Modeling Module for Scene Text Recognition | Apr 6, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Feb 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Bayesian Flow Network Framework for Chemistry Tasks | Jul 28, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |