| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RL, Language Modeling | Code Available | 1 | 5 |
| Graph Neural Prompting with Large Language Models | Sep 27, 2023 | Graph Neural Network, Knowledge Graphs | Code Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text Summarization, Language Modeling | Code Available | 1 | 5 |
| GraphLLM: Boosting Graph Reasoning Ability of Large Language Model | Oct 9, 2023 | Graph Learning, Language Modeling | Code Available | 1 | 5 |
| GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Oct 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training | Feb 16, 2021 | Image Classification, Language Modeling | Code Available | 1 | 5 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Jan 26, 2023 | CPU, GPU | Code Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 | 5 |
| GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints | May 22, 2023 | Decoder, Language Modeling | Code Available | 1 | 5 |
| Addressing Some Limitations of Transformers with Feedback Memory | Feb 21, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation | Oct 21, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression | May 17, 2023 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | Diversity, Image Captioning | Code Available | 1 | 5 |
| GPT-too: A language-model-first approach for AMR-to-text generation | May 18, 2020 | AMR-to-Text Generation, Data-to-Text Generation | Code Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | Diversity, Language Modeling | Code Available | 1 | 5 |
| GPT-NeoX-20B: An Open-Source Autoregressive Language Model | Apr 14, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Analyzing the Generalization and Reliability of Steering Vectors | Jul 17, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| SentenceMIM: A Latent Variable Language Model | Feb 18, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Copy Is All You Need | Jul 13, 2023 | All, Domain Adaptation | Code Available | 1 | 5 |
| Control Prefixes for Parameter-Efficient Text Generation | Oct 15, 2021 | Abstractive Text Summarization, Attribute | Code Available | 1 | 5 |
| AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Aug 1, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |