| Contextual Augmented Multi-Model Programming (CAMP): A Hybrid Local-Cloud Copilot Framework | Oct 20, 2024 | Code Completion, RAG | Code Available | 9 |
| StarCoder 2 and The Stack v2: The Next Generation | Feb 29, 2024 | Code Completion, Code Generation | Code Available | 7 |
| aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Oct 17, 2024 | Attribute, Code Completion | Code Available | 7 |
| Break the Sequential Dependency of LLM Inference Using Lookahead Decoding | Feb 3, 2024 | Code Completion | Code Available | 5 |
| LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression | Oct 10, 2023 | Code Completion, Few-Shot Learning | Code Available | 5 |
| Seed-Coder: Let the Code Model Curate Data for Itself | Jun 4, 2025 | Code Completion, Code Generation | Code Available | 4 |
| AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | May 23, 2024 | Class-level Code Generation, Code Completion | Code Available | 4 |
| Scaling Granite Code Models to 128K Context | Jul 18, 2024 | 2k, 4k | Code Available | 4 |
| Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection | Feb 23, 2023 | Code Completion, Computer Security | Code Available | 4 |
| On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards | Jul 4, 2024 | Code Completion | Code Available | 3 |
| PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models | Mar 26, 2024 | Code Completion, Few-Shot Learning | Code Available | 3 |
| Revisiting VerilogEval: A Year of Improvements in Large-Language Models for Hardware Code Generation | Aug 20, 2024 | Code Completion, Code Generation | Code Available | 3 |
| LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding | Aug 28, 2023 | 16k, Code Completion | Code Available | 3 |
| LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification | Feb 24, 2025 | Code Completion | Code Available | 2 |
| SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | May 22, 2025 | Bug fixing, Chatbot | Code Available | 2 |
| Guiding Language Models of Code with Global Context using Monitors | Jun 19, 2023 | Code Completion, Code Generation | Code Available | 2 |
| An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection | Jun 10, 2024 | Backdoor Attack, Code Completion | Code Available | 2 |
| EffiBench: Benchmarking the Efficiency of Automatically Generated Code | Feb 3, 2024 | Benchmarking, Code Completion | Code Available | 2 |
| RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code Completion | Mar 10, 2024 | Code Completion, Link Prediction | Code Available | 2 |
| CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion | Mar 12, 2024 | Code Completion, Safety Alignment | Code Available | 2 |
| CursorCore: Assist Programming through Aligning Anything | Oct 9, 2024 | Code Completion | Code Available | 2 |
| Optimizing Large Language Models for OpenAPI Code Completion | May 24, 2024 | Code Completion, Code Generation | Code Available | 2 |
| LangBridge: Multilingual Reasoning Without Multilingual Supervision | Jan 19, 2024 | Code Completion, Logical Reasoning | Code Available | 2 |
| Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion? | Oct 2, 2024 | Code Completion, Code Generation | Code Available | 2 |
| RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation | Mar 22, 2023 | Code Completion, Language Modeling | Code Available | 2 |
| StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback | Feb 2, 2024 | Code Completion, Code Generation | Code Available | 2 |
| Building A Coding Assistant via the Retrieval-Augmented Language Model | Oct 21, 2024 | Code Completion, Code Generation | Code Available | 1 |
| A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code | Oct 23, 2020 | Bug fixing, Code Completion | Code Available | 1 |
| MetaTPTrans: A Meta Learning Approach for Multilingual Code Representation Learning | Jun 13, 2022 | Code Completion, Code Summarization | Code Available | 1 |
| Can Language Models Replace Programmers for Coding? REPOCOD Says 'Not Yet' | Oct 29, 2024 | Code Completion, Code Generation | Code Available | 1 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code Completion, Language Modeling | Code Available | 1 |
| Long Code Arena: a Set of Benchmarks for Long-Context Code Models | Jun 17, 2024 | Code Completion, Code Generation | Code Available | 1 |
| Learning Deep Semantics for Test Completion | Feb 20, 2023 | Code Completion, Code Generation | Code Available | 1 |
| Language Models for Code Completion: A Practical Evaluation | Feb 25, 2024 | Code Completion, valid | Code Available | 1 |
| LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models | Sep 18, 2023 | Code Completion, Code Generation | Code Available | 1 |
| LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations | Mar 16, 2023 | Code Completion, Code Generation | Code Available | 1 |
| MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers | May 16, 2023 | Code Completion, Code Generation | Code Available | 1 |
| How to Get Your LLM to Generate Challenging Problems for Evaluation | Feb 20, 2025 | Code Completion, Math | Code Available | 1 |
| CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation | Feb 9, 2021 | BIG-bench Machine Learning, Clone Detection | Code Available | 1 |
| IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators | Mar 6, 2024 | Code Completion, Code Generation | Code Available | 1 |
| GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models | Nov 5, 2024 | Code Completion, Code Generation | Code Available | 1 |
| Adversarial Robustness for Code | Feb 11, 2020 | Adversarial Robustness, BIG-bench Machine Learning | Code Available | 1 |
| CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models | Feb 26, 2024 | Code Completion, Response Generation | Code Available | 1 |
| CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences | Feb 14, 2022 | Code Completion, Language Modelling | Code Available | 1 |
| Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs | Jun 26, 2024 | Code Completion | Code Available | 1 |
| How Effective Are Neural Networks for Fixing Security Vulnerabilities | May 29, 2023 | Code Completion, Program Repair | Code Available | 1 |
| Execution-based Code Generation using Deep Reinforcement Learning | Jan 31, 2023 | Code Completion, Code Generation | Code Available | 1 |
| LambdaNet: Probabilistic Type Inference using Graph Neural Networks | Apr 29, 2020 | Code Completion, Graph Neural Network | Code Available | 1 |
| BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models | Sep 23, 2023 | Code Completion, Hallucination | Code Available | 1 |
| CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context | Dec 20, 2022 | Code Completion | Code Available | 1 |