Code Completion

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 212 papers

Title	Date	Tasks	Status	Hype
Contextual Augmented Multi-Model Programming (CAMP): A Hybrid Local-Cloud Copilot Framework	Oct 20, 2024	Code CompletionRAG	CodeCode Available	9
StarCoder 2 and The Stack v2: The Next Generation	Feb 29, 2024	Code CompletionCode Generation	CodeCode Available	7
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing	Oct 17, 2024	AttributeCode Completion	CodeCode Available	7
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding	Feb 3, 2024	Code Completion	CodeCode Available	5
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression	Oct 10, 2023	Code CompletionFew-Shot Learning	CodeCode Available	5
Seed-Coder: Let the Code Model Curate Data for Itself	Jun 4, 2025	Code CompletionCode Generation	CodeCode Available	4
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct	May 23, 2024	Class-level Code GenerationCode Completion	CodeCode Available	4
Scaling Granite Code Models to 128K Context	Jul 18, 2024	2k4k	CodeCode Available	4
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection	Feb 23, 2023	Code CompletionComputer Security	CodeCode Available	4
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards	Jul 4, 2024	Code Completion	CodeCode Available	3
PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models	Mar 26, 2024	Code CompletionFew-Shot Learning	CodeCode Available	3
Revisiting VerilogEval: A Year of Improvements in Large-Language Models for Hardware Code Generation	Aug 20, 2024	Code CompletionCode Generation	CodeCode Available	3
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding	Aug 28, 2023	16kCode Completion	CodeCode Available	3
LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification	Feb 24, 2025	Code Completion	CodeCode Available	2
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development	May 22, 2025	Bug fixingChatbot	CodeCode Available	2
Guiding Language Models of Code with Global Context using Monitors	Jun 19, 2023	Code CompletionCode Generation	CodeCode Available	2
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection	Jun 10, 2024	Backdoor AttackCode Completion	CodeCode Available	2
EffiBench: Benchmarking the Efficiency of Automatically Generated Code	Feb 3, 2024	BenchmarkingCode Completion	CodeCode Available	2
RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code Completion	Mar 10, 2024	Code CompletionLink Prediction	CodeCode Available	2
CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion	Mar 12, 2024	Code CompletionSafety Alignment	CodeCode Available	2
CursorCore: Assist Programming through Aligning Anything	Oct 9, 2024	Code Completion	CodeCode Available	2
Optimizing Large Language Models for OpenAPI Code Completion	May 24, 2024	Code CompletionCode Generation	CodeCode Available	2
LangBridge: Multilingual Reasoning Without Multilingual Supervision	Jan 19, 2024	Code CompletionLogical Reasoning	CodeCode Available	2
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?	Oct 2, 2024	Code CompletionCode Generation	CodeCode Available	2
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation	Mar 22, 2023	Code CompletionLanguage Modeling	CodeCode Available	2
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback	Feb 2, 2024	Code CompletionCode Generation	CodeCode Available	2
Building A Coding Assistant via the Retrieval-Augmented Language Model	Oct 21, 2024	Code CompletionCode Generation	CodeCode Available	1
A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code	Oct 23, 2020	Bug fixingCode Completion	CodeCode Available	1
MetaTPTrans: A Meta Learning Approach for Multilingual Code Representation Learning	Jun 13, 2022	Code CompletionCode Summarization	CodeCode Available	1
Can Language Models Replace Programmers for Coding? REPOCOD Says 'Not Yet'	Oct 29, 2024	Code CompletionCode Generation	CodeCode Available	1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation	Mar 25, 2025	Code CompletionLanguage Modeling	CodeCode Available	1
Long Code Arena: a Set of Benchmarks for Long-Context Code Models	Jun 17, 2024	Code CompletionCode Generation	CodeCode Available	1
Learning Deep Semantics for Test Completion	Feb 20, 2023	Code CompletionCode Generation	CodeCode Available	1
Language Models for Code Completion: A Practical Evaluation	Feb 25, 2024	Code Completionvalid	CodeCode Available	1
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models	Sep 18, 2023	Code CompletionCode Generation	CodeCode Available	1
LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations	Mar 16, 2023	Code CompletionCode Generation	CodeCode Available	1
MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers	May 16, 2023	Code CompletionCode Generation	CodeCode Available	1
How to Get Your LLM to Generate Challenging Problems for Evaluation	Feb 20, 2025	Code CompletionMath	CodeCode Available	1
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation	Feb 9, 2021	BIG-bench Machine LearningClone Detection	CodeCode Available	1
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators	Mar 6, 2024	Code CompletionCode Generation	CodeCode Available	1
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models	Nov 5, 2024	Code CompletionCode Generation	CodeCode Available	1
Adversarial Robustness for Code	Feb 11, 2020	Adversarial RobustnessBIG-bench Machine Learning	CodeCode Available	1
CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models	Feb 26, 2024	Code CompletionResponse Generation	CodeCode Available	1
CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences	Feb 14, 2022	Code CompletionLanguage Modelling	CodeCode Available	1
Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs	Jun 26, 2024	Code Completion	CodeCode Available	1
How Effective Are Neural Networks for Fixing Security Vulnerabilities	May 29, 2023	Code CompletionProgram Repair	CodeCode Available	1
Execution-based Code Generation using Deep Reinforcement Learning	Jan 31, 2023	Code CompletionCode Generation	CodeCode Available	1
LambdaNet: Probabilistic Type Inference using Graph Neural Networks	Apr 29, 2020	Code CompletionGraph Neural Network	CodeCode Available	1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models	Sep 23, 2023	Code CompletionHallucination	CodeCode Available	1
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context	Dec 20, 2022	Code Completion	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 5Next →

All datasets SAFIM CodeXGLUE - Github Java Corpus CodeXGLUE - PY150 DotPrompts Defects4J Rambo Benchmark

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	deepseek-coder-33b-base	Average	69.01	—	Unverified
2	deepseek-coder-6.7b-base	Average	63.4	—	Unverified
3	starcoderbase	Average	55.54	—	Unverified
4	gpt-4-1106-preview	Average	53.28	—	Unverified
5	CodeLlama-13b-hf	Average	52.78	—	Unverified
6	deepseek-coder-1.3b-base	Average	52.63	—	Unverified
7	CodeLlama-34b-hf	Average	49.66	—	Unverified
8	CodeLlama-7b-hf	Average	45	—	Unverified
9	gpt-3.5-turbo-0301	Average	40.86	—	Unverified
10	incoder-6B	Average	33.79	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CodeGPT-adapted	Accuracy (token-level)	77.13	—	Unverified
2	CodeT5+ 770M	EM (line-level)	37.9	—	Unverified
3	CodeT5+ 220M	EM (line-level)	35.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CodeGPT-adapted	Accuracy (token-level)	75.11	—	Unverified
2	CodeT5+ 770M	EM (line-level)	44.86	—	Unverified
3	CodeT5+ 220M	EM (line-level)	43.42	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SantaCoder-MGD	Compilation Rate	73.03	—	Unverified
2	SantaCoder	Compilation Rate	59.97	—	Unverified
3	SantaCoder	Compilation Rate	59.79	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Rambo	Compilation Rate	76.47	—	Unverified
2	RepoCoder	Compilation Rate	74.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Rambo	Compilation Rate	61.7	—	Unverified
2	RepoCoder	Compilation Rate	58.09	—	Unverified