SOTAVerified

Code Completion

Papers

Showing 101-125 of 212 papers

| Title | Status | Hype |
| --- | --- | --- |
| Protect Your Secrets: Understanding and Measuring Data Exposure in VSCode Extensions | | 0 |
| LibEvolutionEval: A Benchmark and Study for Version-Specific Code Generation | | 0 |
| FastDraft: How to Train Your Draft | | 0 |
| CoCoP: Enhancing Text Classification with LLM through Code Completion Prompt | | 0 |
| JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking | | 0 |
| M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation | | 0 |
| KV Prediction for Improved Time to First Token | Code | 0 |
| CodeCipher: Learning to Obfuscate Source Code Against LLMs | | 0 |
| Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning | | 0 |
| Statically Contextualizing Large Language Models with Typed Holes | | 0 |
| MarsCode Agent: AI-native Automated Bug Fixing | | 0 |
| Retrieval-augmented code completion for local projects using large language models | | 0 |
| RepoMasterEval: Evaluating Code Completion via Real-World Repositories | | 0 |
| Black-Box Adversarial Attacks on LLM-Based Code Completion | | 0 |
| TaskEval: Assessing Difficulty of Code Generation Tasks for Large Language Models | | 0 |
| Evaluating Long Range Dependency Handling in Code Generation Models using Multi-Step Key Retrieval | | 0 |
| Curriculum Learning for Small Code Language Models | | 0 |
| TPIA: Towards Target-specific Prompt Injection Attack against Code-oriented Large Language Models | | 0 |
| Jailbreak Attacks and Defenses Against Large Language Models: A Survey | | 0 |
| Measuring memorization in RLHF for code completion | | 0 |
| CodeGemma: Open Code Models Based on Gemma | | 0 |
| Chain of Agents: Large Language Models Collaborating on Long-Context Tasks | | 0 |
| R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models | | 0 |
| Model Cascading for Code: A Cascaded Black-Box Multi-Model Framework for Cost-Efficient Code Completion with Self-Testing | | 0 |
| A Transformer-Based Approach for Smart Invocation of Automatic Code Completion | Code | 0 |

Benchmark Results

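| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | deepseek-coder-33b-base | Average | 69.01 | | Unverified |
| 2 | deepseek-coder-6.7b-base | Average | 63.4 | | Unverified |
| 3 | starcoderbase | Average | 55.54 | | Unverified |
| 4 | gpt-4-1106-preview | Average | 53.28 | | Unverified |
| 5 | CodeLlama-13b-hf | Average | 52.78 | | Unverified |
| 6 | deepseek-coder-1.3b-base | Average | 52.63 | | Unverified |
| 7 | CodeLlama-34b-hf | Average | 49.66 | | Unverified |
| 8 | CodeLlama-7b-hf | Average | 45 | | Unverified |
| 9 | gpt-3.5-turbo-0301 | Average | 40.86 | | Unverified |
| 10 | incoder-6B | Average | 33.79 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CodeGPT-adapted | Accuracy (token-level) | 77.13 | | Unverified |
| 2 | CodeT5+ 770M | EM (line-level) | 37.9 | | Unverified |
| 3 | CodeT5+ 220M | EM (line-level) | 35.17 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CodeGPT-adapted | Accuracy (token-level) | 75.11 | | Unverified |
| 2 | CodeT5+ 770M | EM (line-level) | 44.86 | | Unverified |
| 3 | CodeT5+ 220M | EM (line-level) | 43.42 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SantaCoder-MGD | Compilation Rate | 73.03 | | Unverified |
| 2 | SantaCoder | Compilation Rate | 59.97 | | Unverified |
| 3 | SantaCoder | Compilation Rate | 59.79 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Rambo | Compilation Rate | 76.47 | | Unverified |
| 2 | RepoCoder | Compilation Rate | 74.02 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Rambo | Compilation Rate | 61.7 | | Unverified |
| 2 | RepoCoder | Compilation Rate | 58.09 | | Unverified |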