| On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards | Jul 4, 2024 | Code Completion | CodeCode Available | 3 |
| LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding | Aug 28, 2023 | 16kCode Completion | CodeCode Available | 3 |
| Revisiting VerilogEval: A Year of Improvements in Large-Language Models for Hardware Code Generation | Aug 20, 2024 | Code CompletionCode Generation | CodeCode Available | 3 |
| Optimizing Large Language Models for OpenAPI Code Completion | May 24, 2024 | Code CompletionCode Generation | CodeCode Available | 2 |
| Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion? | Oct 2, 2024 | Code CompletionCode Generation | CodeCode Available | 2 |
| RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation | Mar 22, 2023 | Code CompletionLanguage Modeling | CodeCode Available | 2 |
| An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection | Jun 10, 2024 | Backdoor AttackCode Completion | CodeCode Available | 2 |
| LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification | Feb 24, 2025 | Code Completion | CodeCode Available | 2 |
| CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion | Mar 12, 2024 | Code CompletionSafety Alignment | CodeCode Available | 2 |
| Guiding Language Models of Code with Global Context using Monitors | Jun 19, 2023 | Code CompletionCode Generation | CodeCode Available | 2 |