| LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations | Mar 16, 2023 | Code CompletionCode Generation | CodeCode Available | 1 | 5 |
| CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation | Feb 9, 2021 | BIG-bench Machine LearningClone Detection | CodeCode Available | 1 | 5 |
| Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion | May 30, 2024 | Code CompletionRetrieval | CodeCode Available | 1 | 5 |
| IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators | Mar 6, 2024 | Code CompletionCode Generation | CodeCode Available | 1 | 5 |
| Language Models for Code Completion: A Practical Evaluation | Feb 25, 2024 | Code Completionvalid | CodeCode Available | 1 | 5 |
| DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective Partitioning | Sep 2, 2024 | Code CompletionCombinatorial Optimization | CodeCode Available | 1 | 5 |
| Empirical Study of Transformers for Source Code | Oct 15, 2020 | Bug fixingCode Completion | CodeCode Available | 1 | 5 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code CompletionLanguage Modeling | CodeCode Available | 1 | 5 |
| Building A Coding Assistant via the Retrieval-Augmented Language Model | Oct 21, 2024 | Code CompletionCode Generation | CodeCode Available | 1 | 5 |
| CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection | Mar 12, 2025 | BenchmarkingCode Classification | CodeCode Available | 1 | 5 |