| Traces of Memorisation in Large Language Models for Code | Dec 18, 2023 | Code Completion | Code Available | 0 |
| A Review of Repository Level Prompting for LLMs | Dec 15, 2023 | Code Completion, Code Generation | Unverified | 0 |
| Breaking the Silence: the Threats of Using LLMs in Software Engineering | Dec 13, 2023 | Code Completion, Code Summarization | Code Available | 0 |
| INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers | Dec 8, 2023 | Code Completion, Diagnostic | Code Available | 0 |
| Interpretability Illusions in the Generalization of Simplified Models | Dec 6, 2023 | Code Completion, Dimensionality Reduction | Unverified | 0 |
| GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding | Nov 16, 2023 | Code Completion, Code Generation | Code Available | 0 |
| Past as a Guide: Leveraging Retrospective Learning for Python Code Completion | Nov 13, 2023 | Code Completion, HumanEval | Unverified | 0 |
| Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications | Nov 7, 2023 | Code Completion | Unverified | 0 |
| Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation | Nov 1, 2023 | Code Completion, Language Modeling | Unverified | 0 |
| CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion | Oct 17, 2023 | Code Completion, HumanEval | Code Available | 1 |