| CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks | May 25, 2021 | BIG-bench Machine LearningCode Classification | CodeCode Available | 2 |
| CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection | Mar 12, 2025 | BenchmarkingCode Classification | CodeCode Available | 1 |
| Graph Convolutions Enrich the Self-Attention in Transformers! | Dec 7, 2023 | Clone Detection | CodeCode Available | 1 |
| A General-Purpose Self-Supervised Model for Computational Pathology | Aug 29, 2023 | Code ClassificationDiagnostic | CodeCode Available | 1 |
| Understanding Programs by Exploiting (Fuzzing) Test Cases | May 23, 2023 | Clone DetectionCode Classification | CodeCode Available | 1 |
| Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification | May 7, 2023 | Code ClassificationGraph Neural Network | CodeCode Available | 1 |
| MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation | Oct 6, 2022 | ClassificationCode Classification | CodeCode Available | 1 |
| Learning Program Semantics with Code Representations: An Empirical Study | Mar 22, 2022 | Clone DetectionCode Classification | CodeCode Available | 1 |
| Embedding Java Classes with code2vec: Improvements from Variable Obfuscation | Apr 6, 2020 | Code ClassificationMethod name prediction | CodeCode Available | 1 |
| Federated Learning for ICD Classification with Lightweight Models and Pretrained Embeddings | Jul 3, 2025 | Code ClassificationFederated Learning | —Unverified | 0 |
| ORIGAMI: A generative transformer architecture for predictions from semi-structured data | Dec 23, 2024 | Code ClassificationMulti-Label Classification | —Unverified | 0 |
| Large Language Model in Medical Informatics: Direct Classification and Enhanced Text Representations for Automatic ICD Coding | Nov 11, 2024 | ClassificationCode Classification | —Unverified | 0 |
| More Questions than Answers? Lessons from Integrating Explainable AI into a Cyber-AI Tool | Aug 8, 2024 | Code Classification | —Unverified | 0 |
| Enhancing Source Code Classification Effectiveness via Prompt Learning Incorporating Knowledge Features | Jan 10, 2024 | ClassificationCode Classification | CodeCode Available | 0 |
| Sparse Attention-Based Neural Networks for Code Classification | Nov 11, 2023 | ClassificationCode Classification | —Unverified | 0 |
| Replication and Extension of Schnappinger’s Study on Human-level Ordinal Maintainability Prediction Based on Static Code Metrics | Jun 14, 2023 | Code Classification | —Unverified | 0 |
| InProC: Industry and Product/Service Code Classification | May 22, 2023 | AI AgentClassification | —Unverified | 0 |
| The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification | May 8, 2023 | Code ClassificationDefect Detection | CodeCode Available | 0 |
| xASTNN: Improved Code Representations for Industrial Practice | Mar 13, 2023 | Clone DetectionCode Classification | —Unverified | 0 |
| Boosting Source Code Learning with Text-Oriented Data Augmentation: An Empirical Study | Mar 13, 2023 | Clone DetectionCode Classification | —Unverified | 0 |
| On the Effectiveness of Hybrid Pooling in Mixup-Based Graph Learning for Language Processing | Oct 6, 2022 | Code ClassificationData Augmentation | CodeCode Available | 0 |
| Adding Context to Source Code Representations for Deep Learning | Jul 30, 2022 | Code ClassificationDeep Learning | —Unverified | 0 |
| CodeS: Towards Code Model Generalization Under Distribution Shift | Jun 11, 2022 | BenchmarkingCode Classification | CodeCode Available | 0 |
| HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations | May 31, 2022 | Clone DetectionCode Classification | —Unverified | 0 |
| Towards Using Data-Influence Methods to Detect Noisy Samples in Source Code Corpora | May 25, 2022 | Code ClassificationRepresentation Learning | —Unverified | 0 |