| WATT: Weight Average Test-Time Adaptation of CLIP | Jun 19, 2024 | image-classification, Image Classification | Code Available | 2 |
| Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning | Oct 21, 2023 | Overall - Test, Problem Decomposition | Code Available | 1 |
| Comparative study of deep learning methods for the automatic segmentation of lung, lesion and lesion type in CT scans of COVID-19 patients | Jul 29, 2020 | Lesion Segmentation, Overall - Test | Code Available | 1 |
| Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models | May 24, 2023 | Overall - Test | Code Available | 1 |
| FreeLB: Enhanced Adversarial Training for Natural Language Understanding | Sep 25, 2019 | ARC, Natural Language Understanding | Code Available | 1 |
| Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment | Apr 6, 2022 | Overall - Test, Question Answering | Code Available | 1 |
| Amplifying Membership Exposure via Data Poisoning | Nov 1, 2022 | Data Poisoning, Overall - Test | Code Available | 1 |
| Contraction Properties of the Global Workspace Primitive | Oct 2, 2023 | Overall - Test | Unverified | 0 |
| Cost-Saving LLM Cascades with Early Abstention | Feb 13, 2025 | GSM8K, MMLU | Unverified | 0 |
| Fast and accurate classification of echocardiograms using deep learning | Jun 27, 2017 | Classification, Deep Learning | Unverified | 0 |
| Fault Sneaking Attack: a Stealthy Framework for Misleading Deep Neural Networks | May 28, 2019 | Overall - Test | Unverified | 0 |
| GanDef: A GAN based Adversarial Training Defense for Neural Network Classifier | Mar 6, 2019 | feature selection, Overall - Test | Unverified | 0 |
| AI5GTest: AI-Driven Specification-Aware Automated Testing and Validation of 5G O-RAN Components | Jun 11, 2025 | Overall - Test | Unverified | 0 |
| mmID: High-Resolution mmWave Imaging for Human Identification | Feb 1, 2024 | Activity Recognition, Overall - Test | Unverified | 0 |
| Modeling speech emotion with label variance and analyzing performance across speakers and unseen acoustic conditions | Mar 24, 2025 | Emotion Recognition, Overall - Test | Unverified | 0 |
| Network two-sample test for block models | Jun 10, 2024 | Graph Matching, Overall - Test | Unverified | 0 |
| Optimal Layer Selection for Latent Data Augmentation | Aug 24, 2024 | Data Augmentation, image-classification | Unverified | 0 |
| Predicting the Outcome of Judicial Decisions made by the European Court of Human Rights | Dec 16, 2019 | Articles, BIG-bench Machine Learning | Unverified | 0 |
| Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models | Oct 8, 2024 | Hallucination, Overall - Test | Unverified | 0 |
| Application of DenseNet in Camera Model Identification and Post-processing Detection | May 27, 2019 | Image Forensics, Overall - Test | Unverified | 0 |
| Artificial Data Point Generation in Clustered Latent Space for Small Medical Datasets | Sep 26, 2024 | Overall - Test, Synthetic Data Generation | Unverified | 0 |
| Attention Tree: Learning Hierarchies of Visual Features for Large-Scale Image Recognition | Aug 1, 2016 | image-classification, Image Classification | Unverified | 0 |
| Automated Human Cell Classification in Sparse Datasets using Few-Shot Learning | Jul 27, 2021 | Classification, Few-Shot Learning | Unverified | 0 |
| Classifier Enhanced Deep Learning Model for Erythroblast Differentiation with Limited Data | Nov 23, 2024 | Diagnostic, Overall - Test | Unverified | 0 |
| Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers | Apr 14, 2022 | Overall - Test, Re-Ranking | Unverified | 0 |
| Targeted Data Generation: Finding and Fixing Model Weaknesses | May 28, 2023 | Data Augmentation, Natural Language Inference | Unverified | 0 |
| The Future of Software Testing: AI-Powered Test Case Generation and Validation | Sep 9, 2024 | Overall - Test, software testing | Unverified | 0 |
| Underage Detection through a Multi-Task and MultiAge Approach for Screening Minors in Unconstrained Imagery | Jun 12, 2025 | Age Estimation, Overall - Test | Unverified | 0 |
| Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs | Mar 20, 2025 | Diversity, Large Language Model | Unverified | 0 |
| Solving the Same-Different Task with Convolutional Neural Networks | Jan 22, 2021 | Overall - Test, Zero-shot Generalization | Unverified | 0 |
| Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine Translation | Oct 10, 2020 | Machine Translation, NMT | Code Available | 0 |
| Transferable Availability Poisoning Attacks | Oct 8, 2023 | Contrastive Learning, Data Poisoning | Code Available | 0 |
| Efficient Training of Deep Neural Operator Networks via Randomized Sampling | Sep 20, 2024 | Overall - Test | Code Available | 0 |
| Deep Modeling and Optimization of Medical Image Classification | May 29, 2025 | Avg, Classification | Code Available | 0 |