| WATT: Weight Average Test-Time Adaptation of CLIP | Jun 19, 2024 | Image Classification | Code Available | 2 |
| Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning | Oct 21, 2023 | Overall - Test, Problem Decomposition | Code Available | 1 |
| Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models | May 24, 2023 | Overall - Test | Code Available | 1 |
| Amplifying Membership Exposure via Data Poisoning | Nov 1, 2022 | Data Poisoning, Overall - Test | Code Available | 1 |
| Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment | Apr 6, 2022 | Overall - Test, Question Answering | Code Available | 1 |
| Comparative study of deep learning methods for the automatic segmentation of lung, lesion and lesion type in CT scans of COVID-19 patients | Jul 29, 2020 | Lesion Segmentation, Overall - Test | Code Available | 1 |
| FreeLB: Enhanced Adversarial Training for Natural Language Understanding | Sep 25, 2019 | ARC, Natural Language Understanding | Code Available | 1 |
| Underage Detection through a Multi-Task and MultiAge Approach for Screening Minors in Unconstrained Imagery | Jun 12, 2025 | Age Estimation, Overall - Test | Unverified | 0 |
| AI5GTest: AI-Driven Specification-Aware Automated Testing and Validation of 5G O-RAN Components | Jun 11, 2025 | Overall - Test | Unverified | 0 |
| Deep Modeling and Optimization of Medical Image Classification | May 29, 2025 | Avg, Classification | Code Available | 0 |