| Automated legal reasoning with discretion to act using s(LAW) | Jan 25, 2024 | Benchmarking, Legal Reasoning | —Unverified | 0 |
| Benchmarking the Robustness of Quantized Models | Apr 8, 2023 | Benchmarking, Quantization | —Unverified | 0 |
| Benchmarking the Robustness of Panoptic Segmentation for Automated Driving | Feb 23, 2024 | Benchmarking, Decision Making | —Unverified | 0 |
| Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models | Apr 1, 2025 | Benchmarking, Conversational Question Answering | —Unverified | 0 |
| A lightweight and accurate YOLO-like network for small target detection in Aerial Imagery | Apr 5, 2022 | Benchmarking, object-detection | —Unverified | 0 |
| A Baseline Method for Removing Invisible Image Watermarks using Deep Image Prior | Feb 19, 2025 | Benchmarking, Misinformation | —Unverified | 0 |
| Benchmarking the Robustness of Instance Segmentation Models | Sep 2, 2021 | Benchmarking, Domain Adaptation | —Unverified | 0 |
| Automated detection of gibbon calls from passive acoustic monitoring data using convolutional neural networks in the "torch for R" ecosystem | Jul 13, 2024 | Benchmarking, Deep Learning | —Unverified | 0 |
| Genetic algorithm for feature selection of EEG heterogeneous data | Mar 12, 2021 | Benchmarking, EEG | —Unverified | 0 |
| Galvatron: An Automatic Distributed System for Efficient Foundation Model Training | Apr 30, 2025 | Benchmarking | —Unverified | 0 |
| Alibaba’s Submission for the WMT 2020 APE Shared Task: Improving Automatic Post-Editing with Pre-trained Conditional Cross-Lingual BERT | Nov 1, 2020 | Automatic Post-Editing, Benchmarking | —Unverified | 0 |
| Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance | Mar 23, 2023 | Benchmarking, Data Augmentation | —Unverified | 0 |
| Benchmarking the rationality of AI decision making using the transitivity axiom | Feb 14, 2025 | Benchmarking, Decision Making | —Unverified | 0 |
| Automated 3D Tumor Segmentation using Temporal Cubic PatchGAN (TCuP-GAN) | Nov 23, 2023 | Benchmarking, Brain Tumor Segmentation | —Unverified | 0 |
| Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection | Apr 11, 2023 | Adversarial Attack, Adversarial Robustness | —Unverified | 0 |
| AutoLay: Benchmarking amodal layout estimation for autonomous driving | Aug 20, 2021 | Amodal Layout Estimation, Autonomous Driving | —Unverified | 0 |
| Benchmarking the Neural Linear Model for Regression | Dec 18, 2019 | Bayesian Optimization, Benchmarking | —Unverified | 0 |
| Algorithm Selection with Probing Trajectories: Benchmarking the Choice of Classifier Model | Jan 20, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking the Impact of Noise on Deep Learning-based Classification of Atrial Fibrillation in 12-Lead ECG | Mar 24, 2023 | Atrial Fibrillation Detection, Benchmarking | —Unverified | 0 |
| Functional Code Building Genetic Programming | Jun 9, 2022 | Benchmarking, Program Synthesis | —Unverified | 0 |
| Benchmarking the human brain against computational architectures | May 15, 2023 | Benchmarking, Computational Efficiency | —Unverified | 0 |
| A Conformance Checking-based Approach for Drift Detection in Business Processes | Jul 9, 2019 | Benchmarking, Drift Detection | —Unverified | 0 |
| FunBench: Benchmarking Fundus Reading Skills of MLLMs | Mar 2, 2025 | Anatomy, Benchmarking | —Unverified | 0 |
| Efficient Pauli channel estimation with logarithmic quantum memory | Sep 25, 2023 | Benchmarking | —Unverified | 0 |
| AutoAI-TS: AutoAI for Time Series Forecasting | Feb 24, 2021 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |