| Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design | Oct 15, 2020 | Benchmarking, Decision Making | Unverified | 0 |
| Apples to Apples: Learning Semantics of Common Entities Through a Novel Comprehension Task | Jul 1, 2017 | Benchmarking, Part-Of-Speech Tagging | Unverified | 0 |
| Benchmarking Foundation Models for Zero-Shot Biometric Tasks | May 30, 2025 | Attribute, Benchmarking | Unverified | 0 |
| Benchmarking foundation models as feature extractors for weakly-supervised computational pathology | Aug 28, 2024 | Benchmarking, Diversity | Unverified | 0 |
| Advocating Character Error Rate for Multilingual ASR Evaluation | Oct 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Data Analysis in the Era of Generative AI | Sep 27, 2024 | Benchmarking | Unverified | 0 |
| Benchmarking for Public Health Surveillance tasks on Social Media with a Domain-Specific Pretrained Language Model | Apr 9, 2022 | Benchmarking, Language Modeling | Unverified | 0 |
| Benchmarking for Metaheuristic Black-Box Optimization: Perspectives and Open Challenges | Jul 1, 2020 | Benchmarking, Metaheuristic Optimization | Unverified | 0 |
| Adversarial Reinforcement Learning Framework for Benchmarking Collision Avoidance Mechanisms in Autonomous Vehicles | Jun 4, 2018 | Autonomous Navigation, Autonomous Vehicles | Unverified | 0 |
| Benchmarking for Bayesian Reinforcement Learning | Sep 14, 2015 | Benchmarking, reinforcement-learning | Unverified | 0 |
| Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling | Oct 23, 2024 | Benchmarking | Unverified | 0 |
| A Platform for Event Extraction in Hindi | May 1, 2020 | Articles, Benchmarking | Unverified | 0 |
| Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework | Jun 9, 2025 | Benchmarking, Fairness | Unverified | 0 |
| Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types | Jul 17, 2023 | Benchmarking | Unverified | 0 |
| Benchmarking five global optimization approaches for nano-optical shape optimization and parameter reconstruction | Sep 18, 2018 | Bayesian Optimization, Benchmarking | Unverified | 0 |
| DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes | May 22, 2025 | Benchmarking, RAG | Unverified | 0 |
| Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization | Feb 3, 2022 | 3D Reconstruction, Benchmarking | Unverified | 0 |
| Data and its (dis)contents: A survey of dataset development and use in machine learning research | Dec 9, 2020 | Benchmarking, BIG-bench Machine Learning | Unverified | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | Benchmarking, Management | Unverified | 0 |
| Benchmarking federated strategies in Peer-to-Peer Federated learning for biomedical data | Feb 15, 2024 | Benchmarking, Federated Learning | Unverified | 0 |
| Benchmarking Federated Machine Unlearning methods for Tabular Data | Apr 1, 2025 | Benchmarking, Computational Efficiency | Unverified | 0 |
| A Pipeline for Post-Crisis Twitter Data Acquisition | Jan 17, 2018 | Active Learning, Benchmarking | Unverified | 0 |
| Benchmarking FedAvg and FedCurv for Image Classification Tasks | Mar 31, 2023 | Benchmarking, Classification | Unverified | 0 |
| A Perspective on Neural Capacity Estimation: Viability and Reliability | Mar 22, 2022 | Benchmarking, Capacity Estimation | Unverified | 0 |
| Accelerating the discovery of steady-states of planetary interior dynamics with machine learning | Aug 30, 2024 | Benchmarking | Unverified | 0 |