| Benchmarking GPUs on SVBRDF Extractor Model | Oct 19, 2023 | BenchmarkingGPU | —Unverified | 0 |
| Benchmarking GPU and TPU Performance with Graph Neural Networks | Oct 21, 2022 | BenchmarkingGPU | —Unverified | 0 |
| Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset | Apr 16, 2024 | BenchmarkingManagement | —Unverified | 0 |
| Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies | Feb 27, 2024 | BenchmarkingSystematic Generalization | —Unverified | 0 |
| Approaches for benchmarking single-cell gene regulatory network inference methods | Jul 17, 2023 | Benchmarking | —Unverified | 0 |
| Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models | Jun 6, 2023 | BenchmarkingEthics | —Unverified | 0 |
| Benchmarking GNNs Using Lightning Network Data | Jul 5, 2024 | Benchmarking | —Unverified | 0 |
| Benchmarking global optimization techniques for unmanned aerial vehicle path planning | Jan 24, 2025 | Benchmarkingglobal-optimization | —Unverified | 0 |
| Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data | May 16, 2022 | Accented Speech RecognitionBenchmarking | —Unverified | 0 |
| Data-driven Approach for Static Hedging of Exchange Traded Options | Feb 1, 2023 | BenchmarkingInterpretable Machine Learning | —Unverified | 0 |
| Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming | Jun 14, 2024 | BenchmarkingGeneral Knowledge | —Unverified | 0 |
| Applications in CityLearn Gym Environment for Multi-Objective Control Benchmarking in Grid-Interactive Buildings and Districts | Aug 27, 2024 | BenchmarkingModel Predictive Control | —Unverified | 0 |
| AEON: Adaptive Estimation of Instance-Dependent In-Distribution and Out-of-Distribution Label Noise for Robust Learning | Jan 23, 2025 | Benchmarkingimage-classification | —Unverified | 0 |
| Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory | Aug 24, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| Benchmarking Generative AI for Scoring Medical Student Interviews in Objective Structured Clinical Examinations (OSCEs) | Jan 21, 2025 | Benchmarking | —Unverified | 0 |
| Application of Machine Learning for Online Reputation Systems | Sep 10, 2022 | BenchmarkingRecommendation Systems | —Unverified | 0 |
| Benchmarking General-Purpose In-Context Learning | May 27, 2024 | BenchmarkingDecision Making | —Unverified | 0 |
| Application of DEA in International Market Selection for the export of products from Spain | Sep 10, 2021 | BenchmarkingDecision Making | —Unverified | 0 |
| Data Augmentation for Traffic Classification | Jan 19, 2024 | BenchmarkingClassification | —Unverified | 0 |
| Application Inference using Machine Learning based Side Channel Analysis | Jul 9, 2019 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| DarkBench: Benchmarking Dark Patterns in Large Language Models | Mar 13, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech | Jun 9, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Application based Evaluation of an Efficient Spike-Encoder, "Spiketrum" | May 24, 2024 | BenchmarkingClassification | —Unverified | 0 |
| DASB -- Discrete Audio and Speech Benchmark | Jun 20, 2024 | BenchmarkingEmotion Recognition | —Unverified | 0 |
| Benchmarking Foundation Models with Language-Model-as-an-Examiner | Jun 7, 2023 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design | Oct 15, 2020 | BenchmarkingDecision Making | —Unverified | 0 |
| Apples to Apples: Learning Semantics of Common Entities Through a Novel Comprehension Task | Jul 1, 2017 | BenchmarkingPart-Of-Speech Tagging | —Unverified | 0 |
| Benchmarking Foundation Models for Zero-Shot Biometric Tasks | May 30, 2025 | AttributeBenchmarking | —Unverified | 0 |
| Benchmarking foundation models as feature extractors for weakly-supervised computational pathology | Aug 28, 2024 | BenchmarkingDiversity | —Unverified | 0 |
| Advocating Character Error Rate for Multilingual ASR Evaluation | Oct 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Data Analysis in the Era of Generative AI | Sep 27, 2024 | Benchmarking | —Unverified | 0 |
| Benchmarking for Public Health Surveillance tasks on Social Media with a Domain-Specific Pretrained Language Model | Apr 9, 2022 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Benchmarking for Metaheuristic Black-Box Optimization: Perspectives and Open Challenges | Jul 1, 2020 | BenchmarkingMetaheuristic Optimization | —Unverified | 0 |
| Adversarial Reinforcement Learning Framework for Benchmarking Collision Avoidance Mechanisms in Autonomous Vehicles | Jun 4, 2018 | Autonomous NavigationAutonomous Vehicles | —Unverified | 0 |
| Benchmarking for Bayesian Reinforcement Learning | Sep 14, 2015 | Benchmarkingreinforcement-learning | —Unverified | 0 |
| Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling | Oct 23, 2024 | Benchmarking | —Unverified | 0 |
| A Platform for Event Extraction in Hindi | May 1, 2020 | ArticlesBenchmarking | —Unverified | 0 |
| Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework | Jun 9, 2025 | BenchmarkingFairness | —Unverified | 0 |
| Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types | Jul 17, 2023 | Benchmarking | —Unverified | 0 |
| Benchmarking five global optimization approaches for nano-optical shape optimization and parameter reconstruction | Sep 18, 2018 | Bayesian OptimizationBenchmarking | —Unverified | 0 |
| DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes | May 22, 2025 | BenchmarkingRAG | —Unverified | 0 |
| Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization | Feb 3, 2022 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| Data and its (dis)contents: A survey of dataset development and use in machine learning research | Dec 9, 2020 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | BenchmarkingManagement | —Unverified | 0 |
| Benchmarking federated strategies in Peer-to-Peer Federated learning for biomedical data | Feb 15, 2024 | BenchmarkingFederated Learning | —Unverified | 0 |
| Benchmarking Federated Machine Unlearning methods for Tabular Data | Apr 1, 2025 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| A Pipeline for Post-Crisis Twitter Data Acquisition | Jan 17, 2018 | Active LearningBenchmarking | —Unverified | 0 |
| Benchmarking FedAvg and FedCurv for Image Classification Tasks | Mar 31, 2023 | BenchmarkingClassification | —Unverified | 0 |
| A Perspective on Neural Capacity Estimation: Viability and Reliability | Mar 22, 2022 | BenchmarkingCapacity Estimation | —Unverified | 0 |
| Accelerating the discovery of steady-states of planetary interior dynamics with machine learning | Aug 30, 2024 | Benchmarking | —Unverified | 0 |