| Benchmarking the Performance and Energy Efficiency of AI Accelerators for AI Training | Sep 15, 2019 | Benchmarking, CPU | —Unverified | 0 | 0 |
| Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport | Nov 25, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Performance Comparison of Surrogate-Assisted Evolutionary Algorithms on Computational Fluid Dynamics Problems | Feb 26, 2024 | Benchmarking, Evolutionary Algorithms | —Unverified | 0 | 0 |
| Performance Evaluation Methodology for Long-Term Visual Object Tracking | Jun 19, 2019 | Benchmarking, Object | —Unverified | 0 | 0 |
| Benchmark Dataset for Pore-Scale CO2-Water Interaction | Mar 22, 2025 | Benchmarking | —Unverified | 0 | 0 |
| TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations | Jul 2, 2024 | Benchmarking, text-to-speech | —Unverified | 0 | 0 |
| Performance Evaluation of Transcriptomics Data Normalization for Survival Risk Prediction | Feb 8, 2021 | Benchmarking, Prediction | —Unverified | 0 | 0 |
| Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Nov 7, 2024 | Active Learning, Benchmarking | —Unverified | 0 | 0 |
| Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding | May 25, 2025 | Benchmarking, Multi-Agent Path Finding | —Unverified | 0 | 0 |
| Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As | Jun 6, 2024 | Articles, Benchmarking | —Unverified | 0 | 0 |
| Performance prediction of data streams on high-performance architecture | Jan 7, 2019 | Benchmarking, Dimensionality Reduction | —Unverified | 0 | 0 |
| Periocular Recognition in the Wild with Orthogonal Combination of Local Binary Coded Pattern in Dual-stream Convolutional Neural Network | Feb 18, 2019 | Benchmarking | —Unverified | 0 | 0 |
| Which models are innately best at uncertainty estimation? | Jun 5, 2022 | Benchmarking, Out-of-Distribution Detection | —Unverified | 0 | 0 |
| PerMedCQA: Benchmarking Large Language Models on Medical Consumer Question Answering in Persian Language | May 23, 2025 | Benchmarking, Question Answering | —Unverified | 0 | 0 |
| WeQA: A Benchmark for Retrieval Augmented Generation in Wind Energy Domain | Aug 21, 2024 | Answer Generation, Benchmarking | —Unverified | 0 | 0 |
| Perona: Robust Infrastructure Fingerprinting for Resource-Efficient Big Data Analytics | Nov 15, 2022 | Benchmarking | —Unverified | 0 | 0 |
| PerSEval: Assessing Personalization in Text Summarizers | Jun 29, 2024 | Benchmarking, Human Judgment Correlation | —Unverified | 0 | 0 |
| A Conformance Checking-based Approach for Drift Detection in Business Processes | Jul 9, 2019 | Benchmarking, Drift Detection | —Unverified | 0 | 0 |
| Personalised Feedback Framework for Online Education Programmes Using Generative AI | Oct 14, 2024 | Benchmarking, Management | —Unverified | 0 | 0 |
| Benchmark Data Repositories for Better Benchmarking | Oct 31, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Personalized Multimodal Large Language Models: A Survey | Dec 3, 2024 | Benchmarking, Survey | —Unverified | 0 | 0 |
| Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent | Dec 17, 2021 | Benchmarking, Diagnostic | —Unverified | 0 | 0 |
| Person Re-Identification by Unsupervised Video Matching | Nov 25, 2016 | Benchmarking, Dynamic Time Warping | —Unverified | 0 | 0 |
| Person Re-Identification in Identity Regression Space | Jun 25, 2018 | Benchmarking, Incremental Learning | —Unverified | 0 | 0 |
| Person Re-identification in the Wild | Apr 9, 2016 | Benchmarking, Pedestrian Detection | —Unverified | 0 | 0 |
| Person Search by Multi-Scale Matching | Jul 23, 2018 | Benchmarking, Human Detection | —Unverified | 0 | 0 |
| Person Search by Multi-Scale Matching | Sep 1, 2018 | Benchmarking, Human Detection | —Unverified | 0 | 0 |
| Perspective on recent developments and challenges in regulatory and systems genomics | Nov 7, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Perspectives on the State and Future of Deep Learning -- 2023 | Dec 7, 2023 | Benchmarking, Deep Learning | —Unverified | 0 | 0 |
| Perturbation-based exploration methods in deep reinforcement learning | Nov 10, 2020 | Atari Games, Benchmarking | —Unverified | 0 | 0 |
| Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs Dataset | Jan 9, 2024 | Benchmarking, image-classification | —Unverified | 0 | 0 |
| BENCHIP: Benchmarking Intelligence Processors | Oct 23, 2017 | Benchmarking, Diversity | —Unverified | 0 | 0 |
| PGLearn -- An Open-Source Learning Toolkit for Optimal Power Flow | May 28, 2025 | Benchmarking | —Unverified | 0 | 0 |
| PGLib-CO2: A Power Grid Library for Computing and Optimizing Carbon Emissions | Jun 17, 2025 | Benchmarking | —Unverified | 0 | 0 |
| BenchCouncil's View on Benchmarking AI and Other Emerging Workloads | Dec 2, 2019 | Benchmarking | —Unverified | 0 | 0 |
| PhD Thesis on Code Modulated Interferometric Imaging System using Phased Arrays | Jul 19, 2021 | Benchmarking | —Unverified | 0 | 0 |
| Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle | Jul 18, 2024 | Benchmarking, Language Modeling | —Unverified | 0 | 0 |
| PhilHumans: Benchmarking Machine Learning for Personal Health | May 4, 2024 | Action Anticipation, Benchmarking | —Unverified | 0 | 0 |
| @Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology | Sep 21, 2024 | Benchmarking, Depth Estimation | —Unverified | 0 | 0 |
| PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding | Jan 27, 2025 | Benchmarking, Common Sense Reasoning | —Unverified | 0 | 0 |
| PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models | May 30, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Physics-Learning AI Datamodel (PLAID) datasets: a collection of physics simulations for machine learning | May 5, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs | Nov 16, 2019 | Benchmarking, GPU | —Unverified | 0 | 0 |
| PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach | May 3, 2025 | Benchmarking, Image-to-Image Translation | —Unverified | 0 | 0 |
| BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation | Nov 20, 2024 | Benchmarking, Point Cloud Segmentation | —Unverified | 0 | 0 |
| Behavior Structformer: Learning Players Representations with Structured Tokenization | Jun 7, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Yesil o1 Pro: Evidence-Based AI Model for Health and Benchmarking in Clinical Decision Support | Feb 15, 2025 | Benchmarking, Epidemiology | —Unverified | 0 | 0 |
| PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation | Jul 22, 2022 | Benchmarking, Domain Adaptation | —Unverified | 0 | 0 |
| BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents | Jun 13, 2022 | Benchmarking | —Unverified | 0 | 0 |
| Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data | Sep 23, 2023 | Benchmarking, Super-Resolution | —Unverified | 0 | 0 |