| A CUDA-Based Real Parameter Optimization Benchmark | Jul 29, 2014 | BenchmarkingCPU | —Unverified | 0 |
| Beyond Text: A Deep Dive into Large Language Models' Ability on Understanding Graph Data | Oct 7, 2023 | Benchmarking | —Unverified | 0 |
| BEADs: Bias Evaluation Across Domains | Jun 6, 2024 | BenchmarkingBias Detection | —Unverified | 0 |
| FedSym: Unleashing the Power of Entropy for Benchmarking the Algorithms for Federated Learning | Oct 11, 2023 | BenchmarkingDiversity | —Unverified | 0 |
| FERA 2017 - Addressing Head Pose in the Third Facial Expression Recognition and Analysis Challenge | Feb 14, 2017 | BenchmarkingFacial Action Unit Detection | —Unverified | 0 |
| Energy Models for Better Pseudo-Labels: Improving Semi-Supervised Classification with the 1-Laplacian Graph Energy | Jun 20, 2019 | BenchmarkingMulti-class Classification | —Unverified | 0 |
| Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages | May 12, 2022 | BenchmarkingDiversity | —Unverified | 0 |
| Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages | May 26, 2025 | BenchmarkingTransliteration | —Unverified | 0 |
| BEACON: A Benchmark for Efficient and Accurate Counting of Subgraphs | Apr 15, 2025 | BenchmarkingSubgraph Counting | —Unverified | 0 |
| FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks | Oct 17, 2022 | BenchmarkingGraph Neural Network | —Unverified | 0 |
| Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms | Mar 1, 2024 | BenchmarkingStochastic Optimization | —Unverified | 0 |
| Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems | Feb 20, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities | Oct 4, 2024 | Benchmarkingcounterfactual | —Unverified | 0 |
| BBOB Instance Analysis: Landscape Properties and Algorithm Performance across Problem Instances | Nov 29, 2022 | Benchmarking | —Unverified | 0 |
| A Benchmark for Multi-speaker Anonymization | Jul 8, 2024 | BenchmarkingDisentanglement | —Unverified | 0 |
| FedHPO-B: A Benchmark Suite for Federated Hyperparameter Optimization | Jun 8, 2022 | BenchmarkingFederated Learning | —Unverified | 0 |
| FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks | Jan 16, 2022 | BenchmarkingFederated Learning | —Unverified | 0 |
| FER-C: Benchmarking Out-of-Distribution Soft Calibration for Facial Expression Recognition | Dec 16, 2023 | BenchmarkingFacial Expression Recognition | —Unverified | 0 |
| A Modular Framework for Centrality and Clustering in Complex Networks | Nov 23, 2021 | BenchmarkingClustering | —Unverified | 0 |
| Beyond Monocular Deraining: Stereo Image Deraining via Semantic Understanding | Aug 1, 2020 | BenchmarkingRain Removal | —Unverified | 0 |
| Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior | May 9, 2021 | BenchmarkingRain Removal | —Unverified | 0 |
| Bayesian Neural Networks at Scale: A Performance Analysis and Pruning Study | May 23, 2020 | BenchmarkingNetwork Pruning | —Unverified | 0 |
| SPINEX-TimeSeries: Similarity-based Predictions with Explainable Neighbors Exploration for Time Series and Forecasting Problems | Aug 4, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks | Jul 29, 2024 | BenchmarkingLanguage Model Evaluation | —Unverified | 0 |
| Bayesian Multi-type Mean Field Multi-agent Imitation Learning | Dec 1, 2020 | BenchmarkingImitation Learning | —Unverified | 0 |