| BioDSA-1K: Benchmarking Data Science Agents for Biomedical Research | May 22, 2025 | Benchmarking | —Unverified | 0 |
| A Multimodal, Full-Surround Vehicular Testbed for Naturalistic Studies and Benchmarking: Design, Calibration and Deployment | Sep 21, 2017 | Autonomous Driving, Benchmarking | —Unverified | 0 |
| Binary Classification with Positive Labeling Sources | Aug 2, 2022 | Benchmarking, Binary Classification | —Unverified | 0 |
| Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs | Nov 16, 2019 | Benchmarking, GPU | —Unverified | 0 |
| Featuremetric benchmarking: Quantum computer benchmarks based on circuit features | Apr 17, 2025 | Benchmarking | —Unverified | 0 |
| A Multi-Labeled Dataset for Indonesian Discourse: Examining Toxicity, Polarization, and Demographics Information | Mar 1, 2025 | Benchmarking | —Unverified | 0 |
| BigDataBench: A Scalable and Unified Big Data and AI Benchmark Suite | Feb 23, 2018 | Benchmarking, CPU | —Unverified | 0 |
| BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation | Nov 20, 2024 | Benchmarking, Point Cloud Segmentation | —Unverified | 0 |
| Feature Encodings for Gradient Boosting with Automunge | Sep 25, 2022 | Benchmarking, Binarization | —Unverified | 0 |
| Feature Selection and Classification of Hyperspectral Images With Support Vector Machines | Oct 15, 2007 | Benchmarking, Classification | —Unverified | 0 |
| Bi-Discriminator Class-Conditional Tabular GAN | Nov 12, 2021 | Benchmarking | —Unverified | 0 |
| Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check | Jun 4, 2024 | Benchmarking, Representation Learning | —Unverified | 0 |
| Behavior Structformer: Learning Players Representations with Structured Tokenization | Jun 7, 2024 | Benchmarking | —Unverified | 0 |
| BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents | Jun 13, 2022 | Benchmarking | —Unverified | 0 |
| BIAS: Transparent reporting of biomedical image analysis challenges | Oct 9, 2019 | Benchmarking | —Unverified | 0 |
| AM-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One | Jan 1, 2024 | All, Benchmarking | —Unverified | 0 |
| A Benchmark for Out of Distribution Detection in Point Cloud 3D Semantic Segmentation | Nov 11, 2022 | 3D Semantic Segmentation, Autonomous Driving | —Unverified | 0 |
| Feature-based Evolutionary Diversity Optimization of Discriminating Instances for Chance-constrained Optimization Problems | Jan 24, 2025 | Benchmarking, Diversity | —Unverified | 0 |
| Feature selection in linear SVMs via a hard cardinality constraint: a scalable SDP decomposition approach | Apr 15, 2024 | Benchmarking, feature selection | —Unverified | 0 |
| Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey | Jul 14, 2022 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |
| Beyond Visual Understanding: Introducing PARROT-360V for Vision Language Model Benchmarking | Nov 20, 2024 | Benchmarking, Language Modeling | —Unverified | 0 |
| Beyond Uniform Lipschitz Condition in Differentially Private Optimization | Jun 21, 2022 | Benchmarking, regression | —Unverified | 0 |
| FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding | Mar 19, 2025 | Benchmarking, Multiple-choice | —Unverified | 0 |
| Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis | Feb 13, 2025 | Benchmarking | —Unverified | 0 |
| Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing | Jan 20, 2025 | Benchmarking, Evolutionary Algorithms | —Unverified | 0 |