| Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check | Jun 4, 2024 | BenchmarkingRepresentation Learning | —Unverified | 0 | 0 |
| BIAS: Transparent reporting of biomedical image analysis challenges | Oct 9, 2019 | Benchmarking | —Unverified | 0 | 0 |
| Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey | Jul 14, 2022 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 | 0 |
| Genicious: Contextual Few-shot Prompting for Insights Discovery | Mar 15, 2025 | BenchmarkingDecision Making | —Unverified | 0 | 0 |
| Beyond Visual Understanding: Introducing PARROT-360V for Vision Language Model Benchmarking | Nov 20, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 | 0 |
| Beyond Uniform Lipschitz Condition in Differentially Private Optimization | Jun 21, 2022 | Benchmarkingregression | —Unverified | 0 | 0 |
| Writing as a testbed for open ended agents | Mar 25, 2025 | BenchmarkingDiversity | —Unverified | 0 | 0 |
| GenSpace: Benchmarking Spatially-Aware Image Generation | May 30, 2025 | BenchmarkingImage Generation | —Unverified | 0 | 0 |
| GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks | Sep 29, 2024 | Benchmarking | —Unverified | 0 | 0 |
| GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models | Jun 7, 2024 | BenchmarkingDenoising | —Unverified | 0 | 0 |
| Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis | Feb 13, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing | Jan 20, 2025 | BenchmarkingEvolutionary Algorithms | —Unverified | 0 | 0 |
| Beyond Text: A Deep Dive into Large Language Models' Ability on Understanding Graph Data | Oct 7, 2023 | Benchmarking | —Unverified | 0 | 0 |
| Energy Models for Better Pseudo-Labels: Improving Semi-Supervised Classification with the 1-Laplacian Graph Energy | Jun 20, 2019 | BenchmarkingMulti-class Classification | —Unverified | 0 | 0 |
| GeoGebra Tools with Proof Capabilities | Mar 3, 2016 | Automated Theorem ProvingBenchmarking | —Unverified | 0 | 0 |
| Language Models as a Service: Overview of a New Paradigm and its Challenges | Sep 28, 2023 | Benchmarking | —Unverified | 0 | 0 |
| Geometric feature performance under downsampling for EEG classification tasks | Feb 15, 2021 | BenchmarkingClassification | —Unverified | 0 | 0 |
| Geometry-Based Next Frame Prediction from Monocular Video | Sep 20, 2016 | Autonomous DrivingBenchmarking | —Unverified | 0 | 0 |
| Geometry Matters: Benchmarking Scientific ML Approaches for Flow Prediction around Complex Geometries | Dec 31, 2024 | BenchmarkingOut-of-Distribution Generalization | —Unverified | 0 | 0 |
| GeoNet: Benchmarking Unsupervised Adaptation across Geographies | Mar 27, 2023 | BenchmarkingDomain Adaptation | —Unverified | 0 | 0 |
| Geospatial Foundation Models to Enable Progress on Sustainable Development Goals | May 30, 2025 | BenchmarkingEarth Observation | —Unverified | 0 | 0 |
| GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy | Jul 25, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages | May 12, 2022 | BenchmarkingDiversity | —Unverified | 0 | 0 |
| Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages | May 26, 2025 | BenchmarkingTransliteration | —Unverified | 0 | 0 |
| GFPNet: A Deep Network for Learning Shape Completion in Generic Fitted Primitives | Jun 3, 2020 | BenchmarkingObject | —Unverified | 0 | 0 |