| Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check | Jun 4, 2024 | BenchmarkingRepresentation Learning | —Unverified | 0 | 0 |
| BIAS: Transparent reporting of biomedical image analysis challenges | Oct 9, 2019 | Benchmarking | —Unverified | 0 | 0 |
| Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey | Jul 14, 2022 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 | 0 |
| Genicious: Contextual Few-shot Prompting for Insights Discovery | Mar 15, 2025 | BenchmarkingDecision Making | —Unverified | 0 | 0 |
| Beyond Visual Understanding: Introducing PARROT-360V for Vision Language Model Benchmarking | Nov 20, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 | 0 |
| Beyond Uniform Lipschitz Condition in Differentially Private Optimization | Jun 21, 2022 | Benchmarkingregression | —Unverified | 0 | 0 |
| Writing as a testbed for open ended agents | Mar 25, 2025 | BenchmarkingDiversity | —Unverified | 0 | 0 |
| GenSpace: Benchmarking Spatially-Aware Image Generation | May 30, 2025 | BenchmarkingImage Generation | —Unverified | 0 | 0 |
| GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks | Sep 29, 2024 | Benchmarking | —Unverified | 0 | 0 |
| GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models | Jun 7, 2024 | BenchmarkingDenoising | —Unverified | 0 | 0 |
| Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis | Feb 13, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing | Jan 20, 2025 | BenchmarkingEvolutionary Algorithms | —Unverified | 0 | 0 |
| Beyond Text: A Deep Dive into Large Language Models' Ability on Understanding Graph Data | Oct 7, 2023 | Benchmarking | —Unverified | 0 | 0 |
| Energy Models for Better Pseudo-Labels: Improving Semi-Supervised Classification with the 1-Laplacian Graph Energy | Jun 20, 2019 | BenchmarkingMulti-class Classification | —Unverified | 0 | 0 |
| GeoGebra Tools with Proof Capabilities | Mar 3, 2016 | Automated Theorem ProvingBenchmarking | —Unverified | 0 | 0 |
| Language Models as a Service: Overview of a New Paradigm and its Challenges | Sep 28, 2023 | Benchmarking | —Unverified | 0 | 0 |
| Geometric feature performance under downsampling for EEG classification tasks | Feb 15, 2021 | BenchmarkingClassification | —Unverified | 0 | 0 |
| Geometry-Based Next Frame Prediction from Monocular Video | Sep 20, 2016 | Autonomous DrivingBenchmarking | —Unverified | 0 | 0 |
| Geometry Matters: Benchmarking Scientific ML Approaches for Flow Prediction around Complex Geometries | Dec 31, 2024 | BenchmarkingOut-of-Distribution Generalization | —Unverified | 0 | 0 |
| GeoNet: Benchmarking Unsupervised Adaptation across Geographies | Mar 27, 2023 | BenchmarkingDomain Adaptation | —Unverified | 0 | 0 |
| Geospatial Foundation Models to Enable Progress on Sustainable Development Goals | May 30, 2025 | BenchmarkingEarth Observation | —Unverified | 0 | 0 |
| GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy | Jul 25, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages | May 12, 2022 | BenchmarkingDiversity | —Unverified | 0 | 0 |
| Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages | May 26, 2025 | BenchmarkingTransliteration | —Unverified | 0 | 0 |
| GFPNet: A Deep Network for Learning Shape Completion in Generic Fitted Primitives | Jun 3, 2020 | BenchmarkingObject | —Unverified | 0 | 0 |
| A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News | May 2, 2024 | BenchmarkingSign Language Recognition | —Unverified | 0 | 0 |
| GiCCS: A German in-Context Conversational Similarity Benchmark | Dec 16, 2022 | BenchmarkingSemantic Textual Similarity | —Unverified | 0 | 0 |
| GIMMICK -- Globally Inclusive Multimodal Multitask Cultural Knowledge Benchmarking | Feb 19, 2025 | Benchmarking | —Unverified | 0 | 0 |
| GIQ: Benchmarking 3D Geometric Reasoning of Vision Foundation Models with Simulated and Real Polyhedra | Jun 9, 2025 | 3D ReconstructionBenchmarking | —Unverified | 0 | 0 |
| Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms | Mar 1, 2024 | BenchmarkingStochastic Optimization | —Unverified | 0 | 0 |
| Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems | Feb 20, 2025 | BenchmarkingDecision Making | —Unverified | 0 | 0 |
| The Benchmark Lottery | Jul 14, 2021 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 | 0 |
| Global Rice Multi-Class Segmentation Dataset (RiceSEG): A Comprehensive and Diverse High-Resolution RGB-Annotated Images for the Development and Benchmarking of Rice Segmentation Algorithms | Apr 2, 2025 | BenchmarkingSemantic Segmentation | —Unverified | 0 | 0 |
| Global Wheat Head Dataset 2021: more diversity to improve the benchmarking of wheat head localization methods | May 17, 2021 | BenchmarkingDiversity | —Unverified | 0 | 0 |
| Beyond Monocular Deraining: Stereo Image Deraining via Semantic Understanding | Aug 1, 2020 | BenchmarkingRain Removal | —Unverified | 0 | 0 |
| GLOVER++: Unleashing the Potential of Affordance Learning from Human Behaviors for Robotic Manipulation | May 17, 2025 | Benchmarking | —Unverified | 0 | 0 |
| GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System | Apr 5, 2024 | BenchmarkingGPU | —Unverified | 0 | 0 |
| A Benchmark for Multi-speaker Anonymization | Jul 8, 2024 | BenchmarkingDisentanglement | —Unverified | 0 | 0 |
| Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior | May 9, 2021 | BenchmarkingRain Removal | —Unverified | 0 | 0 |
| Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks | Jul 29, 2024 | BenchmarkingLanguage Model Evaluation | —Unverified | 0 | 0 |
| GNUMAP: A Parameter-Free Approach to Unsupervised Dimensionality Reduction via Graph Neural Networks | Jul 30, 2024 | BenchmarkingContrastive Learning | —Unverified | 0 | 0 |
| Goal-Driven Sequential Data Abstraction | Jul 29, 2019 | BenchmarkingGeneral Reinforcement Learning | —Unverified | 0 | 0 |
| A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation | Mar 11, 2024 | BenchmarkingTraffic Signal Control | —Unverified | 0 | 0 |
| Domain Adaptation with Joint Learning for Generic, Optical Car Part Recognition and Detection Systems (Go-CaRD) | Jun 15, 2020 | BenchmarkingDomain Adaptation | —Unverified | 0 | 0 |
| Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding | Jul 1, 2022 | Benchmarking | —Unverified | 0 | 0 |
| The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI | Jun 1, 2023 | BenchmarkingBrain Tumor Segmentation | —Unverified | 0 | 0 |
| GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models | Apr 10, 2024 | BenchmarkingDenoising | —Unverified | 0 | 0 |
| GreenPCO: An Unsupervised Lightweight Point Cloud Odometry Method | Dec 8, 2021 | BenchmarkingObject | —Unverified | 0 | 0 |
| Ahead-of-Time P-Tuning | May 18, 2023 | Benchmarkingparameter-efficient fine-tuning | —Unverified | 0 | 0 |
| Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding | Jan 16, 2022 | Benchmarking | —Unverified | 0 | 0 |