| Benchmarking Pretrained Vision Embeddings for Near- and Duplicate Detection in Medical Images | Dec 12, 2023 | Benchmarking, Retrieval | —Unverified | 0 |
| Galvatron: An Automatic Distributed System for Efficient Foundation Model Training | Apr 30, 2025 | Benchmarking | —Unverified | 0 |
| FAIRification of MLC data | Nov 23, 2022 | Benchmarking, Management | —Unverified | 0 |
| A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking | Sep 5, 2023 | Benchmarking, Knowledge Distillation | —Unverified | 0 |
| GANmut: Generating and Modifying Facial Expressions | Jun 16, 2024 | Benchmarking, Diversity | —Unverified | 0 |
| GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR | Apr 15, 2025 | Benchmarking | —Unverified | 0 |
| A Normative Framework for Benchmarking Consumer Fairness in Large Language Model Recommender System | May 3, 2024 | Benchmarking, Collaborative Filtering | —Unverified | 0 |
| GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics | Mar 27, 2025 | Benchmarking, Natural Language Queries | —Unverified | 0 |
| A Survey of Spanish Clinical Language Models | Aug 4, 2023 | Benchmarking, Survey | —Unverified | 0 |
| AI Matrix - Synthetic Benchmarks for DNN | Nov 27, 2018 | Benchmarking, CPU | —Unverified | 0 |
| Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations | Dec 23, 2024 | Benchmarking, Question Answering | —Unverified | 0 |
| Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks | May 24, 2024 | Benchmarking, Decoder | —Unverified | 0 |
| Identifying patterns and recommendations of and for sustainable open data initiatives: a benchmarking-driven analysis of open government data initiatives among European countries | Dec 1, 2023 | Benchmarking | —Unverified | 0 |
| FactLens: Benchmarking Fine-Grained Fact Verification | Nov 8, 2024 | Benchmarking, Fact Verification | —Unverified | 0 |
| FACT: Learning Governing Abstractions Behind Integer Sequences | Sep 20, 2022 | Benchmarking | —Unverified | 0 |
| Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy | Dec 4, 2024 | Anatomy, Benchmarking | —Unverified | 0 |
| Face Morphing Attack Generation & Detection: A Comprehensive Survey | Nov 3, 2020 | Benchmarking, Face Recognition | —Unverified | 0 |
| A Unified Taylor Framework for Revisiting Attribution Methods | Aug 21, 2020 | Benchmarking, Decision Making | —Unverified | 0 |
| Face Detection on Surveillance Images | Oct 22, 2019 | Benchmarking, Face Detection | —Unverified | 0 |
| GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing | Jun 30, 2024 | Benchmarking, counterfactual | —Unverified | 0 |
| GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases | May 25, 2024 | Benchmarking, Hallucination | —Unverified | 0 |
| A Survey of Small Language Models | Oct 25, 2024 | Benchmarking, Model Compression | —Unverified | 0 |
| Identifying the Context Shift between Test Benchmarks and Production Data | Jul 3, 2022 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |
| Exploring the Decentraland Economy: Multifaceted Parcel Attributes, Key Insights, and Benchmarking | Apr 11, 2024 | Attribute, Benchmarking | —Unverified | 0 |
| Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning | Apr 19, 2024 | Benchmarking, counterfactual | —Unverified | 0 |
| ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content | Mar 13, 2025 | Benchmarking, Image Generation | —Unverified | 0 |
| Extraction of Research Objectives, Machine Learning Model Names, and Dataset Names from Academic Papers and Analysis of Their Interrelationships Using LLM and Network Analysis | Aug 22, 2024 | Benchmarking | —Unverified | 0 |
| Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization | May 23, 2022 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 |
| Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow | Feb 14, 2025 | Benchmarking | —Unverified | 0 |
| Generalized Conflict-directed Search for Optimal Ordering Problems | Mar 31, 2021 | Benchmarking, Scheduling | —Unverified | 0 |
| A Survey of Predictive Maintenance Methods: An Analysis of Prognostics via Classification and Regression | Jun 25, 2025 | Benchmarking, Management | —Unverified | 0 |
| General Scales Unlock AI Evaluation with Explanatory and Predictive Power | Mar 9, 2025 | Benchmarking, Specificity | —Unverified | 0 |
| Extraction of clinical information from the non-invasive fetal electrocardiogram | May 27, 2016 | Benchmarking, Heart Rate Variability | —Unverified | 0 |
| Generating Artificial Outliers in the Absence of Genuine Ones -- a Survey | Jun 5, 2020 | Benchmarking, Experimental Design | —Unverified | 0 |
| Extensible Logging and Empirical Attainment Function for IOHexperimenter | Sep 28, 2021 | Benchmarking | —Unverified | 0 |
| Extended Labeled Faces in-the-Wild (ELFW): Augmenting Classes for Face Segmentation | Jun 24, 2020 | Benchmarking, Data Augmentation | —Unverified | 0 |
| Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design | Apr 14, 2025 | Benchmarking, Language Modeling | —Unverified | 0 |
| Generating Synthetic Electronic Health Record (EHR) Data: A Review with Benchmarking | Nov 6, 2024 | Benchmarking | —Unverified | 0 |
| Generation of Large District Heating System Models Using Open-Source Data and Tools: An Exemplary Workflow | Dec 18, 2024 | Benchmarking | —Unverified | 0 |
| Synthetic Observational Health Data with GANs: from slow adoption to a boom in medical research and ultimately digital twins? | May 27, 2020 | Benchmarking, Fraud Detection | —Unverified | 0 |
| Generative Adversarial Networks with Limited Data: A Survey and Benchmarking | Apr 7, 2025 | Benchmarking, Image Generation | —Unverified | 0 |
| Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors | Jun 29, 2023 | Benchmarking | —Unverified | 0 |
| A Survey of Parameters Associated with the Quality of Benchmarks in NLP | Oct 14, 2022 | Benchmarking | —Unverified | 0 |
| Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning | Jun 16, 2024 | Benchmarking, Math | —Unverified | 0 |
| Benchmarking Post-Hoc Unknown-Category Detection in Food Recognition | Mar 24, 2025 | Benchmarking, Food Recognition | —Unverified | 0 |
| Exploring Thermography Technology: A Comprehensive Facial Dataset for Face Detection, Recognition, and Emotion | May 28, 2024 | Benchmarking, Emotion Recognition | —Unverified | 0 |
| Exploring the Impact of a Transformer's Latent Space Geometry on Downstream Task Performance | Jun 18, 2024 | Benchmarking | —Unverified | 0 |
| Generative Models at the Frontier of Compression: A Survey on Generative Face Video Coding | Jun 9, 2025 | Benchmarking, Video Compression | —Unverified | 0 |
| Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models | Feb 4, 2025 | Benchmarking, Decision Making | —Unverified | 0 |
| AI Idea Bench 2025: AI Research Idea Generation Benchmark | Apr 19, 2025 | Benchmarking, scientific discovery | —Unverified | 0 |