| Title | Date | Tasks | Code |
| --- | --- | --- | --- |
| Galactica: A Large Language Model for Science | Nov 16, 2022 | Anachronisms, Bias Detection | Available (4) |
| Explainable AI in Spatial Analysis | May 1, 2025 | Bias Detection, Explainable artificial intelligence | Available (2) |
| Benchmarking Bias Mitigation Algorithms in Representation Learning through Fairness Metrics | Jun 8, 2021 | Age And Gender Classification, Benchmarking | Available (1) |
| Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts | Sep 29, 2022 | Articles, Bias Detection | Available (1) |
| BiasAsker: Measuring the Bias in Conversational AI System | May 21, 2023 | Bias Detection | Available (1) |
| Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Apr 15, 2024 | Benchmarking, Bias Detection | Available (1) |
| MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation | Dec 16, 2024 | All, Benchmarking | Available (1) |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detection, counterfactual | Available (1) |
| BAD: BiAs Detection for Large Language Models in the context of candidate screening | May 17, 2023 | Bias Detection, Fairness | Available (1) |
| New Job, New Gender? Measuring the Social Bias in Image Generation Models | Jan 1, 2024 | Bias Detection, Fairness | Available (1) |
| Towards explainable classifiers using the counterfactual approach -- global explanations for discovering bias in data | May 5, 2020 | Bias Detection, counterfactual | Available (1) |
| Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases | Jun 6, 2020 | Bias Detection, Sentence | Available (1) |
| Debiased Visual Question Answering from Feature and Sample Perspectives | Dec 1, 2021 | Bias Detection, Question Answering | Available (1) |
| Amazon SageMaker Clarify: Machine Learning Bias Detection and Explainability in the Cloud | Sep 7, 2021 | Bias Detection, BIG-bench Machine Learning | Available (1) |
| Neural Media Bias Detection Using Distant Supervision With BABE - Bias Annotations By Experts | Nov 1, 2021 | Articles, Bias Detection | Available (1) |
| Explainable AI for computational pathology identifies model limitations and tissue biomarkers | Sep 4, 2024 | Bias Detection, counterfactual | Available (1) |
| A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets | May 29, 2023 | Bias Detection, Code Generation | Available (1) |
| Learning to Split for Automatic Bias Detection | Apr 28, 2022 | Bias Detection, image-classification | Available (1) |
| OpenBias: Open-set Bias Detection in Text-to-Image Generative Models | Apr 11, 2024 | Bias Detection, Fairness | Available (1) |
| The Hidden Language of Diffusion Models | Jun 1, 2023 | Bias Detection, Image Manipulation | Available (1) |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | May 16, 2024 | Bias Detection, Diversity | Available (1) |
| StereoSet: Measuring stereotypical bias in pretrained language models | Apr 20, 2020 | Bias Detection, Math | Available (1) |
| Introducing MBIB -- the first Media Bias Identification Benchmark Task and Dataset Collection | Apr 25, 2023 | Bias Detection | Available (1) |
| Exploring Visual Engagement Signals for Representation Learning | Apr 15, 2021 | Bias Detection, Emotion Recognition | Available (1) |
| Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists | Mar 17, 2022 | Abuse Detection, Bias Detection | Available (1) |
| A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training | Jul 10, 2024 | Bias Detection | Unverified (0) |
| BENN: Bias Estimation Using Deep Neural Network | Dec 23, 2020 | Bias Detection | Unverified (0) |
| Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models | Feb 14, 2023 | Bias Detection, Data Augmentation | Unverified (0) |
| Beyond Explanation: A Case for Exploratory Text Visualizations of Non-Aggregated, Annotated Datasets | Jun 1, 2022 | Bias Detection, Hate Speech Detection | Unverified (0) |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Jul 14, 2024 | Bias Detection, Question Answering | Unverified (0) |
| BEADs: Bias Evaluation Across Domains | Jun 6, 2024 | Benchmarking, Bias Detection | Unverified (0) |
| Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector | May 21, 2025 | Bias Detection, In-Context Learning | Unverified (0) |
| Constructive Interpretability with CoLabel: Corroborative Integration, Complementary Features, and Collaborative Learning | May 20, 2022 | Bias Detection | Unverified (0) |
| Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema | Apr 16, 2021 | Artifact Detection, Bias Detection | Unverified (0) |
| A Novel Method for News Article Event-Based Embedding | May 20, 2024 | Articles, Bias Detection | Unverified (0) |
| A Keyword Based Approach to Understanding the Overpenalization of Marginalized Groups by English Marginal Abuse Models on Twitter | Oct 7, 2022 | Bias Detection, Fairness | Unverified (0) |
| Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs | Apr 15, 2024 | Bias Detection, Logical Reasoning | Unverified (0) |
| Annotating and Analyzing Biased Sentences in News Articles using Crowdsourcing | May 1, 2020 | Articles, Bias Detection | Unverified (0) |
| Auditing Predictive Models for Intersectional Biases | Jun 22, 2023 | Bias Detection, Fairness | Unverified (0) |
| Accurate Uncertainty Estimation and Decomposition in Ensemble Learning | Nov 11, 2019 | Bias Detection, Ensemble Learning | Unverified (0) |
| Can we Debias Social Stereotypes in AI-Generated Images? Examining Text-to-Image Outputs and User Perceptions | May 27, 2025 | Bias Detection | Unverified (0) |
| Auditing Algorithmic Fairness in Machine Learning for Health with Severity-Based LOGAN | Nov 16, 2022 | Bias Detection, Clustering | Unverified (0) |
| Can We Trust AI Agents? A Case Study of an LLM-Based Multi-Agent System for Ethical AI | Oct 25, 2024 | Bias Detection, Ethics | Unverified (0) |
| Cascading Adversarial Bias from Injection to Distillation in Language Models | May 30, 2025 | Bias Detection, Code Generation | Unverified (0) |
| Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021): Workshop and Shared Task Report | Aug 17, 2021 | Bias Detection, Learning Word Embeddings | Unverified (0) |
| ChatGPT v.s. Media Bias: A Comparative Study of GPT-3.5 and Fine-tuned Language Models | Mar 29, 2024 | Bias Detection | Unverified (0) |
| Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers | Jan 1, 2025 | Bias Detection, Large Language Model | Unverified (0) |
| Cognitive Bias Detection Using Advanced Prompt Engineering | Mar 7, 2025 | Bias Detection, Decision Making | Unverified (0) |
| Detecting Cross-Geographic Biases in Toxicity Modeling on Social Media | Apr 14, 2021 | Bias Detection | Unverified (0) |
| Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool | Feb 3, 2025 | Bias Detection, Clustering | Unverified (0) |