SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism, and other discriminatory behavior in a model (Source: https://stereoset.mit.edu/).

Papers

Showing 101-150 of 199 papers

Title | Status | Hype
A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training | | 0
STOOD-X methodology: using statistical nonparametric test for OOD Detection Large-Scale datasets enhanced with explainability | | 0
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset | | 0
Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models | | 0
Towards Integrating Fairness Transparently in Industrial Applications | | 0
Target-Aware Contextual Political Bias Detection in News | | 0
Team Kermit-the-frog at SemEval-2019 Task 4: Bias Detection Through Sentiment Analysis and Simple Linguistic Features | | 0
Implications of the AI Act for Non-Discrimination Law and Algorithmic Fairness | | 0
Improved Models for Media Bias Detection and Subcategorization | | 0
Incorporating Subjectivity into Gendered Ambiguous Pronoun (GAP) Resolution using Style Transfer | | 0
With a Grain of SALT: Are LLMs Fair Across Social Dimensions? | | 0
Inferring bias and uncertainty in camera calibration | | 0
InsideBias: Measuring Bias in Deep Networks and Application to Face Gender Biometrics | | 0
Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector | | 0
Investigating Bias in Image Classification using Model Explanations | | 0
Accurate Uncertainty Estimation and Decomposition in Ensemble Learning | | 0
iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability | | 0
The Impact of Presentation Style on Human-In-The-Loop Detection of Algorithmic Bias | | 0
Large Language Model (LLM) Bias Index -- LLMBI | | 0
Large-scale news entity sentiment analysis | | 0
A Novel Method for News Article Event-Based Embedding | | 0
LLMs can be easily Confused by Instructional Distractions | | 0
Unboxing Occupational Bias: Grounded Debiasing of LLMs with U.S. Labor Data | | 0
The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes | | 0
Split and Expand: An inference-time improvement for Weakly Supervised Cell Instance Segmentation | | 0
Uncovering Biases with Reflective Large Language Models | | 0
MBIC -- A Media Bias Annotation Dataset Including Annotator Characteristics | | 0
Visual Reasoning Evaluation of Grok, Deepseek Janus, Gemini, Qwen, Mistral, and ChatGPT | | 0
MediaSpin: Exploring Media Bias Through Fine-Grained Analysis of News Headlines | | 0
Unlocking Bias Detection: Leveraging Transformer-Based Models for Content Analysis | | 0
Toward Holistic Evaluation of Recommender Systems Powered by Generative Models | | 0
Modeling Multi-level Context for Informational Bias Detection by Contrastive Learning and Sentential Graph Network | | 0
MRCLens: an MRC Dataset Bias Detection Toolkit | | 0
MRCLens: an MRC Dataset Bias Detection Toolkit | | 0
Annotating and Analyzing Biased Sentences in News Articles using Crowdsourcing | | 0
Multilingual Bias Detection and Mitigation for Indian Languages | | 0
Towards A Reliable Ground-Truth For Biased Language Detection | | 0
Unmasking Bias in AI: A Systematic Review of Bias Detection and Mitigation Strategies in Electronic Health Record-based Models | | 0
Towards Detecting Political Bias in Hindi News Articles | | 0
An Interdisciplinary Approach for the Automated Detection and Visualization of Media Bias in News Articles | | 0
Anatomizing Bias in Facial Analysis | | 0
Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles | | 0
Unmasking Conversational Bias in AI Multiagent Systems | | 0
A Meta Survey of Quality Evaluation Criteria in Explanation Methods | | 0
On the Mutual Influence of Gender and Occupation in LLM Representations | | 0
A Keyword Based Approach to Understanding the Overpenalization of Marginalized Groups by English Marginal Abuse Models on Twitter | | 0
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | | 0
Bias Analysis of AI Models for Undergraduate Student Admissions | | 0
Beyond Explanation: A Case for Exploratory Text Visualizations of Non-Aggregated, Annotated Datasets | | 0
Bias Detection via Maximum Subgroup Discrepancy | | 0
Page 3 of 4

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-2 (small) | ICAT Score | 72.97 | | Unverified
2 | XLNet (large) | ICAT Score | 72.03 | | Unverified
3 | GPT-2 (medium) | ICAT Score | 71.73 | | Unverified
4 | BERT (base) | ICAT Score | 71.21 | | Unverified
5 | GPT-2 (large) | ICAT Score | 70.54 | | Unverified
6 | BERT (large) | ICAT Score | 69.89 | | Unverified
7 | RoBERTa (base) | ICAT Score | 67.5 | | Unverified
8 | GAL 120B | ICAT Score | 65.6 | | Unverified
9 | XLNet (base) | ICAT Score | 62.1 | | Unverified
10 | GPT-3 (text-davinci-002) | ICAT Score | 60.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | Best-of | 0.5 | | Unverified
2 | Gemma | Best-of | 0.41 | | Unverified
3 | Baseline | Best-of | 0.41 | | Unverified
4 | Mistral | Best-of | 0.36 | | Unverified
5 | Llama2 | Best-of | 0.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BAD | ICAT Score | 23.44 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RandomForest_default_hyperparameters | Accuracy (%) | 49 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa+ALBERT | F1 | 70.4 | | Unverified
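
Several rows above report an ICAT Score. As a rough guide to reading those numbers, the sketch below assumes the StereoSet definition of the Idealized CAT score, which combines a language modeling score (lms) and a stereotype score (ss), both on a 0-100 scale. The function name `icat_score` and the input values in the usage lines are illustrative assumptions; they are not taken from this page, and the page does not say how each listed score was computed.

```python
def icat_score(lms: float, ss: float) -> float:
    """Idealized CAT score, assuming the StereoSet definition.

    lms: language modeling score, 0-100 (higher is better).
    ss:  stereotype score, 0-100 (50 means no preference between
         stereotypical and anti-stereotypical associations).
    """
    if not (0.0 <= lms <= 100.0 and 0.0 <= ss <= 100.0):
        raise ValueError("lms and ss must both lie in [0, 100]")
    # The min() term rewards ss close to 50 and penalizes bias in either direction.
    return lms * min(ss, 100.0 - ss) / 50.0


# Illustrative inputs only (hypothetical values, not results from this page).
ideal = icat_score(100.0, 50.0)         # perfect modeling, no stereotype preference
fully_biased = icat_score(100.0, 100.0) # perfect modeling, always stereotypical
partial = icat_score(80.0, 60.0)
```

Under this definition a model scores 100 only when it models language perfectly and shows no preference for stereotypes, and it scores 0 when it always picks the stereotypical (or always the anti-stereotypical) option, regardless of its language modeling quality.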