SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism, and other discriminatory behavior in a model. (Source: https://stereoset.mit.edu/)
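As an illustration of what "measuring" can mean here, the ICAT Score reported in the benchmark results on this page comes from StereoSet. The sketch below assumes the commonly cited definition, in which a language modeling score (lms) and a stereotype score (ss), both percentages, are combined as lms * min(ss, 100 - ss) / 50. The function name `icat_score` and the input values are hypothetical and are not taken from any listed paper.

```python
def icat_score(lms: float, ss: float) -> float:
    """Combine a language modeling score and a stereotype score into ICAT.

    Assumed definition (StereoSet): icat = lms * min(ss, 100 - ss) / 50.
    Both inputs are percentages in [0, 100]. An ideal model has lms = 100
    and ss = 50 (no preference for stereotypes or anti-stereotypes),
    which yields an ICAT of 100.
    """
    if not (0.0 <= lms <= 100.0 and 0.0 <= ss <= 100.0):
        raise ValueError("lms and ss must be percentages in [0, 100]")
    return lms * min(ss, 100.0 - ss) / 50.0


if __name__ == "__main__":
    # Hypothetical scores, for illustration only.
    print(icat_score(lms=90.0, ss=60.0))
```

Under this definition, a model that always prefers the stereotypical option (ss = 100) scores 0 regardless of its language modeling ability, which is why ICAT rewards being both fluent and unbiased.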

Papers

Showing 101-125 of 199 papers

Title | Status | Hype
Trade-Offs Between Fairness and Privacy in Language Modeling | Code | 0
Language-Agnostic Bias Detection in Language Models with Bias Probing | Code | 0
BiasAsker: Measuring the Bias in Conversational AI System | Code | 1
BAD: BiAs Detection for Large Language Models in the context of candidate screening | Code | 1
Introducing MBIB -- the first Media Bias Identification Benchmark Task and Dataset Collection | Code | 1
Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP | Code | 0
Disentangling Structure and Style: Political Bias Detection in News by Inducing Document Hierarchy | Code | 0
Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models | - | 0
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets | Code | 0
Auditing Algorithmic Fairness in Machine Learning for Health with Severity-Based LOGAN | - | 0
Galactica: A Large Language Model for Science | Code | 4
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models | Code | 0
Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles | - | 0
A Keyword Based Approach to Understanding the Overpenalization of Marginalized Groups by English Marginal Abuse Models on Twitter | - | 0
Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts | Code | 1
Efficient Gender Debiasing of Pre-trained Indic Language Models | - | 0
A Unified Comparison of User Modeling Techniques for Predicting Data Interaction and Detecting Exploration Bias | Code | 0
Robots Enact Malignant Stereotypes | - | 0
MRCLens: an MRC Dataset Bias Detection Toolkit | - | 0
A methodology to characterize bias and harmful stereotypes in natural language processing in Latin America | Code | 0
Incorporating Subjectivity into Gendered Ambiguous Pronoun (GAP) Resolution using Style Transfer | - | 0
HeteroCorpus: A Corpus for Heteronormative Language Detection | Code | 0
Personalized Detection of Cognitive Biases in Actions of Users from Their Logs: Anchoring and Recency Biases | - | 0
Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models | - | 0
Uncovering bias in the PlantVillage dataset | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-2 (small) | ICAT Score | 72.97 | - | Unverified
2 | XLNet (large) | ICAT Score | 72.03 | - | Unverified
3 | GPT-2 (medium) | ICAT Score | 71.73 | - | Unverified
4 | BERT (base) | ICAT Score | 71.21 | - | Unverified
5 | GPT-2 (large) | ICAT Score | 70.54 | - | Unverified
6 | BERT (large) | ICAT Score | 69.89 | - | Unverified
7 | RoBERTa (base) | ICAT Score | 67.5 | - | Unverified
8 | GAL 120B | ICAT Score | 65.6 | - | Unverified
9 | XLNet (base) | ICAT Score | 62.1 | - | Unverified
10 | GPT-3 (text-davinci-002) | ICAT Score | 60.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | Best-of | 0.5 | - | Unverified
2 | Baseline | Best-of | 0.41 | - | Unverified
3 | Gemma | Best-of | 0.41 | - | Unverified
4 | Mistral | Best-of | 0.36 | - | Unverified
5 | Llama2 | Best-of | 0.34 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BAD | ICAT Score | 23.44 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RandomForest_default_hyperparameters | Accuracy (%) | 49 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa+ALBERT | F1 | 70.4 | - | Unverified