SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 2650 of 199 papers

TitleStatusHype
A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet ClassifiersCode0
Language-Agnostic Bias Detection in Language Models with Bias ProbingCode0
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contextsCode0
IFBiD: Inference-Free Bias DetectionCode0
Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative ModelsCode0
HeteroCorpus: A Corpus for Heteronormative Language DetectionCode0
GUS-Net: Social Bias Classification in Text with Generalizations, Unfairness, and StereotypesCode0
How Neural Networks Organize Concepts: Introducing Concept Trajectory Analysis for Deep Learning InterpretabilityCode0
IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias IndicatorsCode0
LOGAN: Local Group Bias Detection by ClusteringCode0
Fine-grained Classification of Political Bias in German News: A Data Set and Initial ExperimentsCode0
Fair is Better than Sensational:Man is to Doctor as Woman is to DoctorCode0
Forward Composition Propagation for Explainable Neural ReasoningCode0
Don’t Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation TechniquesCode0
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark DatasetsCode0
Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLPCode0
A Domain-adaptive Pre-training Approach for Language Bias Detection in NewsCode0
Don't Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation TechniquesCode0
Detection of Puffery on the English WikipediaCode0
A Unified Comparison of User Modeling Techniques for Predicting Data Interaction and Detecting Exploration BiasCode0
fairmodels: A Flexible Tool For Bias Detection, Visualization, And MitigationCode0
A methodology to characterize bias and harmful stereotypes in natural language processing in Latin AmericaCode0
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative ModelsCode0
Automated Dependence PlotsCode0
Detecting Media Bias in News Articles using Gaussian Bias DistributionsCode0
Show:102550
← PrevPage 2 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2GemmaBest-of0.41Unverified
3BaselineBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified