SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 5175 of 199 papers

TitleStatusHype
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contextsCode0
Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word CategoriesCode0
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative ModelsCode0
A methodology to characterize bias and harmful stereotypes in natural language processing in Latin AmericaCode0
GUS-Net: Social Bias Classification in Text with Generalizations, Unfairness, and StereotypesCode0
Gender Bias Detection in Court Decisions: A Brazilian Case StudyCode0
DispaRisk: Auditing Fairness Through Usable InformationCode0
Quantifying Gender Biases Towards Politicians on RedditCode0
A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet ClassifiersCode0
Second Order WinoBias (SoWinoBias) Test Set for Latent Gender Bias Detection in Coreference ResolutionCode0
Disentangling Structure and Style: Political Bias Detection in News by Inducing Document HierarchyCode0
HeteroCorpus: A Corpus for Heteronormative Language DetectionCode0
fairmodels: A Flexible Tool For Bias Detection, Visualization, And MitigationCode0
Detecting Media Bias in News Articles using Gaussian Bias DistributionsCode0
Fair is Better than Sensational:Man is to Doctor as Woman is to DoctorCode0
Fine-grained Classification of Political Bias in German News: A Data Set and Initial ExperimentsCode0
Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAPCode0
Don’t Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation TechniquesCode0
Detection of Puffery on the English WikipediaCode0
Automated Dependence PlotsCode0
Forward Composition Propagation for Explainable Neural ReasoningCode0
How Neural Networks Organize Concepts: Introducing Concept Trajectory Analysis for Deep Learning InterpretabilityCode0
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language ModelsCode0
Don't Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation TechniquesCode0
The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias DetectionCode0
Show:102550
← PrevPage 3 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2GemmaBest-of0.41Unverified
3BaselineBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified