SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 76100 of 199 papers

TitleStatusHype
TinyEmo: Scaling down Emotional Reasoning via Metric ProjectionCode0
To Bias or Not to Bias: Detecting bias in News with bias-detectorCode0
Towards Automatic Bias Detection in Knowledge GraphsCode0
Towards Detection of Subjective Bias using Contextualized Word EmbeddingsCode0
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM InteractionsCode0
Trade-Offs Between Fairness and Privacy in Language ModelingCode0
Uncovering bias in the PlantVillage datasetCode0
ViLBias: A Comprehensive Framework for Bias Detection through Linguistic and Visual Cues , presenting Annotation Strategies, Evaluation, and Key ChallengesCode0
Evaluating Fairness Metrics in the Presence of Dataset Bias0
Experiments in News Bias Detection with Pre-Trained Neural Transformers0
Auditing Algorithmic Fairness in Machine Learning for Health with Severity-Based LOGAN0
Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool0
Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles0
A Survey on Predicting the Factuality and the Bias of News Media0
Extending Variability-Aware Model Selection with Bias Detection in Machine Learning Projects0
Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor0
Sexism in the Judiciary0
Sexism in the Judiciary: The Importance of Bias Definition in NLP and In Our Courts0
Fairness via AI: Bias Reduction in Medical Information0
FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing0
Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases0
Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models0
Sparse Interventions in Language Models with Differentiable Masking0
A Study on Bias Detection and Classification in Natural Language Processing0
A Deep Dive into Effects of Structural Bias on CMA-ES Performance along Affine Trajectories0
Show:102550
← PrevPage 4 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2GemmaBest-of0.41Unverified
3BaselineBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified