SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 101125 of 199 papers

TitleStatusHype
Personalized Detection of Cognitive Biases in Actions of Users from Their Logs: Anchoring and Recency Biases0
Pseudo-labelling Enhanced Media Bias Detection0
Quantifying Risk Propensities of Large Language Models: Ethical Focus and Bias Detection through Role-Play0
The Impact of Unstated Norms in Bias Analysis of Language Models0
Robots Enact Malignant Stereotypes0
Sample Complexity of Bias Detection with Subsampled Point-to-Subspace Distances0
Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms0
Sexism in the Judiciary0
Sexism in the Judiciary: The Importance of Bias Definition in NLP and In Our Courts0
Sparse Interventions in Language Models with Differentiable Masking0
STOOD-X methodology: using statistical nonparametric test for OOD Detection Large-Scale datasets enhanced with explainability0
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset0
Towards Integrating Fairness Transparently in Industrial Applications0
Target-Aware Contextual Political Bias Detection in News0
Team Kermit-the-frog at SemEval-2019 Task 4: Bias Detection Through Sentiment Analysis and Simple Linguistic Features0
The Impact of Presentation Style on Human-In-The-Loop Detection of Algorithmic Bias0
The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes0
Toward Holistic Evaluation of Recommender Systems Powered by Generative Models0
Towards A Reliable Ground-Truth For Biased Language Detection0
Towards Detecting Political Bias in Hindi News Articles0
Towards Equitable AI: Detecting Bias in Using Large Language Models for Marketing0
Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation0
Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks0
Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation0
Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models0
Show:102550
← PrevPage 5 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2GemmaBest-of0.41Unverified
3BaselineBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified