SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 176199 of 199 papers

TitleStatusHype
Enhancing Bias Detection in Political News Using Pragmatic Presupposition0
Towards Integrating Fairness Transparently in Industrial Applications0
Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like BiasesCode1
NewB: 200,000+ Sentences for Political Bias DetectionCode0
Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor0
Towards explainable classifiers using the counterfactual approach -- global explanations for discovering bias in dataCode1
Annotating and Analyzing Biased Sentences in News Articles using Crowdsourcing0
The Impact of Presentation Style on Human-In-The-Loop Detection of Algorithmic Bias0
StereoSet: Measuring stereotypical bias in pretrained language modelsCode1
InsideBias: Measuring Bias in Deep Networks and Application to Face Gender Biometrics0
Designing Tools for Semi-Automated Detection of Machine Learning Biases: An Interview Study0
Towards Detection of Subjective Bias using Contextualized Word EmbeddingsCode0
Bias in word embeddings0
Automated Dependence PlotsCode0
My Approach = Your Apparatus? Entropy-Based Topic Modeling on Multiple Domain-Specific Text CollectionsCode0
Accurate Uncertainty Estimation and Decomposition in Ensemble Learning0
Predicting the Leading Political Ideology of YouTube Channels Using Acoustic, Textual, and Metadata InformationCode0
Multilingual sentence-level bias detection in WikipediaCode0
Detecting Political Bias in News Articles Using Headline Attention0
Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word CategoriesCode0
Team Kermit-the-frog at SemEval-2019 Task 4: Bias Detection Through Sentiment Analysis and Simple Linguistic Features0
Fair is Better than Sensational:Man is to Doctor as Woman is to DoctorCode0
Evaluating Fairness Metrics in the Presence of Dataset Bias0
Large-scale news entity sentiment analysis0
Show:102550
← PrevPage 8 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2BaselineBest-of0.41Unverified
3GemmaBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified