SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 171180 of 199 papers

TitleStatusHype
How Neural Networks Organize Concepts: Introducing Concept Trajectory Analysis for Deep Learning InterpretabilityCode0
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contextsCode0
IFBiD: Inference-Free Bias DetectionCode0
Predicting the Leading Political Ideology of YouTube Channels Using Acoustic, Textual, and Metadata InformationCode0
Towards Automatic Bias Detection in Knowledge GraphsCode0
Quantifying Gender Biases Towards Politicians on RedditCode0
IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias IndicatorsCode0
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM InteractionsCode0
The BIAS Detection Framework: Bias Detection in Word Embeddings and Language Models for European LanguagesCode0
DeNetDM: Debiasing by Network Depth ModulationCode0
Show:102550
← PrevPage 18 of 20Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2BaselineBest-of0.41Unverified
3GemmaBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified