SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racist, sexist, and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/).

Papers

Showing 11-20 of 199 papers

Title | Status | Hype
Toward Holistic Evaluation of Recommender Systems Powered by Generative Models |  | 0
Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles |  | 0
STOOD-X methodology: using statistical nonparametric test for OOD Detection Large-Scale datasets enhanced with explainability |  | 0
On the Mutual Influence of Gender and Occupation in LLM Representations |  | 0
Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases |  | 0
Cognitive Bias Detection Using Advanced Prompt Engineering |  | 0
Visual Reasoning Evaluation of Grok, Deepseek Janus, Gemini, Qwen, Mistral, and ChatGPT |  | 0
Robust Bias Detection in MLMs and its Application to Human Trait Ratings | Code | 0
Detecting Linguistic Bias in Government Documents Using Large language Models |  | 0
Towards Equitable AI: Detecting Bias in Using Large Language Models for Marketing |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-2 (small) | ICAT Score | 72.97 |  | Unverified
2 | XLNet (large) | ICAT Score | 72.03 |  | Unverified
3 | GPT-2 (medium) | ICAT Score | 71.73 |  | Unverified
4 | BERT (base) | ICAT Score | 71.21 |  | Unverified
5 | GPT-2 (large) | ICAT Score | 70.54 |  | Unverified
6 | BERT (large) | ICAT Score | 69.89 |  | Unverified
7 | RoBERTa (base) | ICAT Score | 67.5 |  | Unverified
8 | GAL 120B | ICAT Score | 65.6 |  | Unverified
9 | XLNet (base) | ICAT Score | 62.1 |  | Unverified
10 | GPT-3 (text-davinci-002) | ICAT Score | 60.8 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | Best-of | 0.5 |  | Unverified
2 | Baseline | Best-of | 0.41 |  | Unverified
3 | Gemma | Best-of | 0.41 |  | Unverified
4 | Mistral | Best-of | 0.36 |  | Unverified
5 | Llama2 | Best-of | 0.34 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | BAD | ICAT Score | 23.44 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | RandomForest_default_hyperparameters | Accuracy (%) | 49 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa+ALBERT | F1 | 70.4 |  | Unverified