
Bias Detection

Bias detection is the task of detecting and measuring racism, sexism, and other discriminatory behavior in a model (Source: https://stereoset.mit.edu/).

Papers


Showing 11–20 of 199 papers

Title	Date	Tasks	Status	Hype
Toward Holistic Evaluation of Recommender Systems Powered by Generative Models	Apr 9, 2025	Bias Detection, Recommendation Systems	Unverified	0
Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles	Apr 4, 2025	Articles, Bias Detection	Unverified	0
STOOD-X methodology: using statistical nonparametric test for OOD Detection Large-Scale datasets enhanced with explainability	Apr 3, 2025	Bias Detection, Out of Distribution (OOD) Detection	Unverified	0
On the Mutual Influence of Gender and Occupation in LLM Representations	Mar 9, 2025	Bias Detection, Occupation prediction	Unverified	0
Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases	Mar 8, 2025	Bias Detection, counterfactual	Unverified	0
Cognitive Bias Detection Using Advanced Prompt Engineering	Mar 7, 2025	Bias Detection, Decision Making	Unverified	0
Visual Reasoning Evaluation of Grok, Deepseek Janus, Gemini, Qwen, Mistral, and ChatGPT	Feb 23, 2025	Bias Detection, Visual Reasoning	Unverified	0
Robust Bias Detection in MLMs and its Application to Human Trait Ratings	Feb 21, 2025	Bias Detection	Code Available	0
Detecting Linguistic Bias in Government Documents Using Large language Models	Feb 19, 2025	Bias Detection	Unverified	0
Towards Equitable AI: Detecting Bias in Using Large Language Models for Marketing	Feb 18, 2025	Bias Detection, Marketing	Unverified	0



All datasets StereoSet rt-inod-bias ICAT LLM bias PlantVillage_8px Wiki Neutrality Corpus

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-2 (small)	ICAT Score	72.97	—	Unverified
2	XLNet (large)	ICAT Score	72.03	—	Unverified
3	GPT-2 (medium)	ICAT Score	71.73	—	Unverified
4	BERT (base)	ICAT Score	71.21	—	Unverified
5	GPT-2 (large)	ICAT Score	70.54	—	Unverified
6	BERT (large)	ICAT Score	69.89	—	Unverified
7	RoBERTa (base)	ICAT Score	67.5	—	Unverified
8	GAL 120B	ICAT Score	65.6	—	Unverified
9	XLNet (base)	ICAT Score	62.1	—	Unverified
10	GPT-3 (text-davinci-002)	ICAT Score	60.8	—	Unverified
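
The ICAT Score column above is not defined on this page. As a rough guide, and assuming these entries follow the StereoSet definition of the Idealized CAT score, the metric combines a language modeling score (lms) and a stereotype score (ss), both on a 0 to 100 scale, as lms * min(ss, 100 - ss) / 50. The sketch below only illustrates that formula; the function and parameter names are made up for this example and are not taken from any leaderboard code.

```python
def icat_score(lms: float, ss: float) -> float:
    """Idealized CAT score, assuming the StereoSet definition.

    lms: language modeling score in [0, 100] (how often the model prefers a
         meaningful continuation over an unrelated one).
    ss:  stereotype score in [0, 100] (how often the model prefers the
         stereotypical option over the anti-stereotypical one; 50 is unbiased).
    """
    if not (0.0 <= lms <= 100.0 and 0.0 <= ss <= 100.0):
        raise ValueError("lms and ss must both lie in [0, 100]")
    # min(ss, 100 - ss) peaks at ss = 50 and reaches 0 at ss = 0 or ss = 100,
    # so any deviation from the unbiased point scales the score down.
    return lms * min(ss, 100.0 - ss) / 50.0


if __name__ == "__main__":
    # Hypothetical inputs, not values from the table above.
    print(icat_score(80.0, 60.0))
```

Under this definition a model with lms = 100 and ss = 50 scores 100, and a model that always picks the stereotypical option (ss = 100) scores 0 regardless of its lms.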

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	Best-of	0.5	—	Unverified
2	Baseline	Best-of	0.41	—	Unverified
3	Gemma	Best-of	0.41	—	Unverified
4	Mistral	Best-of	0.36	—	Unverified
5	Llama2	Best-of	0.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BAD	ICAT Score	23.44	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RandomForest_default_hyperparameters	Accuracy (%)	49	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RoBERTa+ALBERT	F1	70.4	—	Unverified