SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 2650 of 199 papers

TitleStatusHype
Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool0
Unmasking Conversational Bias in AI Multiagent Systems0
Decoding News Bias: Multi Bias Detection in News Articles0
Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers0
ViLBias: A Comprehensive Framework for Bias Detection through Linguistic and Visual Cues , presenting Annotation Strategies, Evaluation, and Key ChallengesCode0
Improved Models for Media Bias Detection and Subcategorization0
MT-LENS: An all-in-one Toolkit for Better Machine Translation EvaluationCode1
Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation0
MediaSpin: Exploring Media Bias Through Fine-Grained Analysis of News Headlines0
Bias Analysis of AI Models for Undergraduate Student Admissions0
The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias DetectionCode0
Bias in Large Language Models: Origin, Evaluation, and Mitigation0
Mitigating Bias in Queer Representation within Large Language Models: A Collaborative Agent ApproachCode0
Current State-of-the-Art of Bias Detection and Mitigation in Machine Translation for African and European Languages: a Review0
Quantifying Risk Propensities of Large Language Models: Ethical Focus and Bias Detection through Role-Play0
Can We Trust AI Agents? A Case Study of an LLM-Based Multi-Agent System for Ethical AI0
debiaSAE: Benchmarking and Mitigating Vision-Language Model BiasCode0
With a Grain of SALT: Are LLMs Fair Across Social Dimensions?0
GUS-Net: Social Bias Classification in Text with Generalizations, Unfairness, and StereotypesCode0
TinyEmo: Scaling down Emotional Reasoning via Metric ProjectionCode0
Mitigating the Risk of Health Inequity Exacerbated by Large Language Models0
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM InteractionsCode0
Counterfactual Token Generation in Large Language ModelsCode1
Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation0
Explainable AI for computational pathology identifies model limitations and tissue biomarkersCode1
Show:102550
← PrevPage 2 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2BaselineBest-of0.41Unverified
3GemmaBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified