SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 101150 of 199 papers

TitleStatusHype
Personalized Detection of Cognitive Biases in Actions of Users from Their Logs: Anchoring and Recency Biases0
Pseudo-labelling Enhanced Media Bias Detection0
Quantifying Risk Propensities of Large Language Models: Ethical Focus and Bias Detection through Role-Play0
The Impact of Unstated Norms in Bias Analysis of Language Models0
Robots Enact Malignant Stereotypes0
Sample Complexity of Bias Detection with Subsampled Point-to-Subspace Distances0
Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms0
Sexism in the Judiciary0
Sexism in the Judiciary: The Importance of Bias Definition in NLP and In Our Courts0
Sparse Interventions in Language Models with Differentiable Masking0
STOOD-X methodology: using statistical nonparametric test for OOD Detection Large-Scale datasets enhanced with explainability0
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset0
Towards Integrating Fairness Transparently in Industrial Applications0
Target-Aware Contextual Political Bias Detection in News0
Team Kermit-the-frog at SemEval-2019 Task 4: Bias Detection Through Sentiment Analysis and Simple Linguistic Features0
The Impact of Presentation Style on Human-In-The-Loop Detection of Algorithmic Bias0
The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes0
Toward Holistic Evaluation of Recommender Systems Powered by Generative Models0
Towards A Reliable Ground-Truth For Biased Language Detection0
Towards Detecting Political Bias in Hindi News Articles0
Towards Equitable AI: Detecting Bias in Using Large Language Models for Marketing0
Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation0
Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks0
Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation0
Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models0
Unboxing Occupational Bias: Grounded Debiasing of LLMs with U.S. Labor Data0
Uncovering Biases with Reflective Large Language Models0
Unlocking Bias Detection: Leveraging Transformer-Based Models for Content Analysis0
Unmasking Bias in AI: A Systematic Review of Bias Detection and Mitigation Strategies in Electronic Health Record-based Models0
Unmasking Conversational Bias in AI Multiagent Systems0
Unsupervised Bias Detection in College Student Newspapers0
Visual Reasoning Evaluation of Grok, Deepseek Janus, Gemini, Qwen, Mistral, and ChatGPT0
With a Grain of SALT: Are LLMs Fair Across Social Dimensions?0
Efficient Fairness Testing in Large Language Models: Prioritizing Metamorphic Relations for Bias Detection0
Efficient Gender Debiasing of Pre-trained Indic Language Models0
Enhancing Bias Detection in Political News Using Pragmatic Presupposition0
Mitigating the Risk of Health Inequity Exacerbated by Large Language Models0
Epistemological Bias As a Means for the Automated Detection of Injustices in Text0
Evaluating AI fairness in credit scoring with the BRIO tool0
Evaluating Fairness Metrics in the Presence of Dataset Bias0
Experiments in News Bias Detection with Pre-Trained Neural Transformers0
Don’t Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation TechniquesCode0
A methodology to characterize bias and harmful stereotypes in natural language processing in Latin AmericaCode0
Context in Informational Bias DetectionCode0
To Bias or Not to Bias: Detecting bias in News with bias-detectorCode0
Multilingual sentence-level bias detection in WikipediaCode0
Don't Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation TechniquesCode0
MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of ExpressionsCode0
My Approach = Your Apparatus? Entropy-Based Topic Modeling on Multiple Domain-Specific Text CollectionsCode0
Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAPCode0
Show:102550
← PrevPage 3 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2BaselineBest-of0.41Unverified
3GemmaBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified