SOTAVerified

Fairness

Papers

Showing 150 of 5676 papers

TitleStatusHype
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model ServingCode7
h2oGPT: Democratizing Large Language ModelsCode6
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
Visual Identification of Problematic Bias in Large Label SpacesCode5
Data quality dimensions for fair AICode4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and ChallengesCode4
Continual Learning with Pre-Trained Models: A SurveyCode4
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual ModelsCode4
Aequitas Flow: Streamlining Fair ML ExperimentationCode4
Fairness Implications of Encoding Protected Categorical AttributesCode4
TrustLLM: Trustworthiness in Large Language ModelsCode4
Holistic Evaluation of Language ModelsCode4
Kubric: A scalable dataset generatorCode4
GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated LearningCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
RecBole 2.0: Towards a More Up-to-Date Recommendation LibraryCode4
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic DataCode4
FreeMatch: Self-adaptive Thresholding for Semi-supervised LearningCode3
Multi-objective Asynchronous Successive HalvingCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
Fairness in Serving Large Language ModelsCode3
An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use CasesCode3
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language ModelsCode3
Theoretically Achieving Continuous Representation of Oriented Bounding BoxesCode3
CBGBench: Fill in the Blank of Protein-Molecule Complex Binding GraphCode3
Calibre: Towards Fair and Accurate Personalized Federated Learning with Self-Supervised LearningCode3
A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray InterpretationCode3
PiML Toolbox for Interpretable Machine Learning Model Development and DiagnosticsCode3
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language ModelsCode2
LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorchCode2
Large Language Models are Geographically BiasedCode2
Balanced MSE for Imbalanced Visual RegressionCode2
LEACE: Perfect linear concept erasure in closed formCode2
Multi-Agent Large Language Models for Conversational Task-SolvingCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
Graph Condensation: A SurveyCode2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous DrivingCode2
FreeVA: Offline MLLM as Training-Free Video AssistantCode2
A Prescription of Methodological Guidelines for Comparing Bio-inspired Optimization AlgorithmsCode2
FairCLIP: Harnessing Fairness in Vision-Language LearningCode2
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation ModelsCode2
Debiasing Multimodal Large Language ModelsCode2
Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound PropagationCode2
FairDiff: Fair Segmentation with Point-Image DiffusionCode2
Fairness Evaluation for Uplift Modeling in the Absence of Ground TruthCode2
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive ProgrammingCode2
Dawn of the transformer era in speech emotion recognition: closing the valence gapCode2
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence ActCode2
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness BenchmarkCode2
CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image FormatsCode2
Show:102550
← PrevPage 1 of 114Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)99.86Unverified
21D-CSNNPredictive Equality (age)97.8Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)96.87Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)98.97Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)98.45Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)98.68Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)99.31Unverified
#ModelMetricClaimedVerifiedStatus
1Neighbour LearningDegree of Bias (DoB)0.49Unverified
#ModelMetricClaimedVerifiedStatus
1Neighbour LearningDegree of Bias (DoB)6.26Unverified
#ModelMetricClaimedVerifiedStatus
1Neighbour LearningDegree of Bias (DoB)1.96Unverified