SOTAVerified

Fairness

Papers

Showing 150 of 5676 papers

TitleStatusHype
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model ServingCode7
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
h2oGPT: Democratizing Large Language ModelsCode6
Visual Identification of Problematic Bias in Large Label SpacesCode5
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic DataCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and ChallengesCode4
Aequitas Flow: Streamlining Fair ML ExperimentationCode4
Continual Learning with Pre-Trained Models: A SurveyCode4
TrustLLM: Trustworthiness in Large Language ModelsCode4
GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated LearningCode4
Data quality dimensions for fair AICode4
Holistic Evaluation of Language ModelsCode4
RecBole 2.0: Towards a More Up-to-Date Recommendation LibraryCode4
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual ModelsCode4
Kubric: A scalable dataset generatorCode4
Fairness Implications of Encoding Protected Categorical AttributesCode4
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language ModelsCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
Calibre: Towards Fair and Accurate Personalized Federated Learning with Self-Supervised LearningCode3
An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use CasesCode3
CBGBench: Fill in the Blank of Protein-Molecule Complex Binding GraphCode3
Theoretically Achieving Continuous Representation of Oriented Bounding BoxesCode3
A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray InterpretationCode3
Fairness in Serving Large Language ModelsCode3
PiML Toolbox for Interpretable Machine Learning Model Development and DiagnosticsCode3
FreeMatch: Self-adaptive Thresholding for Semi-supervised LearningCode3
Multi-objective Asynchronous Successive HalvingCode3
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive ProgrammingCode2
TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh OptimizationCode2
Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language ModelsCode2
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A SurveyCode2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous DrivingCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
Multi-Agent Large Language Models for Conversational Task-SolvingCode2
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 OutlookCode2
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence ActCode2
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language ModelsCode2
LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorchCode2
Towards AI-Powered Video Assistant Referee System (VARS) for Association FootballCode2
FairDiff: Fair Segmentation with Point-Image DiffusionCode2
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation ModelsCode2
TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation LearningCode2
CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language ModelsCode2
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness BenchmarkCode2
CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image FormatsCode2
FreeVA: Offline MLLM as Training-Free Video AssistantCode2
Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM EraCode2
FairCLIP: Harnessing Fairness in Vision-Language LearningCode2
Debiasing Multimodal Large Language ModelsCode2
Show:102550
← PrevPage 1 of 114Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)99.86Unverified
21D-CSNNPredictive Equality (age)97.8Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)96.87Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)98.97Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)98.45Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)98.68Unverified
#ModelMetricClaimedVerifiedStatus
11D-CSNNPredictive Equality (age)99.31Unverified
#ModelMetricClaimedVerifiedStatus
1Neighbour LearningDegree of Bias (DoB)0.49Unverified
#ModelMetricClaimedVerifiedStatus
1Neighbour LearningDegree of Bias (DoB)6.26Unverified
#ModelMetricClaimedVerifiedStatus
1Neighbour LearningDegree of Bias (DoB)1.96Unverified