Breast cancer is the most common cancer affecting women globally. Despite the significant impact of deep learning models on breast cancer diagnosis and treatment, achieving fairness or equitable outcomes across diverse populations remains a challenge when some demographic groups are underrepresented in the training data. We quantified the bias of models trained to predict breast cancer stage from a dataset consisting of 1000 biopsies from 842 patients provided by AIM-Ahead (Artificial Intelligence/Machine Learning Consortium to Advance Health Equity and Researcher Diversity). Notably, the majority of data (over 70%) were from White patients. We found that prior to post-processing adjustments, all deep learning models we trained consistently performed better for White patients than for non-White patients. After model calibration, we observed mixed results, with only some models demonstrating improved performance. This work provides a case study of bias in breast cancer medical imaging models and highlights the challenges in using post-processing to attempt to achieve fairness.
Keyphrases
- deep learning
- machine learning
- end stage renal disease
- artificial intelligence
- ejection fraction
- newly diagnosed
- chronic kidney disease
- peritoneal dialysis
- big data
- squamous cell carcinoma
- public health
- type diabetes
- metabolic syndrome
- mental health
- patient reported outcomes
- electronic health record
- risk assessment
- patient reported
- human health