Testing domain knowledge and risk of bias of a large-scale general artificial intelligence model in mental health.
Michael V HeinzSukanya BhattacharyaBrianna TrudeauRachel QuistSeo Ho SongCamilla M LeeNicholas C JacobsonPublished in: Digital health (2023)
Our findings demonstrate initial promise in the domain knowledge of a large AI model, with performance variability perhaps due to the more salient hallmark symptoms, narrower differential diagnosis, and higher prevalence of some disorders. We found limited evidence of model demographic bias, although we do observe some gender and racial differences in model outcomes mirroring real-world differential prevalence estimates.