An Online Mammography Database with Biopsy Confirmed Types.
Hong-Min Cai, Jinhua Wang, Tingting Dan, Jiao Li, Zhihao Fan, Weiting Yi, Chunyan Cui, Xinhua Jiang, Li Li. Published in: Scientific Data (2023)
Breast carcinoma is the second most common cancer among women worldwide. Early detection of breast cancer has been shown to increase the survival rate, thereby significantly extending patients' lifespan. Mammography, a noninvasive and low-cost imaging tool, is widely used to diagnose breast disease at an early stage because of its high sensitivity. Although several public mammography datasets are useful, there is still a lack of open-access datasets that extend beyond the white population, and existing datasets often lack biopsy confirmation or have unknown molecular subtypes. To fill this gap, we build a database comprising two online breast mammography datasets. The database, named the Chinese Mammography Database (CMMD), contains 3712 mammograms from 1775 patients and is divided into two branches. The first dataset, CMMD1, contains 1026 cases (2214 mammograms) with biopsy-confirmed benign or malignant tumors. The second dataset, CMMD2, includes 1498 mammograms from 749 patients with known molecular subtypes. Our database is constructed to enrich the diversity of mammography data and promote the development of related fields.
Keyphrases
- single cell
- rna seq
- end stage renal disease
- early stage
- contrast enhanced
- ejection fraction
- newly diagnosed
- healthcare
- chronic kidney disease
- low cost
- magnetic resonance imaging
- prognostic factors
- ultrasound guided
- adverse drug
- high resolution
- peritoneal dialysis
- minimally invasive
- pregnant women
- social media
- fine needle aspiration
- machine learning
- lymph node
- mental health
- health information
- fluorescence imaging
- photodynamic therapy
- electronic health record
- mass spectrometry
- patient reported
- data analysis
- drug induced
- pregnancy outcomes