Login / Signup

Measuring the impact of health research data in terms of data citations by scientific publications.

Yongmei BaiJian Du
Published in: Scientometrics (2022)
Health is a representative domain data-driven research since health research data are growingly generated at a massive scale. There is an intuitive logic that the degree to which disease burden and the number of data resources align. In order to figure out disease-specific data sharing and reuse level, we took the number of data records and their citations in the scientific literature in the Data Citation Index platform as approximate indicators. The results indicated that only a small percentage (7.5%) of health data records had received documented citations by scientific publications. We find the level of data sharing and reuse varies across diseases. Our study suggested that the more socioeconomic burden and the more research funding, the more likely scientific data for diseases will be produced and made available. But such a correlation could not be observed for the activity of data reuse. Secondary reuse of scientific data is a complex behavior.
Keyphrases
  • electronic health record
  • big data
  • healthcare
  • mental health
  • systematic review
  • data analysis
  • risk factors
  • high throughput
  • cross sectional