Cohort profile: St. Michael's Hospital Tuberculosis Database (SMH-TB), a retrospective cohort of electronic health record data and variables extracted using natural language processing.
David Landsman, Ahmed Abdelbasit, Christine Wang, Michael Guerzhoy, Ujash Joshi, Shaun Mathew, Chloe Pou-Prom, David Dai, Victoria Pequegnat, Joshua Murray, Kamalprit Chokar, Michaelia Banning, Muhammad Mamdani, Sharmistha Mishra, Jane Batt
Published in: PloS one (2021)
SMH-TB is a unique database that includes a breadth of structured data derived from structured and unstructured EHR data by using NLP rulesets. The data are available for a variety of research applications, such as clinical epidemiology, quality improvement and mathematical modeling studies.