Login / Signup

Sharing Data from the Human Tumor Atlas Network through Standards, Infrastructure, and Community Engagement.

Ino De BruijnMilen NikolovClarisse LauAshley ClaytonDavid L GibbsElvira MitrakaDar'ya PozhidayevaAlex LashSelcuk Onur SumerJennifer AltreuterKristen AntonMialy DeFeliceXiang LiAaron LismanWilliam J R LongabaughJeremy MuhlichSandro SantagataSubhiksha NandakumarPeter K SorgerChristine SuverNikolaus SchultzAdam J TaylorVesteinn ThorssonEthan CeramiJames A Eddy
Published in: bioRxiv : the preprint server for biology (2024)
The Data Coordinating Center (DCC) of the Human Tumor Atlas Network (HTAN) has played a crucial role in enabling the broad sharing and effective utilization of HTAN data within the scientific community. Data from the first phase of HTAN are now available publicly. We describe the diverse datasets and modalities shared, multiple access routes to HTAN assay data and metadata, data standards, technical infrastructure and governance approaches, as well as our approach to sustained community engagement. HTAN data can be accessed via the HTAN Portal, explored in visualization tools-including CellxGene, Minerva, and cBioPortal-and analyzed in the cloud through the NCI Cancer Research Data Commons nodes. We have developed a streamlined infrastructure to ingest and disseminate data by leveraging the Synapse platform. Taken together, the HTAN DCC's approach demonstrates a successful model for coordinating, standardizing, and disseminating complex cancer research data via multiple resources in the cancer data ecosystem, offering valuable insights for similar consortia, and researchers looking to leverage HTAN data.
Keyphrases
  • electronic health record
  • big data
  • healthcare
  • mental health
  • social media
  • endothelial cells
  • early stage
  • lymph node
  • single cell
  • neoadjuvant chemotherapy
  • rna seq