Making Common Fund data more findable: catalyzing a data ecosystem.
Amanda L CharbonneauArthur BradyKarl CzajkowskiJain AluvathingalSaranya CanchiRobert L CarterKyle ChardDaniel J B ClarkeJonathan CrabtreeHeather H CreasyMike D'ArcyVictor FelixMichelle G GiglioAlicia A GingrichRayna Michelle HarrisTheresa K HodgesOlukemi IfeonuMinji JeonEryk KropiwnickiMarisa C W LimR Lee LimingJessica LumianAnup A MahurkarMeisha MandalJames B MunroSuvarna NadendlaRudyard RichterCia RomanoPhilippe Rocca-SerraMichael SchorRobert E SchulerHongsuda TangmunarunkitAlexander M WaldropCris WilliamsKaren WordSusanna-Assunta SansoneAvi Ma'ayanRick WagnerIan T FosterCarl KesselmanC Titus BrownOwen WhitePublished in: GigaScience (2022)
The Common Fund Data Ecosystem (CFDE) has created a flexible system of data federation that enables researchers to discover datasets from across the US National Institutes of Health Common Fund without requiring that data owners move, reformat, or rehost those data. This system is centered on a catalog that integrates detailed descriptions of biomedical datasets from individual Common Fund Programs' Data Coordination Centers (DCCs) into a uniform metadata model that can then be indexed and searched from a centralized portal. This Crosscut Metadata Model (C2M2) supports the wide variety of data types and metadata terms used by individual DCCs and can readily describe nearly all forms of biomedical research data. We detail its use to ingest and index data from 11 DCCs.