Plasma proteomic associations with genetics and health in the UK Biobank.
Benjamin B SunJoshua ChiouMatthew TraylorChristian BennerYi-Hsiang HsuTom G RichardsonPraveen SurendranAnubha MahajanChloe RobinsSteven G Vasquez-GrinnellLiping HouErika M KvikstadOliver S BurrenJonathan DavitteKyle L FerberChristopher E GilliesÅsa K HedmanSile HuTinchi LinRajesh MikkilineniRion K PendergrassCorran PickeringBram PrinsDenis BairdChia-Yen ChenLucas D WardAimee M DeatonSamantha WelshCarissa M WillisNick LehnerMatthias ArnoldMaria A WörheideKarsten SuhreGabi KastenmüllerAnurag SethiMadeleine CuleAnil Rajnull nullnull nullnull nullnull nullnull nullnull nullnull nullnull nullnull nullLucy Burkitt-GrayEugene MelamudMary Helen BlackEric B FaumanJoanna M M HowsonHyun Min KangMark I McCarthyPaul NioiSlavé PetrovskiRobert A ScottErin N SmithSandor SzalmaDawn M WaterworthLyndon J MitnaulJoseph D SzustakowskiBradford W GibsonMelissa R MillerChristopher D WhelanPublished in: Nature (2023)
The Pharma Proteomics Project is a precompetitive biopharmaceutical consortium characterizing the plasma proteomic profiles of 54,219 UK Biobank participants. Here we provide a detailed summary of this initiative, including technical and biological validations, insights into proteomic disease signatures, and prediction modelling for various demographic and health indicators. We present comprehensive protein quantitative trait locus (pQTL) mapping of 2,923 proteins that identifies 14,287 primary genetic associations, of which 81% are previously undescribed, alongside ancestry-specific pQTL mapping in non-European individuals. The study provides an updated characterization of the genetic architecture of the plasma proteome, contextualized with projected pQTL discovery rates as sample sizes and proteomic assay coverages increase over time. We offer extensive insights into trans pQTLs across multiple biological domains, highlight genetic influences on ligand-receptor interactions and pathway perturbations across a diverse collection of cytokines and complement networks, and illustrate long-range epistatic effects of ABO blood group and FUT2 secretor status on proteins with gastrointestinal tissue-enriched expression. We demonstrate the utility of these data for drug discovery by extending the genetic proxied effects of protein targets, such as PCSK9, on additional endpoints, and disentangle specific genes and proteins perturbed at loci associated with COVID-19 susceptibility. This public-private partnership provides the scientific community with an open-access proteomics resource of considerable breadth and depth to help to elucidate the biological mechanisms underlying proteo-genomic discoveries and accelerate the development of biomarkers, predictive models and therapeutics 1 .
Keyphrases
- poor prognosis
- genome wide
- long non coding rna
- healthcare
- label free
- copy number
- dna methylation
- mental health
- drug discovery
- high resolution
- public health
- mass spectrometry
- coronavirus disease
- quality improvement
- small molecule
- high throughput
- sars cov
- genome wide association study
- health information
- binding protein
- gene expression
- cross sectional
- high density
- risk assessment
- deep learning
- big data
- emergency department
- genome wide identification
- artificial intelligence
- human health