Variant antigen repertoires in Trypanosoma congolense populations and experimental infections can be profiled from deep sequence data using universal protein motifs.
Sara Silva PereiraAitor Casas-SanchezLee R HainesMoses OgugoKihara AbsolomonMandy SandersSteve KempÁlvaro Acosta-SerranoHarry NoyesMatthew BerrimanAndrew P JacksonPublished in: Genome research (2018)
African trypanosomes are vector-borne hemoparasites of humans and animals. In the mammal, parasites evade the immune response through antigenic variation. Periodic switching of the variant surface glycoprotein (VSG) coat covering their cell surface allows sequential expansion of serologically distinct parasite clones. Trypanosome genomes contain many hundreds of VSG genes, subject to rapid changes in nucleotide sequence, copy number, and chromosomal position. Thus, analyzing, or even quantifying, VSG diversity over space and time presents an enormous challenge to conventional techniques. Indeed, previous population genomic studies have overlooked this vital aspect of pathogen biology for lack of analytical tools. Here we present a method for analyzing population-scale VSG diversity in Trypanosoma congolense from deep sequencing data. Previously, we suggested that T. congolense VSGs segregate into defined "phylotypes" that do not recombine. In our data set comprising 41 T. congolense genome sequences from across Africa, these phylotypes are universal and exhaustive. Screening sequence contigs with diagnostic protein motifs accurately quantifies relative phylotype frequencies, providing a metric of VSG diversity, called the "variant antigen profile." We applied our metric to VSG expression in the tsetse fly, showing that certain, rare VSG phylotypes may be preferentially expressed in infective, metacyclic-stage parasites. Hence, variant antigen profiling accurately and rapidly determines the T. congolense VSG gene and transcript repertoire from sequence data, without need for manual curation or highly contiguous sequences. It offers a tractable approach to measuring VSG diversity across strains and during infections, which is imperative to understanding the host-parasite interaction at population and individual scales.
Keyphrases
- copy number
- genome wide
- mitochondrial dna
- electronic health record
- immune response
- big data
- plasmodium falciparum
- amino acid
- cell surface
- poor prognosis
- escherichia coli
- dna methylation
- single cell
- binding protein
- mass spectrometry
- toll like receptor
- protein protein
- long non coding rna
- genetic diversity
- small molecule
- liquid chromatography
- rna seq
- trypanosoma cruzi
- deep learning