Login / Signup

Development and evaluation of a genome-wide Coffee 8.5K SNP array and its application for high-density genetic mapping and for investigating the origin of Coffea arabica L.

Virginie Merot-L'anthoeneRémi TournebizeOlivier DarracqVimel RattinaMaud LepelleyLaurence BellangerChristine Tranchant-DubreuilManon CouléeMarie PégardSylviane MetaironCoralie FournierPiet StoffelenSteven B JanssensCatherine KiwukaPascal MusoliUcu SumiratHyacinthe LegnatéJean-Léon KambaleJoão Ferreira da Costa NetoClara RevelAlexandre de KochkoPatrick DescombesDominique CrouzillatValerie Poncet
Published in: Plant biotechnology journal (2019)
Coffee species such as Coffea canephora P. (Robusta) and C. arabica L. (Arabica) are important cash crops in tropical regions around the world. C. arabica is an allotetraploid (2n = 4x = 44) originating from a hybridization event of the two diploid species C. canephora and C. eugenioides (2n = 2x = 22). Interestingly, these progenitor species harbour a greater level of genetic variability and are an important source of genes to broaden the narrow Arabica genetic base. Here, we describe the development, evaluation and use of a single-nucleotide polymorphism (SNP) array for coffee trees. A total of 8580 unique and informative SNPs were selected from C. canephora and C. arabica sequencing data, with 40% of the SNP located in annotated genes. In particular, this array contains 227 markers associated to 149 genes and traits of agronomic importance. Among these, 7065 SNPs (~82.3%) were scorable and evenly distributed over the genome with a mean distance of 54.4 Kb between markers. With this array, we improved the Robusta high-density genetic map by adding 1307 SNP markers, whereas 945 SNPs were found segregating in the Arabica mapping progeny. A panel of C. canephora accessions was successfully discriminated and over 70% of the SNP markers were transferable across the three species. Furthermore, the canephora-derived subgenome of C. arabica was shown to be more closely related to C. canephora accessions from northern Uganda than to other current populations. These validated SNP markers and high-density genetic maps will be useful to molecular genetics and for innovative approaches in coffee breeding.
Keyphrases
  • high density
  • genome wide
  • dna methylation
  • copy number
  • gene expression
  • high resolution
  • machine learning
  • electronic health record
  • big data
  • artificial intelligence