The genome sequence of the orchid Phalaenopsis equestris.
Jing Cai, Xin Liu, Kevin Vanneste, Sebastian Proost, Wen-Chieh Tsai, Ke-Wei Liu, Li-Jun Chen, Ying He, Qing Xu, Chao Bian, Zhijun Zheng, Fengming Sun, Weiqing Liu, Yu-Yun Hsiao, Zhao-Jun Pan, Chia-Chi Hsu, Ya-Ping Yang, Yi-Chin Hsu, Yu-Chen Chuang, Anne Dievart, Jean-Francois Dufayard, Xun Xu, Jun-Yi Wang, Jun Wang, Xin-Ju Xiao, Xue-Min Zhao, Rong Du, Guo-Qiang Zhang, Meina Wang, Yong-Yu Su, Gao-Chang Xie, Guo-Hui Liu, Li-Qiang Li, Lai-Qiang Huang, Yi-Bo Luo, Hong-Hwa Chen, Yves Van de Peer, Zhong-Jian Liu
Published in: Nature Genetics (2014)
Orchidaceae, renowned for its spectacular flowers and other reproductive and ecological adaptations, is one of the most diverse plant families. Here we present the genome sequence of the tropical epiphytic orchid Phalaenopsis equestris, a frequently used parent species for orchid breeding. P. equestris is the first plant with crassulacean acid metabolism (CAM) for which the genome has been sequenced. Our assembled genome contains 29,431 predicted protein-coding genes. We find that contigs likely to be underassembled, owing to heterozygosity, are enriched for genes that might be involved in self-incompatibility pathways. We find evidence for an orchid-specific paleopolyploidy event that preceded the radiation of most orchid clades, and our results suggest that gene duplication might have contributed to the evolution of CAM photosynthesis in P. equestris. Finally, we find expanded and diversified families of MADS-box C/D-class, B-class AP3 and AGL6-class genes, which might contribute to the highly specialized morphology of orchid flowers.