Adaptive protein evolution through length variation of short tandem repeats in Arabidopsis .
William B ReinarAnne GreulichIda M StøJonfinn B KnutsenTrond ReitanOle Kristian TørresenSissel JentoftMelinka A ButenkoKjetill Sigurd JakobsenPublished in: Science advances (2023)
Intrinsically disordered protein regions are of high importance for biotic and abiotic stress responses in plants. Tracts of identical amino acids accumulate in these regions and can vary in length over generations because of expansions and retractions of short tandem repeats at the genomic level. However, little attention has been paid to what extent length variation is shaped by natural selection. By environmental association analysis on 2514 length variable tracts in 770 whole-genome sequenced Arabidopsis thaliana , we show that length variation in glutamine and asparagine amino acid homopolymers, as well as in interaction hotspots, correlate with local bioclimatic habitat. We determined experimentally that the promoter activity of a light-stress gene depended on polyglutamine length variants in a disordered transcription factor. Our results show that length variations affect protein function and are likely adaptive. Length variants modulating protein function at a global genomic scale has implications for understanding protein evolution and eco-evolutionary biology.