Assessing the low complexity of protein sequences via the low complexity triangle.
Pablo Mier, Miguel A Andrade-Navarro. Published in: PloS one (2020)
The low complexity triangle proves to be a suitable procedure for representing the overall low complexity of a single sequence or of a protein dataset. Homorepeats, direpeats, compositionally biased regions and globular regions occupy characteristic positions in the triangle. The described pipeline can be used to characterize low complexity regions (LCRs) and may help in quantifying the content of degenerated tandem repeats in proteins and proteomes.
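As a rough illustration of what flagging low complexity in a protein sequence can look like, the sketch below scores sliding windows by Shannon entropy of their amino acid composition. This is a generic, commonly used stand-in and not the low complexity triangle pipeline itself, whose details are not given in this abstract. The function names, the window size of 12, the entropy threshold of 2.0 bits and the example sequence are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def shannon_entropy(window: str) -> float:
    """Shannon entropy (bits) of the residue composition of a window."""
    n = len(window)
    counts = Counter(window)
    # Sum p * log2(1/p) over the residues present in the window.
    return sum((c / n) * math.log2(n / c) for c in counts.values())


def low_complexity_windows(seq: str, size: int = 12, threshold: float = 2.0):
    """Return (start, entropy) for each window with entropy <= threshold.

    `size` and `threshold` are illustrative assumptions, not values
    taken from the paper.
    """
    hits = []
    for start in range(len(seq) - size + 1):
        h = shannon_entropy(seq[start:start + size])
        if h <= threshold:
            hits.append((start, h))
    return hits


if __name__ == "__main__":
    # Hypothetical sequence containing a polyQ homorepeat.
    example = "MKTAYIAKQRQQQQQQQQQQQQLSDEVGHT"
    for start, h in low_complexity_windows(example):
        print(start, round(h, 3))
```

A homorepeat such as a polyQ tract gives windows with entropy near zero, whereas a window drawn from a typical globular region uses many different residues and scores much higher. A single entropy score cannot by itself separate homorepeats, direpeats and compositionally biased regions.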