RAFTS3G: an efficient and versatile clustering software to analyses in large protein datasets.
Bruno Thiago de Lima NichioAryel Marlus Repula de OliveiraCamilla Reginatto de PierriLeticia Graziela Costa SantosAlexandre Quadros LejambreRicardo Assunção VialleNilson Antônio da Rocha CoimbraDieval GuizeliniJeroniza Nunes MarchaukoskiFabio de Oliveira PedrosaRoberto Tadeu RaittzPublished in: BMC bioinformatics (2019)
In general, RAFTS3G is able to group up to millions of biological sequences into large datasets, which is a remarkable option of efficiency in clustering. RAFTS3G compared to other "standard-gold" methods in the clustering of large biological data maintains the balance between the reduction of biological information redundancy and the creation of consistent groups. We bring the binary search concept applied to grouped sequences which shows maintaining sensitivity/accuracy relation and up to minimize the time of data generated with RAFTS3G process.