Genome Clusters

Click here to access the tool for retrieving clusters of similar genomes, as defined at different genomic similarity score (GSS) or DNA signature thresholds.

Citation: Moreno-Hagelsieb G, Wang Z, Walsh S, ElSherbiny A (2013) Phylogenomic clustering for selecting non-redundant genomes for comparative genomics. Bioinformatics 29 (7): 947-949.

Here you will find data on genomic similarity scores (GSS) and their use for reducing the set of genomes in our database to a non-redundant set for comparative genomics, or for whatever else you might want to do with each group or cluster of similar genomes. The similarity measure is explained in several articles by members of our group. We also provide distances between genomes based on di-, tri- and tetra-nucleotide signatures.

The R scripts require three libraries: cluster, MCMCpack and ape. The scripts produce groups of redundant genomes, transform the score tables into matrices, and write dendrogram files in Newick format (a common format for phylogenetic trees).
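To make the idea of grouping redundant genomes at a threshold more concrete, here is a minimal Python sketch. It is not the R scripts mentioned above and does not claim to reproduce their method. It assumes a score table with one row per genome pair, scores between 0 and 1 where higher means more similar, and a self-score of 1.0. Grouping is done by simple single-linkage (any two genomes at or above the threshold end up in the same group), which is a choice made for this illustration. The genome names and scores are hypothetical.

```python
from collections import defaultdict


def score_matrix(pairs, default=0.0):
    """Turn (genome_a, genome_b, score) rows into a symmetric matrix.

    The matrix is a dict of dicts. Pairs missing from the table get
    `default`; the diagonal is set to 1.0 (an assumption of this sketch).
    """
    names = sorted({g for a, b, _ in pairs for g in (a, b)})
    matrix = {a: {b: (1.0 if a == b else default) for b in names} for a in names}
    for a, b, score in pairs:
        matrix[a][b] = matrix[b][a] = score
    return names, matrix


def cluster_at_threshold(names, matrix, threshold):
    """Single-linkage grouping: join genomes whose score is >= threshold."""
    parent = {n: n for n in names}

    def find(n):
        # Follow parent links to the group root, shortening the path as we go.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if matrix[a][b] >= threshold:
                parent[find(a)] = find(b)

    groups = defaultdict(list)
    for n in names:
        groups[find(n)].append(n)
    return sorted(sorted(g) for g in groups.values())


def non_redundant(clusters):
    """Pick one representative per cluster (here: first name alphabetically)."""
    return [cluster[0] for cluster in clusters]


# Hypothetical score table, for illustration only.
pairs = [
    ("ecoli_K12", "ecoli_O157", 0.98),
    ("ecoli_K12", "salmonella_LT2", 0.80),
    ("ecoli_O157", "salmonella_LT2", 0.79),
    ("bsubtilis_168", "ecoli_K12", 0.40),
]

names, matrix = score_matrix(pairs)
for threshold in (0.95, 0.75):
    clusters = cluster_at_threshold(names, matrix, threshold)
    print(threshold, clusters, non_redundant(clusters))
```

Lowering the threshold merges more genomes into each group and so yields a smaller non-redundant set. How the representative of each group should be chosen (for example by annotation quality) depends on the analysis; the alphabetical choice here is only a placeholder.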

  1. February 10th, 2013
