- 14 proteome data used obtained from NCBI refseqDB which are enlisted in "
" and "
"
- 2 random proteomes generated by shuffling each protein's sequence order in proteome. The original proteomes obtained from Mycocosm DB, Joint Genome Institute (JGI)
- "
" contains 5 Ascomycota, 5 Basidiomycota, 3 Protists, 1 Monokaryote, and 2 randoms
- Input is proteome (amino acids), -a
- Feature length (l-mer) is 13, -s 13
- Normalize output (frequency), -n
- Output folder is "./FFP_13", manually created
./FFP_compress -a -s 13 -n 931890 ./FFP_13/931890
./FFP_compress -a -s 13 -n 332648 ./FFP_13/332648
./FFP_compress -a -s 13 -n 367775 ./FFP_13/367775
./FFP_compress -a -s 13 -n 418459 ./FFP_13/418459
./..
./..
./..
./FFP_compress -a -s 13 -n R990650 ./FFP_13/R990650
- Using 3 threads, -t 3
- Standard output to "16_items_13.matrix", is a symmetric matrix using [-s]
./JSD_maxtrix -s -t 3 ./FFP_13/* > 16_items_13.matrix
- You can use either BIONJ or NJ. However, BIONJ requires to input a symmetric matrix
- For this example, we use BIONJ provided here (http://www.atgc-montpellier.fr/bionj/) but any method that takes distance/divergence matrix works
./BIONJ 16_items_13.matrix 16_items_tree.newick
- For this example, we use ITOL (http://itol.embl.de/)
- Sample output:

