- This script takes genpept file and fasta sequences as input
- Remove sequence duplicates and extracts the longest isoforms for each gene (useful in ortholog analysis)
Usage: ./extract_oneProt_PerGene.sh Organism_Refseq.gp Organism_RefSeq.fasta ## Input filename should not have more than one period (.)