SARS-CoV-2 Genomic Surveillance and RNASeqAnalysis (the input sequences are technically DNA) has two components:
- Variant Surveillance
- Epitope Surveillance
Variant Surveillance is an additional component of this project that monitors circulating variants. Run it with:
python src/SarsCov2Variants.py
This generates CSV files in output/12_variants and images in output/11_regions.
Note that you will need to download the TSV files from GISAID yourself, because we are not authorized to redistribute its datasets.
https://www.usacfi.net/covid-19-surveillance.html
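As a rough illustration of what monitoring circulating variants from a metadata TSV can look like, the sketch below counts sequences per month and variant using only the standard library. It is not the code in src/SarsCov2Variants.py. The function name `count_variants_by_month`, the column names `Collection date` and `Variant`, and the sample rows are all assumptions made for this example; check the headers of the file you actually downloaded.

```python
import csv
import io
from collections import Counter


def count_variants_by_month(tsv_text, date_col="Collection date", variant_col="Variant"):
    """Count sequences per (YYYY-MM, variant) pair in tab-separated metadata.

    The column names are assumptions; adjust them to match your TSV header.
    """
    counts = Counter()
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        date = (row.get(date_col) or "").strip()
        variant = (row.get(variant_col) or "").strip()
        # Skip rows that lack a year-month date or a variant label.
        if len(date) < 7 or not variant:
            continue
        counts[(date[:7], variant)] += 1
    return counts


# Made-up rows for illustration only (not real GISAID data).
sample = (
    "Accession ID\tCollection date\tVariant\n"
    "X1\t2021-12-03\tOmicron\n"
    "X2\t2021-12-15\tOmicron\n"
    "X3\t2021-12-20\tDelta\n"
    "X4\t2021\tDelta\n"
)

counts = count_variants_by_month(sample)
for (month, variant), n in sorted(counts.items()):
    print(month, variant, n)
```

Rows with a year-only date are skipped here rather than guessed at, which keeps the monthly totals conservative.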
Epitope Surveillance aligns multiple SARS-CoV-2 DNA sequences, trims them to the genes of interest, translates those genes to protein, and identifies mutations in epitope regions. Download the entire RNASeqAnalysis folder, then run:
python main.py -i "references/Sequences/omicron_variant" -m "references/Sequences/omicron_variant"
Optional parameters:
- -i: directory which contains the fasta files
- -m: directory which contains the metadata files
- -ref: reference genome name (Wuhan strain); the default is the name of the first genome in the input fasta file
- -aln: TRUE when the input needs alignment and FALSE when the input is already aligned
- -eps: input text file that lists the epitope regions
- -loc: input text file that indicates what and where are the relevant reference genes
- -g: fasta file with only the reference genome
- -o: output fasta file
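For readers who want to see how flags like these can be declared, here is a minimal argparse sketch. It is not the parser in main.py: the `dest` names, the helper `str_to_bool`, and the default for -aln are assumptions made for illustration. Only the flag names and their meanings come from the list above.

```python
import argparse


def str_to_bool(value):
    """Interpret TRUE/FALSE (any letter case) as a boolean."""
    text = value.strip().upper()
    if text == "TRUE":
        return True
    if text == "FALSE":
        return False
    raise argparse.ArgumentTypeError("expected TRUE or FALSE, got %r" % value)


def build_parser():
    """Build an illustrative parser; dest names and defaults are hypothetical."""
    parser = argparse.ArgumentParser(description="Epitope surveillance (illustrative)")
    parser.add_argument("-i", dest="input_dir", help="directory with the fasta files")
    parser.add_argument("-m", dest="metadata_dir", help="directory with the metadata files")
    parser.add_argument("-ref", dest="reference", default=None,
                        help="reference genome name; None means use the first genome")
    # The default of True is an assumption, not taken from main.py.
    parser.add_argument("-aln", dest="needs_alignment", type=str_to_bool, default=True,
                        help="TRUE if the input needs alignment, FALSE if already aligned")
    parser.add_argument("-eps", dest="epitopes", help="text file listing the epitope regions")
    parser.add_argument("-loc", dest="gene_locations", help="text file with reference gene locations")
    parser.add_argument("-g", dest="reference_fasta", help="fasta file with only the reference genome")
    parser.add_argument("-o", dest="output", help="output fasta file")
    return parser


args = build_parser().parse_args([
    "-i", "references/Sequences/omicron_variant",
    "-m", "references/Sequences/omicron_variant",
    "-aln", "FALSE",
])
print(args.input_dir, args.needs_alignment)
```

Parsing TRUE/FALSE through a small converter avoids the common pitfall where `type=bool` treats any non-empty string, including "FALSE", as true.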
Input files:
- sequences.fasta
- metadata.tsv
- Reference_gene_locations.txt
- NCBI Reference Sequence_NC_045512.2.fasta
- epitopes.txt
Output files:
- 01_aligned.fasta
- 02_trimmed.fasta
- 03_protein.fasta
- 04_mutations.fasta
- 05_aminoacid_replacements.csv
- 06_unique_mutations.csv
- 07_heatmap_[protein].html
- 08_table_[protein]_[#].html
- 09_mutation_geomap.html
- 10_geoplot_variants.html
- 11_regions
- 12_variants
- 13_filtered_seq_epitsurve.csv
FASTA files can be viewed in any alignment viewer, e.g. AliView.
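The trim, translate, and compare steps behind the trimmed, protein, and amino acid replacement outputs can be sketched on toy data as follows. This is not the project's implementation. The function names `trim`, `translate`, and `replacements`, the 1-based inclusive coordinates, and the toy sequences are assumptions for illustration, and the sketch presumes the two sequences are already aligned and of equal length.

```python
from itertools import product

# Standard genetic code, built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def trim(genome, start, end):
    """Return the gene at 1-based inclusive coordinates start..end."""
    return genome[start - 1:end]


def translate(dna):
    """Translate DNA codon by codon; gapped or ambiguous codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def replacements(ref_protein, sample_protein, epitope_start, epitope_end):
    """List substitutions such as 'N5Y' within 1-based residues epitope_start..epitope_end."""
    found = []
    for pos in range(epitope_start, epitope_end + 1):
        ref_aa, new_aa = ref_protein[pos - 1], sample_protein[pos - 1]
        if ref_aa != new_aa:
            found.append("%s%d%s" % (ref_aa, pos, new_aa))
    return found


# Toy sequences (hypothetical): an 18-base gene flanked by two bases on each side.
reference = "CC" + "ATGAAACCCGGGAATTTT" + "GG"
sample = "CC" + "ATGAAACCCGGGTATTTT" + "GG"

ref_protein = translate(trim(reference, 3, 20))
sample_protein = translate(trim(sample, 3, 20))
mutations = replacements(ref_protein, sample_protein, 4, 6)
print(ref_protein, sample_protein, mutations)
```

Mapping unknown codons to 'X' rather than raising an error lets gapped alignments pass through, at the cost of reporting 'X' as a replacement wherever a gap falls inside an epitope.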
- Running the program on Windows or Linux might yield an error when alignment is enabled (-aln TRUE), because the MUSCLE and MAFFT programs bundled in the folder are only compatible with macOS.
Solution: Download the free MUSCLE/MAFFT build for your OS. Alternatively, set -aln FALSE, but note that the program then assumes your input file is already aligned.
- An error might come up when you run the program with Python 2.
Solution: Convert the code to Python 2 or install Python 3 (recommended).
- An error might come up when any of the following Python modules is missing:
- argparse
- numpy
- pandas
- matplotlib
- itertools
- scipy
- plotly
- datetime
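Solution: Install the missing modules using pip from the command line/terminal: pip install [module_name]. Note that argparse, itertools, and datetime are part of the Python standard library and need no installation.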
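The checks above can be run up front with a short script like the one below. It is a sketch under stated assumptions and is not part of the project: the helper names `missing_modules` and `check_environment` are hypothetical, and looking for executables named `muscle` and `mafft` on PATH is an assumption, since the program may locate its bundled aligners differently. The script itself requires Python 3.

```python
import importlib.util
import shutil

# Modules from the list above that must be installed with pip.
THIRD_PARTY = ["numpy", "pandas", "matplotlib", "scipy", "plotly"]
# Modules from the list above that ship with Python.
STANDARD = ["argparse", "itertools", "datetime"]


def missing_modules(names):
    """Return the top-level module names that cannot be located, without importing them."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def check_environment():
    """Collect human-readable problems; an empty list means nothing was flagged."""
    problems = []
    for name in missing_modules(THIRD_PARTY + STANDARD):
        problems.append("missing module: %s (try: pip install %s)" % (name, name))
    # Assumption: an aligner installed for this OS is reachable as 'muscle' or 'mafft'.
    if shutil.which("muscle") is None and shutil.which("mafft") is None:
        problems.append("no muscle/mafft found on PATH; consider -aln FALSE with pre-aligned input")
    return problems


for problem in check_environment():
    print(problem)
```

Using `importlib.util.find_spec` reports a missing package without importing the ones that are present, so the check stays fast even when heavy libraries are installed.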

