radhika-kataria/PATCH-pipeline

PAThogen CHaracterisation (PATCH) pipeline

A Nextflow implementation of the PATCH pipeline for processing host transcriptomic and genomic sequencing data.

The pipeline was written by the Cancer Bioinformatics and Translational Systems Biology group at King's College London, UK.

Pipeline summary - Pathogen characterisation

  1. After QC (FastQC, Trimmomatic), sequencing reads are aligned to the host reference genome (HISAT2 for transcriptomic data, bwa for whole genome sequencing data).
  2. Unaligned reads are extracted (SAMtools).
  3. The host-unmapped reads are assembled de novo (SPAdes).
  4. Pathogens are classified using three tools (Kraken2, BLASTn and Centrifuge), and the consensus of two or more tools is taken forward.
  5. Classified reads from the pathogen of interest are extracted and functionally annotated using BLASTn against indexed RefSeq transcripts/genomes of the pathogen of interest.
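
The consensus rule in step 4 can be illustrated with a minimal Python sketch. It is not taken from the pipeline code: the function name `consensus_taxon`, the dictionary layout and the taxon labels are hypothetical, and it assumes each tool's output has already been reduced to one taxon label per contig (or `None` when the tool made no call).

```python
from collections import Counter
from typing import Dict, Optional


def consensus_taxon(calls: Dict[str, Optional[str]], min_agree: int = 2) -> Optional[str]:
    """Return the taxon reported by at least `min_agree` tools, else None.

    `calls` maps a tool name to the taxon it assigned (None = unclassified).
    """
    # Count only real classifications; unclassified calls cast no vote.
    counts = Counter(taxon for taxon in calls.values() if taxon)
    if not counts:
        return None
    taxon, votes = counts.most_common(1)[0]
    return taxon if votes >= min_agree else None


# Hypothetical per-contig calls from the three classifiers.
calls_by_contig = {
    "contig_1": {"kraken2": "taxon_A", "blastn": "taxon_A", "centrifuge": None},
    "contig_2": {"kraken2": "taxon_A", "blastn": "taxon_B", "centrifuge": "taxon_C"},
}

# Keep only contigs where two or more tools agree.
consensus = {
    contig: taxon
    for contig, calls in calls_by_contig.items()
    if (taxon := consensus_taxon(calls)) is not None
}
print(consensus)
```

With three tools and a threshold of two, at most one taxon can reach the threshold, so the result is unambiguous. Comparing labels as plain strings assumes all three tools report at the same taxonomic rank; in practice the labels would need to be harmonised first (for example via taxonomy IDs).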

Pipeline summary - Pathogen integration

  1. A custom combined reference genome is created from the reference genomes of the host and the pathogen of interest (bwa).
  2. Whole genome sequencing data are aligned to the combined reference genome (bwa).
  3. Discordant read pairs, where one read maps to the pathogen of interest and its mate maps to the host reference genome, are extracted (SAMtools).
  4. Duplicate reads are removed and alignments are filtered by mapping quality (MAPQ score) (Picard tools, SAMtools).
  5. As before, classified reads from the pathogen of interest are extracted and functionally annotated using BLASTn against indexed RefSeq transcripts/genomes of the pathogen of interest.
  6. The coordinates of the discordant reads are extracted (Bedtools).
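
The selection logic behind steps 3 and 4 can be sketched in plain Python over SAM text records. The pipeline itself uses SAMtools and Picard for this; the sketch below only illustrates the idea. The function name, the reference name `pathogen_contig` and the MAPQ threshold of 30 are assumptions, not values from the pipeline, and any reference not listed as a pathogen reference is treated as host.

```python
from typing import Iterable, List, Set

# Standard SAM FLAG bits.
READ_UNMAPPED = 0x4
MATE_UNMAPPED = 0x8
DUPLICATE = 0x400


def discordant_host_pathogen(
    sam_lines: Iterable[str],
    pathogen_refs: Set[str],
    min_mapq: int = 30,  # assumed threshold, not taken from the pipeline
) -> List[str]:
    """Keep records whose read and mate map to different genomes (host vs pathogen)."""
    kept = []
    for line in sam_lines:
        if line.startswith("@"):  # skip SAM header lines
            continue
        fields = line.rstrip("\n").split("\t")
        flag, rname, mapq, rnext = int(fields[1]), fields[2], int(fields[4]), fields[6]
        # Drop unmapped reads, reads with unmapped mates, and marked duplicates.
        if flag & (READ_UNMAPPED | MATE_UNMAPPED | DUPLICATE):
            continue
        # "=" means the mate is on the same reference; "*" means no mate information.
        if rnext in ("=", "*"):
            continue
        if mapq < min_mapq:
            continue
        # Exactly one of the two mates must lie on a pathogen reference.
        if (rname in pathogen_refs) != (rnext in pathogen_refs):
            kept.append(line)
    return kept


# Hypothetical records: only r1 is a high-quality host/pathogen pair.
records = [
    "@SQ\tSN:chr1\tLN:1000",
    "r1\t97\tchr1\t100\t60\t4M\tpathogen_contig\t200\t0\tACGT\tFFFF",
    "r2\t99\tchr1\t100\t60\t4M\t=\t300\t204\tACGT\tFFFF",
    "r3\t1121\tchr1\t100\t60\t4M\tpathogen_contig\t200\t0\tACGT\tFFFF",
    "r4\t97\tchr1\t100\t5\t4M\tpathogen_contig\t200\t0\tACGT\tFFFF",
]
hits = discordant_host_pathogen(records, {"pathogen_contig"})
print([h.split("\t")[0] for h in hits])
```

Each record is filtered on its own MAPQ only, so a stricter version would also check the mate's mapping quality (for example via the optional `MQ` tag) before accepting the pair.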

Credits

Pipeline development and implementation by Radhika Kataria.

Study concept and design by Radhika Kataria, Anita Grigoriadis and Saeed Shoaie.
