This repository contains bioinformatics scripts and pipelines for analyzing oncology-related genomic data, with a focus on single-cell RNA sequencing (scRNA-seq) and whole-exome sequencing (WES) analyses.
-
ScRNAseq/: Single-cell RNA sequencing analysis scriptscluster_cells.py: Advanced cell clustering with quality assessment using scanpyCNV.py: Copy number variation analysis using infercnvpycrisper.py: CRISPR UMI data processing, normalization and guide RNA enrichment analysismast.R: Seurat-based scRNA-seq analysis using the MAST differential expression frameworkml_models.py: Machine learning models for treatment response prediction using scVI for latent representationscanpy.py: Standard scanpy-based scRNA-seq analysis pipeline with quality controlseurat.R: Comprehensive Seurat-based workflow for cancer scRNA-seq with cell cycle analysisspatial_analysis.py: Spatial transcriptomics processing and visualization using Scanpy/Visium
-
WES/: Whole-exome sequencing analysis scriptsjwes.sh: Shell script for WES data processing
- Cancer genomics: Tools optimized for tumor heterogeneity and copy number analysis
- Single-cell analytics: Implementation of best practices for scRNA-seq quality control and clustering
- Treatment response: ML integration for predictive modeling from transcriptomic signatures
- Spatial analysis: Support for spatial transcriptomics in tumor microenvironment studies
- Reproducible workflows: Command-line interface with standardized parameters
- Languages: Python, R, Shell
- Core libraries:
- Python: scanpy, scVI, infercnvpy, scikit-learn, PyTorch
- R: Seurat, MAST, dplyr, ggplot2