9+ Years | 6 Publications in Top Journals | Roche/Genentech Bioinformatician
Transforming genomic data into insights through reproducible pipelines and AI-powered workflows
🌐 VISIT MY PORTFOLIO | Built with Quarto
I specialize in end-to-end genomic data analysis - from raw FASTQ files to publication-ready figures and clinical insights. My work bridges bioinformatics, machine learning, and precision medicine, with a focus on reproducible workflows that accelerate drug discovery and biomarker identification.
Tools: Seurat, edgeR, DESeq2, limma, crisprVerse, MAGeCK |
Tools: caret, scikit-learn, TensorFlow, Plumber APIs |
Tools: R, Python, Docker, Git, Quarto |
Founder & Owner | 🔗 actn3.pl | 💻 GitHub
Named after the ACTN3 "Speed Gene" - fast, efficient, high-performance bioinformatics.
ACTN3 Bioinformatics delivers genomic analysis services to biotech, pharma, and academic institutions:
- AI-powered genomic workflows (LLM-assisted analysis, automated QC, ML integration)
- Reproducible research infrastructure (Snakemake pipelines, Quarto documentation)
- Clinical trial bioinformatics (biomarker validation, patient stratification)
- Custom R package development
- Training & consulting (R/Bioconductor workshops, pipeline development)
# R/Bioconductor Ecosystem
✅ Differential Expression: limma-voom, edgeR, DESeq2
✅ Single-Cell Analysis: Seurat, Scanpy, UMAP, cell type annotation
✅ CRISPR Screens: crisprVerse, screenCounter, TMM normalization
✅ Pathway Analysis: fgsea, MSigDB, IPA, Metacore GeneGO
✅ ChIP-seq/ATAC-seq: BWA, peak calling, motif enrichment
✅ Machine Learning: caret, PAM, consensus NMF, predictive modeling
✅ Spatial Transcriptomics: 10X Visium, Xenium analysis# R Package Development
✅ Package structure (devtools, usethis, roxygen2)
✅ Documentation (pkgdown websites, vignettes)
✅ Testing (testthat, unit tests, CI/CD)
✅ CRAN/Bioconductor submission standards
# Workflow Management
✅ Pipelines: Snakemake, Nextflow, Bash scripting
✅ Reproducibility: RMarkdown, Quarto, knitr
✅ Version Control: Git, GitHub/GitLab, CI/CD
✅ Compute: HPC clusters, Docker, parallel processing
✅ Data Handling: FASTQ/BAM/VCF parsing, dplyr, data.table# Publication-Quality Graphics
✅ Static: ggplot2, ComplexHeatmap, UMAPs, volcano plots, survival curves etc.
✅ Interactive: Shiny apps, plotly, custom dashboards
✅ Reports: Quarto websites, dynamic RMarkdown, GitHub PagesReproducible Snakemake pipeline for ML-ready perturbation datasets
- Harmonized scRNA-seq data from CRISPR screens
- Balanced control/perturbation groups for unbiased ML training
- Automated QC reports with integrated visualizations
Tech: Snakemake, Python/Scanpy, Quarto, CI/CD
🔗 View Repository | 📖 Documentation
Comprehensive Quarto knowledge portal for R/Pharma 2025 workshops
- Structured documentation of tools, trends, and best practices
- Reproducible examples (Positron IDE, Shiny, Quarto workflows)
- Curated resources for pharmaceutical bioinformatics
- GitHub Pages deployment
Tech: Quarto, RMarkdown, GitHub Pages, modern R workflows
🔗 View Repository | 🌐 Live Site
Bioinformatics Role: CRISPR screen analysis (~160K sgRNAs), bulk RNA-seq (HTSeqGenie), ChIP-seq (BWA), TCGA integration
Methods: edgeR, limma-voom, TMM normalization, Cox survival models, COSMIC signatures
Impact: Identified ZFX as biomarker for immunotherapy response
2️⃣ T cell-dependent bispecific therapy enhances innate immunity | Cancer Immunology Research (2024)
Bioinformatics Role: NK cell RNA-seq (Smart-Seq V4), Differential gene expression (DGE) analysis, GSEA (fgsea)
Methods: HTSeqGenie, edgeR (logCPM), limma, MSigDB (c2/c5/c7), pathway enrichment
Impact: Discovered NK cell activation mechanism in bispecific antibody therapy
Bioinformatics Role: scRNA-seq (10X Genomics, 22 clusters), bulk RNA-seq, WES
Methods: ImmGen annotation, PANTHER enrichment, UMAPs, COSMIC signatures
Impact: Multi-modal analysis revealed EGFR/ERBB-mediated immune escape
4️⃣ Transcriptional subtypes resolve lung adenocarcinoma heterogeneity | Clinical Cancer Research (2021)
Bioinformatics Role: NMF subtype discovery (>800 patients), PAM classifier (113 genes), drug response modeling
Methods: Consensus NMF, PAM, Camera GSEA, Spearman correlations (526 compounds), survfit, Cox models
Impact: Clinical classifier for MEK inhibitor response prediction | 📊 Code
Bioinformatics Role: Microarray analysis (Affymetrix Rat Exon 1.0ST), qRT-PCR validation
Methods: RMA normalization, limma (moderated t-test), FDR correction
Impact: Identified PDE4 as therapeutic target for multiple sclerosis | 📊 Data: GSE50042
|
|
🤖 AI in Bioinformatics:
- LLM-assisted code generation and debugging
- AI-powered variant annotation
- Automated literature mining for pathway enrichment
🧬 Multi-Modal Omics:
- Spatial transcriptomics (10X Visium, Xenium)
- Single-cell + proteomics + epigenomics integration
- Multi-omic predictive modeling for precision medicine
🧪 Non-coding RNA:
- lncRNA, miRNA, circRNA in cancer & development
- Integration and correlation analysis between ncRNA and mRNA
- Identifying ncRNA regulators of protein-coding genes
- Regulatory network reconstruction
⚙️ Software Development:
- R package development (Bioconductor standards)
- Positron IDE + Quarto for literate programming
- Nextflow DSL2 + Wave containers for cloud scalability
- Git-based collaboration for reproducible science
🎯 CRISPR & Functional Genomics:
- Perturb-seq analysis pipelines
- Base editor/prime editor screen optimization
- Screen hit validation workflows
🏥 Precision Medicine:
- Real-time clinical decision support systems
- Pharmacogenomics + liquid biopsy integration
- Patient-specific therapy modelingCompany Contact (ACTN3 Bioinformatics):
Languages: English | Polish (Native)
Experience: 9+ years in pharma/biotech | International teams | Multiple time zones
| Service | Description | Timeline |
|---|---|---|
| 🧬 NGS Analysis | RNA-seq, scRNA-seq, ChIP-seq, CRISPR screens, WGS/WES | 2-8 weeks |
| 📦 R Package Development | Custom Bioconductor packages, documentation, testing, CRAN submission | 4-12 weeks |
| ⚙️ Pipeline Development | Snakemake/Nextflow workflows, Docker containers, HPC optimization | 4-12 weeks |
| 📊 Shiny Dashboards | Interactive data exploration tools, real-time monitoring | 2-6 weeks |
| 📖 Publication Support | Methods sections, supplementary analyses, figures | 2-6 weeks |
| 🎓 Training & Workshops | R/Bioconductor, reproducible workflows, package development | 1-3 days |
| Advantage | Description |
|---|---|
| Proven Track Record | 6 publications in top journals (iScience, Clinical Cancer Research, EMBO MM) |
| Pharma Experience | 9+ years at Roche/Genentech - understand industry needs & regulatory requirements |
| Fast Delivery | Established pipelines ensure quick turnaround without compromising quality |
| End-to-End Solutions | From FASTQ files to publication-ready figures & clinical insights |
| Software Engineering | Custom R packages, production ML systems, API deployment |
| AI Integration | LLM-assisted workflows, automated QC, model deployment |
| Publication-Ready | Reproducible code, detailed methods, GitHub repos, Quarto reports |
| Remote Expertise | 9+ years remote work with international teams across time zones |
| Confidentiality | NDAs, ISO compliance experience, pharma-grade data security |