Skip to content

ACTN3/ACTN3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧬 Szymon Myrta

Bioinformatics Expert & AI Workflows Architect

LinkedIn ORCID Email Website

9+ Years | 6 Publications in Top Journals | Roche/Genentech Bioinformatician

Transforming genomic data into insights through reproducible pipelines and AI-powered workflows

🌐 VISIT MY PORTFOLIO | Built with Quarto


💡 What I Do

I specialize in end-to-end genomic data analysis - from raw FASTQ files to publication-ready figures and clinical insights. My work bridges bioinformatics, machine learning, and precision medicine, with a focus on reproducible workflows that accelerate drug discovery and biomarker identification.

Core Expertise

🧬 NGS Analysis

  • RNA-seq (bulk & single-cell)
  • ChIP-seq, ATAC-seq, CUT&RUN
  • CRISPR/ORF screens (~160K sgRNAs)
  • scTCR/scBCR-seq repertoires
  • WGS/WES variant calling
  • Spatial transcriptomics (10X Visium)
  • Long-read sequencing (PacBio, Nanopore)

Tools: Seurat, edgeR, DESeq2, limma, crisprVerse, MAGeCK

🤖 AI & Machine Learning

  • ML-ready dataset preparation
  • Feature engineering & normalization
  • Multi-omics integration
  • Consensus NMF, PAM classifiers
  • Automated QC systems

Tools: caret, scikit-learn, TensorFlow, Plumber APIs

⚙️ Software Development

  • R package development
  • Snakemake/Nextflow pipelines
  • Docker containerization
  • HPC cluster optimization
  • CI/CD integration (GitHub Actions)
  • Shiny dashboards

Tools: R, Python, Docker, Git, Quarto


🏢 About ACTN3 Bioinformatics

Founder & Owner | 🔗 actn3.pl | 💻 GitHub

Named after the ACTN3 "Speed Gene" - fast, efficient, high-performance bioinformatics.

ACTN3 Bioinformatics delivers genomic analysis services to biotech, pharma, and academic institutions:

  • AI-powered genomic workflows (LLM-assisted analysis, automated QC, ML integration)
  • Reproducible research infrastructure (Snakemake pipelines, Quarto documentation)
  • Clinical trial bioinformatics (biomarker validation, patient stratification)
  • Custom R package development
  • Training & consulting (R/Bioconductor workshops, pipeline development)

🛠️ Technical Stack

R Python Bash Docker Git Linux

📊 Bioinformatics Core

# R/Bioconductor EcosystemDifferential Expression: limma-voom, edgeR, DESeq2Single-Cell Analysis: Seurat, Scanpy, UMAP, cell type annotationCRISPR Screens: crisprVerse, screenCounter, TMM normalizationPathway Analysis: fgsea, MSigDB, IPA, Metacore GeneGOChIP-seq/ATAC-seq: BWA, peak calling, motif enrichmentMachine Learning: caret, PAM, consensus NMF, predictive modelingSpatial Transcriptomics: 10X Visium, Xenium analysis

🔧 Software Engineering

# R Package Development
✅ Package structure (devtools, usethis, roxygen2)
✅ Documentation (pkgdown websites, vignettes)
✅ Testing (testthat, unit tests, CI/CD)
✅ CRAN/Bioconductor submission standards

# Workflow Management
✅ Pipelines: Snakemake, Nextflow, Bash scripting
✅ Reproducibility: RMarkdown, Quarto, knitr
✅ Version Control: Git, GitHub/GitLab, CI/CD
✅ Compute: HPC clusters, Docker, parallel processing
✅ Data Handling: FASTQ/BAM/VCF parsing, dplyr, data.table

📈 Visualization & Deployment

# Publication-Quality GraphicsStatic: ggplot2, ComplexHeatmap, UMAPs, volcano plots, survival curves etc.Interactive: Shiny apps, plotly, custom dashboardsReports: Quarto websites, dynamic RMarkdown, GitHub Pages

📂 Featured Open-Source Projects

Reproducible Snakemake pipeline for ML-ready perturbation datasets

  • Harmonized scRNA-seq data from CRISPR screens
  • Balanced control/perturbation groups for unbiased ML training
  • Automated QC reports with integrated visualizations

Tech: Snakemake, Python/Scanpy, Quarto, CI/CD

🔗 View Repository | 📖 Documentation


Comprehensive Quarto knowledge portal for R/Pharma 2025 workshops

  • Structured documentation of tools, trends, and best practices
  • Reproducible examples (Positron IDE, Shiny, Quarto workflows)
  • Curated resources for pharmaceutical bioinformatics
  • GitHub Pages deployment

Tech: Quarto, RMarkdown, GitHub Pages, modern R workflows

🔗 View Repository | 🌐 Live Site


📚 Publications & Research Impact

Research Metrics

Publications Patients CRISPR IF>10

1️⃣ Transcription factor Zfx regulates tumor immune evasion | iScience (2025)

Bioinformatics Role: CRISPR screen analysis (~160K sgRNAs), bulk RNA-seq (HTSeqGenie), ChIP-seq (BWA), TCGA integration
Methods: edgeR, limma-voom, TMM normalization, Cox survival models, COSMIC signatures
Impact: Identified ZFX as biomarker for immunotherapy response

DOI


2️⃣ T cell-dependent bispecific therapy enhances innate immunity | Cancer Immunology Research (2024)

Bioinformatics Role: NK cell RNA-seq (Smart-Seq V4), Differential gene expression (DGE) analysis, GSEA (fgsea) Methods: HTSeqGenie, edgeR (logCPM), limma, MSigDB (c2/c5/c7), pathway enrichment
Impact: Discovered NK cell activation mechanism in bispecific antibody therapy

DOI


3️⃣ ERBB signalling in KRAS-driven lung cancer immune evasion | bioRxiv (2023)

Bioinformatics Role: scRNA-seq (10X Genomics, 22 clusters), bulk RNA-seq, WES
Methods: ImmGen annotation, PANTHER enrichment, UMAPs, COSMIC signatures
Impact: Multi-modal analysis revealed EGFR/ERBB-mediated immune escape

DOI


4️⃣ Transcriptional subtypes resolve lung adenocarcinoma heterogeneity | Clinical Cancer Research (2021)

Bioinformatics Role: NMF subtype discovery (>800 patients), PAM classifier (113 genes), drug response modeling
Methods: Consensus NMF, PAM, Camera GSEA, Spearman correlations (526 compounds), survfit, Cox models
Impact: Clinical classifier for MEK inhibitor response prediction | 📊 Code

DOI


5️⃣ Phosphodiesterase-4 inhibition promotes CNS remyelination | EMBO Molecular Medicine (2013)

Bioinformatics Role: Microarray analysis (Affymetrix Rat Exon 1.0ST), qRT-PCR validation
Methods: RMA normalization, limma (moderated t-test), FDR correction
Impact: Identified PDE4 as therapeutic target for multiple sclerosis | 📊 Data: GSE50042

DOI


6️⃣ + 1 Ongoing | In preparation (2025)

Full Publication List: ORCID


🎯 What I Bring to Your Project

For Biotech/Pharma Companies

  • Clinical Trial Support: Biomarker discovery, patient stratification, companion diagnostics
  • Target Validation: CRISPR screen analysis, drug response modeling, pathway mapping
  • Regulatory-Ready Deliverables: ISO-compliant documentation, reproducible workflows, audit trails
  • Typical Timelines:
    • RNA-seq: 2-4 weeks
    • scRNA-seq: 4-8 weeks
    • CRISPR screens: 3-6 weeks

For Startups & Academic Labs

  • Pipeline Development: Snakemake/Nextflow workflows, Docker containers, HPC optimization
  • R Package Development: Custom packages, documentation, CRAN submission
  • Shiny Dashboards: Interactive data exploration tools
  • AI Integration: Feature engineering for ML models, automated QC, model deployment
  • Multi-Omics Integration: Transcriptomics + genomics + epigenomics analyses
  • Publication Support: Methods sections, supplementary analyses, figure preparation
  • Training: Workshops on R/Bioconductor, reproducible research, GitHub workflows

🌐 What Excites Me (2025 & Beyond)

🤖 AI in Bioinformatics:
  - LLM-assisted code generation and debugging
  - AI-powered variant annotation
  - Automated literature mining for pathway enrichment

🧬 Multi-Modal Omics:
  - Spatial transcriptomics (10X Visium, Xenium)
  - Single-cell + proteomics + epigenomics integration
  - Multi-omic predictive modeling for precision medicine

🧪 Non-coding RNA:
  - lncRNA, miRNA, circRNA in cancer & development
  - Integration and correlation analysis between ncRNA and mRNA
  - Identifying ncRNA regulators of protein-coding genes
  - Regulatory network reconstruction

⚙️ Software Development:
  - R package development (Bioconductor standards)
  - Positron IDE + Quarto for literate programming
  - Nextflow DSL2 + Wave containers for cloud scalability
  - Git-based collaboration for reproducible science

🎯 CRISPR & Functional Genomics:
  - Perturb-seq analysis pipelines
  - Base editor/prime editor screen optimization
  - Screen hit validation workflows

🏥 Precision Medicine:
  - Real-time clinical decision support systems
  - Pharmacogenomics + liquid biopsy integration
  - Patient-specific therapy modeling

📫 Let's Collaborate

Available for Remote Projects | Consulting | Long-term Contracts

Personal Contact:
Email LinkedIn

Company Contact (ACTN3 Bioinformatics):
Website GitHub Email

Languages: English | Polish (Native)
Experience: 9+ years in pharma/biotech | International teams | Multiple time zones


💼 Professional Services

Service Description Timeline
🧬 NGS Analysis RNA-seq, scRNA-seq, ChIP-seq, CRISPR screens, WGS/WES 2-8 weeks
📦 R Package Development Custom Bioconductor packages, documentation, testing, CRAN submission 4-12 weeks
⚙️ Pipeline Development Snakemake/Nextflow workflows, Docker containers, HPC optimization 4-12 weeks
📊 Shiny Dashboards Interactive data exploration tools, real-time monitoring 2-6 weeks
📖 Publication Support Methods sections, supplementary analyses, figures 2-6 weeks
🎓 Training & Workshops R/Bioconductor, reproducible workflows, package development 1-3 days

📧 Contact for Custom Quote


Why ACTN3 Bioinformatics?

Advantage Description
Proven Track Record 6 publications in top journals (iScience, Clinical Cancer Research, EMBO MM)
Pharma Experience 9+ years at Roche/Genentech - understand industry needs & regulatory requirements
Fast Delivery Established pipelines ensure quick turnaround without compromising quality
End-to-End Solutions From FASTQ files to publication-ready figures & clinical insights
Software Engineering Custom R packages, production ML systems, API deployment
AI Integration LLM-assisted workflows, automated QC, model deployment
Publication-Ready Reproducible code, detailed methods, GitHub repos, Quarto reports
Remote Expertise 9+ years remote work with international teams across time zones
Confidentiality NDAs, ISO compliance experience, pharma-grade data security

🚀 From Raw Sequencing Data to Clinical Insights

Reproducible | Scalable | Publication-Ready

Portfolio

Built with Quarto | Hosted on GitHub Pages


"Transforming genomic data into insights through reproducible pipelines and AI-powered workflows"


Profile Views

About

I specialize in end-to-end genomic data analysis - from raw FASTQ files to publication-ready figures and clinical insights. My work bridges bioinformatics, machine learning, and precision medicine, with a focus on reproducible workflows that accelerate drug discovery and biomarker identification.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages