This repository contains solutions for the Analytical Data Science Programmer Coding Assessment.
The objective of this assessment was to demonstrate proficiency in:
- SDTM dataset creation using
{sdtm.oak} - ADaM dataset derivation using
{admiral} - Regulatory-style TLG (Tables, Listings, and Graphs) generation
- Reproducible, well-documented, and clean R programming practices
All scripts were executed in a clean R session, and log files are included as evidence of error-free execution.
Objective: Create SDTM DS domain using {sdtm.oak}.
Contents:
01_create_ds_domain.R— Script to generate the DS domainds.csv— Output SDTM dataset01_create_ds_domain.log— Log file confirming error-free execution
Objective: Create ADaM ADSL dataset using {admiral}.
Contents:
create_adsl.R— Script to generate ADSL datasetadsl.csv— Output ADaM datasetcreate_adsl.log— Log file confirming error-free execution
Objective: Generate regulatory-style AE summary outputs.
Contents:
01_create_ae_summary_table.R— Script for TEAE summary table02_create_visualizations.R— Script for AE visualizationsae_summary_table.html— FDA-style AE table outputae_severity_plot.png— AE severity distribution plottop10_ae_ci.png— Top 10 AEs with 95% Clopper-Pearson CIs- Log files for both scripts confirming error-free execution
Environment:
- R version ≥ 4.2.0
- Required packages:
- admiral
- sdtm.oak
- gtsummary
- ggplot2
- dplyr
- gt
Each script:
- Was executed in a clean R session
- Includes logging via
sink()for traceability - Produces reproducible outputs
Thank you for reviewing this submission.