I am excited to share my guest lecture for Department of Statistics at the University of Illinois STAT 447: Data Science Programming Methods. And thank you to Dirk Eddelbuettel for inviting me!
The talk was titled "Data Science: Street Fighting Statistics" and demonstrates two simple supervised modeling tasks in R.
A video of the lecture is here, the slides are here, and the code and data (and data attributions) are here.
Galton's height data from: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/T0HSJ1
Breast cancer data from:
“Data collection and sharing was supported by the National Cancer Institute (P01CA154292; U54CA163303), the Patient-Centered Outcomes Research Institute (PCS-1504-30370), and the Agency for Health Research and Quality (R01 HS018366-01A1). We thank the participating women, mammography facilities, and radiologists for the data they have provided for this study. You can learn more about the BCSC at: http://www.bcsc-research.org/."
https://www.bcsc-research.org/data/rf/documentation
https://www.bcsc-research.org/data/rf
https://www.bcsc-research.org/data/rf/risk-factor-dataset-download
https://www.bcsc-research.org/download_file/view/191/344