For the upcoming Career Day at my children's school.
Because children R the future, and if we don't teach them well, they'll end up hooked on dplyr or (shudder) pandas and matplotlib.
These scripts use a well-known children's story to illustrate data science tasks:
- Getting data: Use
data.tableto read raw text data (food.txt) into a wide data set - Transforming datasets: Uses
data.tableto melt data to long - Summarizing data: Uses
data.tableto count by group - Visualization: Uses
ggplot2to plot food eaten per day (coming soon) - Modeling: Build
glmmodels to predict how many items the caterpillar will eat (regression) and what kind of items (classification) (coming soon)
Disclaimer: This project uses plot elements of Eric Carle's The Very Hungry Caterpillar, a copyrighted work, under Fair Use for educational purposes. Please visit the Eric Carle website and buy his books for the little caterpillars in your life.
