This repository was submitted for The BrainHack School 2020 and QLSC612 Fundamentals of Neuro Data Science (McGill University and Dr. Jean-Baptiste Poline)
Running myanalysis.ipynb requires the following dependencies:
pandas(includingpandas.plotting)numpyrandomcsvscipy.statsmatplotlib.pyplot
- To install the required dependencies, use
pip install -r requirements.txt. - To run
myanalysis.ipynband print the expected outputs (statistics and figures), open the jupyter notebook using the commandjupyter notebookand run all kernels.
myanalysis.ipynb creates two continuous random variables: partY and partY2. Related t-test correlations (using the command scipy.stats.ttest_rel(data[{var1}], data[{var2}], nan_policy="omit")) are run on both random variables with each of the existing variables in the brainsize.csv file: FSIQ, VIQ, PIQ, Weight, Height, and MRI_Count.
The expected statistical test outputs are as follows:
| variable | t |
p |
|---|---|---|
FSIQ |
-10.91 | 2.05e-13 |
VIQ |
-10.99 | 1.66e-13 |
PIQ |
-10.93 | 1.93e-13 |
Weight |
-10.01 | 4.52e-12 |
Height |
-11.62 | 4.46e-14 |
MRI_Count |
79.44 | 9.47e-45 |
| variable | t |
p |
|---|---|---|
FSIQ |
-18.04 | 1.60e-20 |
VIQ |
-18.12 | 1.37e-20 |
PIQ |
-18.06 | 1.51e-20 |
Weight |
-16.82 | 6.44e-19 |
Height |
-18.51 | 1.31e-20 |
MRI_Count |
79.53 | 9.10e-45 |
It is clear from these two tables that each correlation with partY or partY2 is significant because all of the p-values with these two random variables are < 0.05. The negative t values mean that the sample mean was less than the hypothesized mean, while the positive t value for MRI_Count and partY or partY2 means that the sample mean was greater than the hypothesized mean. Plots of the most significant correlation for both partY and partY2 are below.
The expected figure outputs should look as follows:
- When plotting the statistical relationship between the most significant correlation for
partY, the figure should look like:
- When plotting the statistical relationship between the most significant correlation for
partY2, the figure should look like:

