In December 2021, I underwent two major brain surgeries resulting in Subdural Acute Hematomas, leading to Traumatic Brain Injuries (TBIs). Following these surgeries, I was admitted to The Institute for Rehabilitation and Research Memorial Hermann (TIRR) for therapeutic services, marking a transformative period in my recovery journey. This experience ignited my interest in the intersection of healthcare and data science, prompting me to explore the TBIMS [Traumatic Brain Injury Model Systems] Datasets — valuable resources in traumatic brain injury research. The goal of this project is to leverage data-driven approaches, ultimately aiming to identify risk factors related to GCS [Glasgow Coma Scale] scores and make predictive models utilizing such factors. This documentation represents a work-in-progress account of my exploration into this interdisciplinary endeavor.
-
Definition: The GCS is a scale used to measure the extent of impaired consciousness in trauma patients, and it looks at 3 aspects of responsiveness: eyes, verbal, and motor; the scores from each add up to the total Glasgow Coma Score (3 - 15 points)
-
Eyes [eye-opening response]:
1. No eye opening 2. Eye opening to pain 3. Eye opening to sound 4. Eyes open spontaneously -
Verbal [verbal response to speech]:
1. No verbal response 2. Incomprehensible sounds 3. Inappropriate words 4. Confused 5. Orientated -
Motor [physical responses]:
1. No motor response 2. Abnormal extension to pain 3. Abnormal flexion to pain 4. Withdrawal from pain 5. Localizing pain 6. Obeys commands -
Interferences [these things can cause slight inaccuracies with the GCS test]:
* Language Barriers [Can’t appropriately respond to commands if you don’t understand them] * Hearing loss/speech impediment [Can’t speak or hear to respond appropriately] * Intellectual/Neurological deficit [Brain cannot fully process commands] * Intubation [part of the treatment process where the tube is placed in the mouth, so the patient can’t speak] * Sedation [part of the treatment process, where the patient is put to sleep for procedures] * Orbital/Cranial Fracture [Specific factors of injuries that prevent actions part of the GCS test] * Spinal Cord damage [Specific factors of injuries that prevent actions part of the GCS test] * Hypoxic-ischemic encephalopathy [Specific factors of injuries that prevent actions part of the GCS test]
- Significance of the GSC Test:
- GCS helps with the early management of patients and is critical in monitoring the clinical course of a patient and guiding changes in management
- Side note: Total score [sum of eyes, verbal, and motor] has some loss of information as it is a summary of the 3 and isn't so specific
-
Find a correlation between GCS and intoxicating substance use (drugs and alcohol), and from there make a predictor to predict GCS levels based on the amount of substance in the patient’s bloodstream, or if the patient has generally used these substances before.
a. The correlation between GCS and intoxicating substances isn’t too clear but looks present. [Evidence]
b. Variables: GCSTot vs. [ALCDrinks, Drugs, and MJUse]
-
GCSTot [GCS Total]:
3 - 15: Marked Score; 77: Chemically Paralyzed or Sedated; 88 - Intubated; 999 - Unknown; -
ALCDrinks [A drink is 1 can or bottle of beer, 1 glass of wine, 1 can or bottle of wine cooler, 1 cocktail, or 1 shot of liquor. On the days when you drank, about how many drinks did you drink on the average?]:
666 - Variable Did Not Exist; 777 - Refused; 888 - Not Applicable; 999 - Unknown; -
Drugs [During the year before the injury, did you use any illicit or non-prescription drugs?]:
0 - No; 1 - Yes; 66 - Variable Did Not Exist; 77 - Refused; 88 - Not applicable; 99 - Unknown; -
MJUse [Did you use marijuana?]:
0 - No; 1 - Yes; 66 - Variable Did Not Exist; 77 - Refused; 88 - Not applicable; 99 - Unknown;
c. Results:
i. Equations: ALCDrinks: y = -0.068x + 4.407 Drugs: y = -0.010x + 0.340 MJUse: y = -0.017x + 0.398 ii. R-Squared Value [Model’s Accuracy]: ALCDrinks: 0.006 (0.6%) Drugs: 0.011 (1.1%) MJUse: 0.031 (3.1%) iii. Summary: Higher Previous Substance Use ~ Lower GCS; but it is not very accurate, since R-squared values are so low. The alcohol one is most interesting since it isn’t just a yes/no response but has more variance.d. Graphs:
-
-
Find a correlation between GCS and age, and from there make a predictor to predict GCS levels based on the patient’s age.
b. Variables: GCSTot vs. AGEnoPHI
-
GCSTot [GCS Total]:
3 - 15: Marked Score; 77: Chemically Paralyzed or Sedated; 88 - Intubated; 999 - Unknown; -
AGEnoPHI [Age of the patient]:
777 - 89 Years Old or Older; 999 - Unknown;
c. Results:
i. Equations: y = 2.219x + 22.608 ii. R-Squared Value [Model’s Accuracy]: 0.189 (18.9%) iii. Summary: Lower Age ~ Lower GCS; but this might be an issue with fewer individuals even getting their GCS recorded at older ages from earlier deaths.d. Graph:
-
-
Find a correlation between GCS and the highest level of education attained, and from there make a predictor to predict GCS levels based on the patient’s highest education.
a. Variables: GCSTot vs. EDUCATION
-
GCSTot [GCS Total]:
3 - 15: Marked Score; 77: Chemically Paralyzed or Sedated; 88 - Intubated; 999 - Unknown; -
EDUCATION [Highest level of education that patient has received]:
1 - 8th Grade or Less; 2 - 9th - 11th Grade; 2.5 - GED; 3 - HS/GED; 3.5 - HS; 4 - Trade; 5 - Some College; 6 - Associate; 7 - Bachelors; 8 - Masters; 9 - Doctorate; 77 - Other; 999 - Unknown;
b. Results:
i. Equations: y = 0.069x + 3.388 ii. R-Squared Value [Model’s Accuracy]: 0.018 (1.8%) iii. Summary: Lower Education ~ Lower GCS; the r-squared value is really low, and looking at the scatter plot for it shows that there really are no differential things you can notice about each categoryc. Graph:
-
-
Find a correlation between GCS and BMI, and from there make a predictor to predict GCS levels based on the patient’s BMI.
b. Variables: GCSTot vs. BMI or BMICat
-
GCSTot [GCS Total]:
3 - 15: Marked Score; 77: Chemically Paralyzed or Sedated; 88 - Intubated; 999 - Unknown; -
BMI [BMI at injury]:
Free Form Data Entry Allowed -
BMICat [BMI Category]
1 - Very severely underweight; 2 - Severely underweight; 3 - Underweight; 4 - Normal; 5 - Overweight; 6 - Obese Class I; 7 - Obese Class II; 8 - Obese Class III;
c. Results:
i. Equations: BMI: y = 0.097 + 25.465 BMICat: y = 0.018x + 4.628 ii. R-Squared Value [Model’s Accuracy]: BMI: 0.005 (0.5%) BMICat: 0.005 (0.5%) iii. Summary: Lower BMI ~ Lower GCS; the r-squared value is really low, and looking at the scatter plot for it, I did not see any definite noticeable signs of a pattern. Also, in the scatterplot, Underweight to Obese Class III are the same.d. Graphs:
-
-
Find a correlation between GCS and speed of relative recovery, and from there make a predictor to predict GCS levels based on the patient’s speed of recovery.
b. Variables: GCSTot vs. TFCDays
-
GCSTot [Days From Injury to Follow Commands]:
3 - 15: Marked Score; 77: Chemically Paralyzed or Sedated; 88 - Intubated; 999 - Unknown; -
TFCDays [Days From Injury to Follow Commands]:
7777 - Patient Never Able to Follow Simple Motor Commands; 9999 - Unknown;
c. Results:
i. Equations: y = -1.135x + 18.253 ii. R-Squared Value [Model’s Accuracy]: 0.170 (17%) iii. Summary: Longer Recovery Time ~ Lower GCS, pretty intuitive understanding that more severe cases take longer to recover from, so might be pointless.d. Graph:
-
-
Find a correlation between the speed of relative recovery and GCS separate scores and total score, and from there make a predictor to predict the speed of relative recovery based on the patient’s GCS Scale’s separate scores and total score.
a. Variables: TFCDays vs. [GCSEye, GCSVer, GCSMot, and GCSTot]
-
TFCDays [Days From Injury to Follow Commands]:
7777 - Patient Never Able to Follow Simple Motor Commands; 9999 - Unknown; -
GCSEye [GCS Eye opening]:
1 - None; 2 - To Pain; 3 - To Voice; 4 - Spontaneous; 7 - Chemically Paralyzed or Sedated; 99 - Unknown; -
GCSVer [GCS Verbal]:
1 - None; 2 - Incomprehensible Sounds; 3 - Inappropriate Speech; 4 - Confused; 5 - Oriented; 7 - Chemically Paralyzed or Sedated; 8 - Intubated; 99 - Unknown; -
GCSMot [GCS Motor]:
1 - None; 2 - Extension to Pain; 3 - Flexion to Pain; 4 - Withdraws from Pain; 5 - Localizes Pain; 6 - Obeys Commands; 7 - Chemically Paralyzed or Sedated; 99 - Unknown; -
GCSTot [GCS Total]:
3 - 15: Marked Score; 77: Chemically Paralyzed or Sedated; 88 - Intubated; 999 - Unknown;
b. Results:
i. Equations: GCSEye: y = -0.04x + 2.933 GCSVer: y = -0.053x + 3.64 GCSMot: y = -0.055x + 5.235 GCSTot: y = -0.15x + 12.127 ii. R-Squared Value [Model’s Accuracy]: GCSEye: 0.136 (13.6%) GCSVer: 0.141 (14.1%) GCSMot: 0.146 (14.6%) GCSTot: 0.171 (17.1%) iii. Summary: Higher Score ~ Shorter Recovery Time, the accuracy of each one is about the same and nothing really stands out in the scatter graph, but there do seem to be places of potential for further digging in terms of patterns.d. Graphs:
-
- Selected the correlation between GCSTot and TFCDays because of the higher r-squared value compared to the other linear regression models and the graph which had more apparent patterns that had stronger potential to dive further with other variables.
- TBIMS Data Dictionary Explorer
- Glasgow Coma Scale - StatPearls - NCBI Bookshelf
- The Impact of Drug and Alcohol Intoxication on Glasgow Coma Scale Assessment in Patients with Traumatic Brain Injury
- Effect of Age on Glasgow Coma Scale in Patients with Moderate and Severe Traumatic Brain Injury: An Approach with Propensity Score-Matched Population - PMC
- The impact of body mass index changes on traumatic brain injury patients' outcomes during hospitalization
- Gait and Glasgow Coma Scale scores can predict functional recovery in patients with traumatic brain injury - PMC











