SD_data_contest_education_3-10

2015 San Diego Data Analysis Contest

What is the demographic profile of the typical ELA student?

machine learning/regression analysis

What are the most important correlations of CELDT and with demographic factors.

correlations

What are the strongest predictors of performance on the CELDT and ELA test?

machine learning (predictive model)

Which schools perform better on the CST ELA than would be predicted by their demographic factors. Break out the performance of RFEP ( Reclassified as English Fluent Proficient ) students, if possible.

We need to answer the previous question first

Do schools with a high or low density of ELL students have better or worse performance than you would predict?

We need to answer the previous question first

Present any interesting or notable patterns. Are some schools better than others at preparing formerly ELL students to do well in school after they have been reclassified as proficient?

Is there a way to track specific students?

Do language learners of some languages have an easier or more difficult time learning English than native speakers of other languages? (eg. do Spanish speakers learn English quicker than say speakers of Kurdish)?

We can look at CELDT tests across primary language

Can we see if families utilize other social services available (tutoring, food assistance, community groups, etc.), and if so, does this type of community involvement have an effect on English learning ability?

We can look at the number of students getting school lunches and the EL tests

How to marry the files

All files have a cds_code column, with each school being associated with the same code across all files.

Each row needs to be one school with all of the data associated with that school. To create only one row for each school we need to transform data files by taking multiple rows and making them one. In order to do this and still track the necessary components we need to take each of the columns (listed below for each specific document), code the column into the rest of the column names, and then combine the columns. We should probably talk about this over the phone.

use: api_2012

celdt

subgroup ID 00
overall performance level 0
test purpose 0
grade 00 000000_columnName

star

subgroup 000
grade 00
test_id 00 0000000_columnName

enroll_2013

ethnic 0
gender L 0L_columnName

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
LICENSE		LICENSE
README.md		README.md
education_dataAnalysis_2015.ipynb		education_dataAnalysis_2015.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SD_data_contest_education_3-10

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

SD_data_contest_education_3-10

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages