datathon

Inspiration

We wanted to get into data science because we see the field as exciting, expanding, and relevant. Data science is a great way to tackle real-world challenges at the intersection of computer science and statistics.

What it does

Our project visualizes two aspects of adverse outcomes in the FDA's CFSAN Adverse Event Reporting System (CAERS) data, frequency and severity, to identify the most harmful products and product categories, both overall and across different demographics.
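A minimal sketch of how frequency and severity could be summarized per category with pandas. The column names (`category`, `outcome`), the sample rows, and the severity weights are all hypothetical and chosen only for illustration; they are not taken from the CAERS file or from our notebook.

```python
import pandas as pd

# Hypothetical severity weights; any real scoring would need to be justified.
SEVERITY = {"Non-serious": 1, "Hospitalization": 3, "Death": 5}

# Made-up sample rows standing in for adverse event reports.
reports = pd.DataFrame({
    "category": ["Supplements", "Supplements", "Cosmetics", "Supplements", "Cosmetics"],
    "outcome": ["Non-serious", "Hospitalization", "Non-serious", "Death", "Non-serious"],
})

# Map each outcome label to its numeric severity score.
reports["severity"] = reports["outcome"].map(SEVERITY)

# Frequency = number of reports per category; severity = mean score per category.
summary = (
    reports.groupby("category")
    .agg(frequency=("outcome", "size"), mean_severity=("severity", "mean"))
    .sort_values("frequency", ascending=False)
)
print(summary)
```

Keeping frequency and severity as separate columns lets the two be plotted side by side, so a rare but severe category is not hidden behind a common but mild one.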

How we built it

We used Jupyter Notebook to import, tidy, transform, and visualize the data and to communicate our findings. We relied on NumPy, pandas, regular expressions, Matplotlib, and seaborn to make this process more effective.

Challenges we ran into

We were unable to fully consolidate the data because the product names still contained formatting inconsistencies that we could not account for (e.g., "vitamind3" vs. "Vitamin D3"). Regular expressions got us partway there, but we ultimately continued to run into errors.
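One possible way to approach the "vitamind3" vs. "Vitamin D3" problem is to compare names by a normalized key instead of by their raw text. The sketch below is an illustration, not the code from our notebook: `name_key` and `group_variants` are hypothetical helpers, and the sample names are made up apart from the two quoted above.

```python
import re
from collections import defaultdict

def name_key(raw_name: str) -> str:
    """Build a comparison key: lowercase, keeping only letters and digits."""
    return re.sub(r"[^a-z0-9]+", "", raw_name.lower())

def group_variants(names):
    """Group raw product names that share the same comparison key."""
    groups = defaultdict(list)
    for name in names:
        groups[name_key(name)].append(name)
    return dict(groups)

# Illustrative names only; the last three are invented for this example.
sample_names = ["vitamind3", "Vitamin D3", "VITAMIN D-3", "Fish Oil", "fish  oil"]
groups = group_variants(sample_names)

for key, variants in groups.items():
    print(key, "->", variants)
```

This only handles differences in case, spacing, and punctuation. It does not catch misspellings or abbreviations, and it could merge names whose punctuation actually distinguishes two products, so the resulting groups would still need a manual check.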

Accomplishments that we're proud of

No one on our team had any prior experience with data science, so we are proud that we were able to draw such valuable insights in such a short period of time. We are glad we were able to make strides in this area of computing, and we are excited to try more of it in the future.

What we learned

We learned how to clean data effectively, use pandas and regex, visualize with seaborn and Matplotlib, and draw conclusions to make informed decisions.

What's next for CAERSDataAnalysis

We want to clean the data further, analyze more demographics, explore unique intersections within the dataset, and join this dataset with another (e.g., income data or geospatial data) to find other relationships.
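As a rough sketch of the join step, pandas `merge` can attach a second table and flag rows that found no match. Everything here is assumed for illustration: the shared `region` key, the `median_income` column, and all values are made up, and whether the CAERS data contains a usable join key at all is an open question.

```python
import pandas as pd

# Made-up report rows with a hypothetical shared key column, `region`.
reports = pd.DataFrame({
    "report_id": [1, 2, 3],
    "region": ["A", "B", "C"],
})

# Made-up lookup table standing in for an external income dataset.
income = pd.DataFrame({
    "region": ["A", "B"],
    "median_income": [52000, 61000],
})

# Left join keeps every report; validate guards against duplicate keys in `income`;
# indicator adds a `_merge` column showing which rows found a match.
merged = reports.merge(
    income, on="region", how="left", validate="many_to_one", indicator=True
)

# Reports whose region has no income row, worth inspecting before any analysis.
unmatched = merged[merged["_merge"] == "left_only"]
print(unmatched)
```

A left join with the indicator column makes missing matches visible instead of silently dropping reports, which matters when the key columns have the same kind of formatting inconsistencies as the product names.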

