This portfolio is a compilation of notebooks that I created for data analysis, data exploration, and the use of machine learning. Categories are seperated for ease of access. Most projects are done using Kaggle kernels.
Developed a predictive model for house prices based on square footage and bedroom count. Extracted and cleaned data from Kaggle's open-source datasets. Conducted statistical analysis and implemented multiple linear regression. Displayed use of descriptive statistics and exploratory data analysis. Effectively visualized data findings using Matplotlib. Code can be found here.
Using regression to predict a car's selling price based on multiple explanatory variables. Code can be found here here.
Predicting a students GPA using their SAT score. Introductory to sklearn. Code can be found here.
Parsed and pulled data from NBA stats website. Performed data cleaning and exploratory data analysis to find some patterns in the data. Wanted to explore which stats showed heavy correlation, the minutes played distribution, and to compare the stats percentage change from regular season to postseason. Code can be found here
Worked with a 4.2 million row dataset pertaining to variables about car accidents within the United States. Performed data preparation, cleaning, exploratory data analysis and visualization of the data. Code can be found here.
Identify trends among employment ratio to the population based on many factors such as education, ethnicity and gender. Code can be found here.
Cleaned and reformatted data for increased usability. Conducted Exploratory data analysis and answered inquiries about the data with visualizations. Code can be found here.
Observes the amount of bikers in the London area with correlation to other factors such as weather conditions, season, and time of the week. Data preparation for visualization in Tableau. Demonstrates proficiency in data wrangling and cleaning techniques. Code can be found here.
The objective was to observe and analyze crypto prices using the API coinmarketcap.com allows. Code can be found here.
Pulls daily data from a github page to track statistics of covid cases. Displays data in an easy to view manner to filter by state. Github page is here.