Titanic-ML-Project

A machine learning project that predicts Titanic passenger survival using Python, pandas, and scikit-learn. It includes a full data preprocessing, model training, and evaluation pipeline.

Project Overview

The goal is to build a model that predicts the Survived outcome (1 = survived, 0 = did not survive) based on passenger characteristics such as age, class, and sex.

Key Steps

Data Loading — Imported train.csv, test.csv, and gender_submission.csv
Data Cleaning — Filled missing values with median and mode
Feature Encoding — Converted categorical features like Sex and Embarked into numerical values
Model Training — Used Logistic Regression to train and validate survival predictions
Evaluation — Measured accuracy and classification metrics on a validation set
Prediction — Generated survival predictions for the test set and saved them as submission.csv
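
The steps above can be sketched end to end as follows. This is a minimal illustration, not the notebook's actual code: it assumes the cleaning and encoding are done with scikit-learn's `SimpleImputer` and `OneHotEncoder` inside a `Pipeline` (the notebook may well do this directly in pandas), and it runs on a small synthetic table so that it works without the CSV files. The helper `make_toy_passengers`, the survival rule inside it, and the 75/25 split are all made up for the example.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

NUMERIC = ["Pclass", "Age", "SibSp", "Parch", "Fare"]
CATEGORICAL = ["Sex", "Embarked"]


def make_toy_passengers(n=200, seed=0):
    """Hypothetical stand-in for train.csv. This is NOT real Titanic data."""
    rng = np.random.default_rng(seed)
    df = pd.DataFrame({
        "PassengerId": np.arange(1, n + 1),
        "Pclass": rng.integers(1, 4, n),
        "Sex": rng.choice(["male", "female"], n),
        "Age": rng.uniform(1, 70, n).round(),
        "SibSp": rng.integers(0, 3, n),
        "Parch": rng.integers(0, 3, n),
        "Fare": rng.uniform(5, 100, n).round(2),
        "Embarked": rng.choice(["C", "Q", "S"], n),
    })
    # Invented rule so the model has a signal to learn.
    df["Survived"] = ((df["Sex"] == "female") | (df["Pclass"] == 1)).astype(int)
    # Blank out some values to mimic missing entries.
    df.loc[df.index[::7], "Age"] = np.nan
    df.loc[df.index[::50], "Embarked"] = np.nan
    return df


def build_model():
    """Median/mode imputation, one-hot encoding, then logistic regression."""
    categorical = Pipeline([
        ("impute", SimpleImputer(strategy="most_frequent")),  # mode
        ("encode", OneHotEncoder(handle_unknown="ignore")),
    ])
    preprocess = ColumnTransformer([
        ("num", SimpleImputer(strategy="median"), NUMERIC),
        ("cat", categorical, CATEGORICAL),
    ])
    return Pipeline([
        ("preprocess", preprocess),
        ("clf", LogisticRegression(max_iter=1000)),
    ])


# Data loading (synthetic here; with the real files use pd.read_csv("train.csv")).
train = make_toy_passengers()
X, y = train[NUMERIC + CATEGORICAL], train["Survived"]

# Hold out a validation set, then train and evaluate.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, random_state=42, stratify=y
)
model = build_model()
model.fit(X_train, y_train)
val_accuracy = model.score(X_val, y_val)
print(f"validation accuracy on toy data: {val_accuracy:.2f}")

# Predict for unlabeled rows and build the two-column submission table.
test = make_toy_passengers(n=20, seed=1).drop(columns="Survived")
submission = pd.DataFrame({
    "PassengerId": test["PassengerId"],
    "Survived": model.predict(test[NUMERIC + CATEGORICAL]),
})
# Pass a path such as "submission.csv" to write a file instead of a string.
csv_text = submission.to_csv(index=False)
```

Keeping the imputers and encoder inside the pipeline means they are fitted on the training split only and then reused unchanged for the validation and test rows. The accuracy printed here reflects the invented toy rule and says nothing about the real dataset.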

Model and Performance

Algorithm: Logistic Regression
Libraries: pandas, numpy, scikit-learn, seaborn, matplotlib

Typical validation accuracy: around 80%.
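
For reference, accuracy and the related classification metrics can be computed with `sklearn.metrics` along these lines. The labels and predictions below are invented for illustration (ten hypothetical validation passengers); they are not the project's results.

```python
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

# Hypothetical validation labels and predictions (1 = survived, 0 = did not survive).
y_val = [0, 0, 1, 1, 0, 1, 0, 1, 0, 0]
y_pred = [0, 0, 1, 0, 0, 1, 1, 1, 0, 0]

# Fraction of passengers whose outcome was predicted correctly.
accuracy = accuracy_score(y_val, y_pred)

# Rows are true classes, columns are predicted classes.
cm = confusion_matrix(y_val, y_pred)

print(f"accuracy: {accuracy:.2f}")
print(cm)
# Per-class precision, recall, and F1.
print(classification_report(y_val, y_pred, target_names=["did not survive", "survived"]))
```

Accuracy alone can hide an imbalance between the two classes, which is why the confusion matrix and per-class report are worth printing alongside it.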

Dataset Description

train.csv — Training data with labels (Survived)
test.csv — Test data without labels
gender_submission.csv — Sample submission file for Kaggle format
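
A minimal sketch of loading the three files with pandas. The helper name `load_titanic` and the `data_dir` argument are hypothetical, and it assumes the CSVs sit together in one directory (the repository root by default).

```python
from pathlib import Path

import pandas as pd


def load_titanic(data_dir="."):
    """Read the three CSV files and return them as DataFrames."""
    data_dir = Path(data_dir)
    train = pd.read_csv(data_dir / "train.csv")                  # has the Survived label
    test = pd.read_csv(data_dir / "test.csv")                    # no Survived column
    sample = pd.read_csv(data_dir / "gender_submission.csv")     # example submission layout
    return train, test, sample


# Usage, run from the directory that holds the files:
# train, test, sample = load_titanic()
# print(train.isna().sum())  # missing values per column
```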

Key features include:

| Feature | Description |
| --- | --- |
| `Pclass` | Ticket class (1 = 1st, 2 = 2nd, 3 = 3rd) |
| `Sex` | Passenger gender |
| `Age` | Passenger age in years |
| `SibSp` | Number of siblings/spouses aboard |
| `Parch` | Number of parents/children aboard |
| `Fare` | Ticket fare |
| `Embarked` | Port of embarkation (C = Cherbourg, Q = Queenstown, S = Southampton) |
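
As an illustration of how the two categorical features can be turned into numbers with plain pandas, here is a small sketch. The three rows are made up, and the specific choices (a 0/1 mapping for `Sex`, one indicator column per port for `Embarked`) are one common option, not necessarily what the notebook does.

```python
import pandas as pd

# Three made-up rows using the feature names listed above.
df = pd.DataFrame({
    "Pclass": [3, 1, 2],
    "Sex": ["male", "female", "female"],
    "Embarked": ["S", "C", "Q"],
})

# Two categories: a single 0/1 column is enough.
df["Sex"] = df["Sex"].map({"male": 0, "female": 1})

# Three unordered categories: one indicator column per port.
df = pd.get_dummies(df, columns=["Embarked"], dtype=int)

print(df.columns.tolist())
```

One-hot columns avoid implying an order among the ports, which a single integer code (for example C = 0, Q = 1, S = 2) would suggest to a linear model.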

Packages

NumPy / Pandas — Data wrangling
Matplotlib / Seaborn — Visualization
Scikit-Learn — Machine learning modeling
