Image Classification using AWS SageMaker

In this project, we utilize AWS Sagemaker to train an already pre-trained Resnet50 model capable of image classification on the given dataset for dog breed classification. Additionally, we incorporate Sagemaker profiling, debugger, hyperparameter tuning, and other commendable practices in machine learning engineering.

Project Set Up

This project was developed and tested AWS SageMaker. It was created from a starter file provided by udacity here.

AWS Execution Role:

The AWS execution role used for the project should have the following access:

AmazonSageMakerFullAccess
AmazonS3FullAccess

Dataset

The dataset provided is the dog breed classification dataset, accessible through this link. However, the project has been formulated in a manner that is not reliant on a specific dataset.

Access

Upload the data to an S3 bucket through the AWS Gateway so that SageMaker has access to the data.

!wget https://s3-us-west-1.amazonaws.com/udacity-aind/dog-project/dogImages.zip
!unzip dogImages.zip
unzip dogImages.zip
aws s3 sync dogImages/  s3://<default_s3_bucket>/data/

Hyperparameter Tuning

We utilize the pre-trained ResNet50 model from PyTorch, a convolutional neural network, to enable transfer learning. We then fine-tune this model using transfer learning techniques to classify dog breeds in images.

In this project, we focus on tuning two key parameters - the learning_rate and batch_size - as these impact both the model's accuracy and speed of conversion. The learning_rate falls within 0.001 to 0.1, while the batch_size can take on one of five values (32, 64, 128, 256, or 512).

To achieve the best results, we execute a hyperparameter tuning job that selects parameters from the search space, runs a training job, and then makes predictions. The primary objective of this process is to improve the Test Loss metric.

Ultimately, the optimal training hyperparameters will minimize the Test Loss metric.

Completed Hyperparameter Tuning Job 👇

Summery of the best training job 👇

Best hyperparameters 👇

batch_size: 32
learning_rate: 0.0034

Debugging and Profiling

We employed the SMDebug client library from Amazon SageMaker to facilitate model debugging and profiling. The Sagemaker debugger allows us to monitor our machine learning model's training performance, record training and evaluation metrics, and plot learning curves. Additionally, it can detect potential problems such as overfitting, overtraining, poor weight initialization, and vanishing gradients.

Underfitting occurs when the validation score does not improve over time, suggesting that the model is not learning enough from the data. On the other hand, overfitting refers to a situation in which training curve keeps improving while the validation curve is getting worse. Both of these issues can be addressed by tuning the hyperparameters, or by collecting more data samples.

Debugger Output 👇

To enhance the algorithm's performance, providing additional training time with an extended choice of hyperparameters would be beneficial.

Results

Increasing the training time can improve the algorithm's performance. Profiling the model revealed that the GPU on the compute instance was underutilized, likely due to the use of small batch size (32). We may want to consider either switching to a smaller instance type or increasing the batch size. This issue may have arisen because we used different tuning and model training instances.

profiler results can be found here

Model Deployment

To enable inference, the model was deployed to a Sagemaker endpoint using an ml.m5.large instance and The inference script is designed to accept an image URL as input.

The deployed endpoint can be queried using the predict function implemented in the notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.ipynb_checkpoints		.ipynb_checkpoints
ProfilerReport/profiler-output		ProfilerReport/profiler-output
screenshots		screenshots
scripts		scripts
.DS_Store		.DS_Store
LICENSE.txt		LICENSE.txt
README.md		README.md
train_and_deploy.html		train_and_deploy.html
train_and_deploy.ipynb		train_and_deploy.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Classification using AWS SageMaker

Project Set Up

AWS Execution Role:

Dataset

Access

Hyperparameter Tuning

Best hyperparameters 👇

Debugging and Profiling

Results

Model Deployment

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Image Classification using AWS SageMaker

Project Set Up

AWS Execution Role:

Dataset

Access

Hyperparameter Tuning

Best hyperparameters 👇

Debugging and Profiling

Results

Model Deployment

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages