Project Training and Profiling Guide

This README outlines the procedure for training neural network models on NVIDIA GPUs, capturing performance metrics through profiling, and visualizing this data using roofline models.

Getting Started

Accessing NYU HPC and Dataset Preparation OR Use Own GPU

Open Command Prompt: Start by opening your command prompt or terminal on your local machine.
SSH into NYU HPC Gateway: ssh @gw.hpc.nyu.edu
Connect to Greene Cluster: ssh @greene.hpc.nyu.edu
Access ImageNet Dataset: Obtain access to the ImageNet dataset on HPC and create a manageable subset for your experiments.
Mount Subset on Burst: Ensure the created subset is mounted on Burst for efficient access during training.

Environment Setup

SSH into Burst: ssh burst
Prepare the Environment: Load necessary modules for GPU access and singularity containers as per HPC documentation.

Profiling Preparation

Use my imagenet code file from this repository
Modify for Profiling: Integrate profiling commands into the training script as needed for detailed analysis.

Profiling Execution

Run Profiling Command: Example for ResNet18 on V100 GPU: ncu --profile-from-start off --metrics gpu__time_duration.sum,dram__bytes_read.sum,dram__bytes_write.sum,smsp__sass_thread_inst_executed_op_fadd_pred_on.sum,smsp__sass_thread_inst_executed_op_fmul_pred_on.sum,smsp__sass_thread_inst_executed_op_ffma_pred_on.sum --csv --page raw --log-file resnet18-v100.csv --target-processes all python main.py --arch resnet18 --epochs 1 --batch-size 4 --dummy --gpu 0 Adjust commands for other models (e.g., AlexNet) and GPUs (e.g., A100) as needed.

Retrieving CSV File

Download CSV File: Use scp to transfer the CSV file from HPC to your local system: scp @greene.hpc.nyu.edu:/path/to/resnet18-v100.csv /local/path

Data Analysis with Roofline Modeling

Upload CSV to Google Colab: Transfer the CSV files to Google Colab for analysis.
Plot Roofline Model: Utilize existing or create new Colab notebooks to visualize the performance data using roofline models.

This process facilitates a detailed comparison of neural network model performances across different hardware, enabling the identification of optimization opportunities.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Estimation_Using_Thop.ipynb		Estimation_Using_Thop.ipynb
Imagenet_analysis.pdf		Imagenet_analysis.pdf
README.md		README.md
Roofline_modeling.ipynb		Roofline_modeling.ipynb
alexnet_a100_m.csv		alexnet_a100_m.csv
alexnet_v100_m.csv		alexnet_v100_m.csv
imagenet_code.py		imagenet_code.py
resnet_a100_m.csv		resnet_a100_m.csv
resnet_v100_m.csv		resnet_v100_m.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Training and Profiling Guide

Getting Started

Accessing NYU HPC and Dataset Preparation OR Use Own GPU

Environment Setup

Profiling Preparation

Profiling Execution

Retrieving CSV File

Data Analysis with Roofline Modeling

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project Training and Profiling Guide

Getting Started

Accessing NYU HPC and Dataset Preparation OR Use Own GPU

Environment Setup

Profiling Preparation

Profiling Execution

Retrieving CSV File

Data Analysis with Roofline Modeling

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages