This repository contains the code and scripts required to train a BERT model for sentiment analysis on financial news using the GPU node of Stanford GSB's Yen cluster. The model is trained on the Financial PhraseBank dataset from Kaggle and uses the `transformers` library together with PyTorch Lightning for streamlined training and evaluation.
Please refer to this RCpedia Topic Guide for details.
- Getting Started
- Repository Structure
- Training Script Details
- Usage
- Monitoring Training
- Output
- Authors
## Getting Started

Clone the repository to the Yens to run the example training script:
$ git clone https://github.com/gsbdarc/yens-gpu-demo.git
$ cd yens-gpu-demo

## Repository Structure

- `Sentences_AllAgree.txt`: dataset of 2,264 financial news headlines, each labeled with a sentiment (Malo et al., 2014).
- `finBERT-train.slurm`: Slurm script for training the BERT model, to be run on the Yens.
- `finBERT-train.py`: Python script to train the model on the Yens' GPU node.
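The Financial PhraseBank files store one headline per line with the sentiment appended after an `@` separator. A minimal loader sketch under that assumption (the `load_phrasebank` name and the label-to-integer mapping are illustrative, not taken from `finBERT-train.py`):

```python
# Sketch: parse Financial PhraseBank lines of the form "<headline>@<sentiment>".
# The "@" separator follows the Malo et al. (2014) release convention.
def load_phrasebank(path):
    texts, labels = [], []
    label_to_id = {"negative": 0, "neutral": 1, "positive": 2}  # illustrative mapping
    with open(path, encoding="latin-1") as f:  # encoding may need adjusting for your copy
        for line in f:
            line = line.strip()
            if not line:
                continue
            # rpartition keeps any "@" inside the headline itself intact
            text, _, sentiment = line.rpartition("@")
            texts.append(text)
            labels.append(label_to_id[sentiment])
    return texts, labels
```

The returned lists can be fed directly to a tokenizer and a `torch.utils.data.Dataset`.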
## Training Script Details

The training script `finBERT-train.py` performs the following:
- Defines a custom dataset that reads the financial news texts and their labels.
- Uses the BERT model from the `transformers` library for sentiment analysis.
- Trains the model with PyTorch Lightning's `Trainer` on the Yens' GPU.
## Usage

Modify the Slurm script to include your email address. Slurm will then email you useful metrics such as queue time, runtime, and CPU and RAM utilization, and will alert you if the job fails.
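The relevant Slurm directives might look like the following. The resource values here are illustrative placeholders, not copied from `finBERT-train.slurm`; keep the shipped script's values and edit only the mail line.

```shell
#!/bin/bash
#SBATCH --job-name=finBERT-train
#SBATCH --partition=gpu              # GPU partition on the Yens
#SBATCH --gres=gpu:1                 # illustrative: request one GPU
#SBATCH --time=2:00:00               # illustrative time limit
#SBATCH --output=finBERT-train.out
#SBATCH --mail-type=ALL              # email on job start, completion, and failure
#SBATCH [email protected]   # <-- replace with your email address

python finBERT-train.py
```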
Submit the Slurm script to initiate model training on the `gpu` partition of the Yens:
$ sbatch finBERT-train.slurm

## Monitoring Training

Monitor the training progress by checking the Slurm queue for your username:
$ squeue -u $USER

Instructions for monitoring GPU utilization and other training metrics can be found in the Topic Guide.
## Output

After training completes, check the output file `finBERT-train.out` for training and evaluation metrics:
$ cat finBERT-train.out

## Authors

- Stanford GSB DARC Team ([email protected])