A comparison of two RNN architectures, a bidirectional GRU and a bidirectional LSTM, on how they train on a text-categorization task.
Built with: PyTorch, Natural Language ToolKit (NLTK), Scikit-learn, Pandas, NumPy, Netron.app, Matplotlib
LSTM (https://netron.app/?url=https://github.com/Asymmetric-OG/NewsClass/raw/refs/heads/master/lstm.onnx)
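The two architectures differ only in the recurrent cell, so they can be sketched with a single classifier class. The sketch below is a minimal, hypothetical reconstruction (the real layer sizes, vocabulary, and number of categories live in classifier.ipynb and are assumptions here): an embedding layer feeds a bidirectional GRU or LSTM, and the concatenated forward/backward states of the last timestep feed a linear classification head.

```python
import torch
import torch.nn as nn

class BiRNNClassifier(nn.Module):
    """Sketch of the compared architectures: the same head on top of
    either a bidirectional GRU or a bidirectional LSTM encoder.
    All dimensions below are illustrative assumptions."""

    def __init__(self, vocab_size=10000, embed_dim=128, hidden_dim=64,
                 num_classes=10, cell="lstm"):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        rnn_cls = nn.LSTM if cell == "lstm" else nn.GRU
        self.rnn = rnn_cls(embed_dim, hidden_dim,
                           batch_first=True, bidirectional=True)
        # Bidirectional: forward and backward states are concatenated,
        # so the head sees 2 * hidden_dim features.
        self.fc = nn.Linear(hidden_dim * 2, num_classes)

    def forward(self, x):
        emb = self.embedding(x)        # (batch, seq, embed_dim)
        out, _ = self.rnn(emb)         # (batch, seq, 2 * hidden_dim)
        return self.fc(out[:, -1, :])  # logits from the last timestep

# Dummy batch of token ids: 4 sequences of length 20.
batch = torch.randint(0, 10000, (4, 20))
for cell in ("gru", "lstm"):
    logits = BiRNNClassifier(cell=cell)(batch)
    print(cell, tuple(logits.shape))  # both produce (4, 10) logits
```

Swapping `nn.GRU` for `nn.LSTM` is the only structural change between the two runs, which keeps the comparison fair.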
Evidently, the GRU overfits early and heavily and suffers from the vanishing-gradient problem, whereas the LSTM trains more stably owing to its better performance on longer sequences of text.
- Peak GRU validation accuracy: 66.2%
- Peak LSTM validation accuracy: 71.5%
- GRU validation accuracy over training: 60+% (overfits)
- LSTM validation accuracy over training: 60+% (generalises well)
This highlights the LSTM's ability to keep its gradients well-behaved over 50 epochs, whereas the GRU's gradients explode or vanish.
- Dataset.json: News Category Classification dataset.
- classifier.ipynb: The entire workflow.
- grumodel.onnx: Post-training GRU model for visualisation.
- lstm.onnx: Post-training LSTM model for visualisation.

