My multilayer perceptron implementation in C++ to recognize handwritten digits from the MNIST database.
The code was developed and tested on Linux Mint 19.1, g++ version 7.3.0. Requirements:
- C++ compiler with C++14 support
- cmake
The MNIST database is loaded into float vectors and can be preprocessed with the Preprocessor class. In the current version only a simple normalization by the maximum pixel value is applied.
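As a sketch of this normalization step (the function name is illustrative, not the actual Preprocessor API), each pixel is divided by the maximum 8-bit value, 255:

```cpp
#include <vector>

// Hypothetical helper mirroring the Preprocessor's normalization step:
// scale every pixel from [0, 255] down to [0, 1].
std::vector<float> normalize(std::vector<float> image) {
    const float maxValue = 255.0f; // maximum 8-bit pixel value
    for (float &pixel : image) {
        pixel /= maxValue;
    }
    return image;
}
```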
Several activation functions are currently available:
- relu
- sigmoid
- tanh
- gauss
- bent
- softplus
- sinusoid
- inverse square root linear unit (ISRLU)
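For reference, a few of the listed activations can be sketched as plain functions (these are illustrative definitions, not the library's internal code):

```cpp
#include <cmath>

// Rectified linear unit: max(0, x).
float relu(float x) { return x > 0.0f ? x : 0.0f; }

// Logistic sigmoid: 1 / (1 + e^(-x)).
float sigmoid(float x) { return 1.0f / (1.0f + std::exp(-x)); }

// ISRLU: x for x >= 0, otherwise x / sqrt(1 + a*x^2) with a > 0.
float isrlu(float x, float a = 1.0f) {
    return x >= 0.0f ? x : x / std::sqrt(1.0f + a * x * x);
}
```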
The number of layers is not limited. To add a new layer, use one of the following:

```cpp
template <class ltype>
void MLP::addLayer(int outputN, int inputN, Activation afuncType);

template <class ltype>
void MLP::addLayer(int outputN, Activation afuncType);
```

Example (the inputN argument is optional):

```cpp
mlp->addLayer<Dense>(192, Activation::relu);
```

Currently only the `Dense` type of layer is implemented.
The network is initialized with the compile function:

```cpp
void MLP::compile(const float &learningRate, const float &momentum);
```

Finally, the training and validation:
```cpp
void MLP::trainNetwork(vector<vector<float>> &train_x, vector<float> &train_y,
                       int epochs, int nbatch = 1);
void MLP::validateNetwork(vector<vector<float>> &test_x, vector<float> &test_y);
```

In addition, the trained network can be saved and loaded:
```cpp
void MLP::saveNetwork(string fileName);
void MLP::loadNetwork(string fileName);
```

To build and run:

```shell
mkdir build && cd build && cmake ../
make -j4
./app/trainer ../data logfile_1 network_1
```

The trained network is saved into network_1, which can be loaded and re-tested (or trained further):
```shell
./app/load ../data logfile_1_load network_1
```

Several examples (training logs as well as trained networks) can be found in the example_outputs folder. The best result so far was achieved with the following layout (after 5 epochs, momentum=0.4, learning rate=1e-4):

```cpp
mlp->addLayer<Dense>(192, 768, Activation::isrlu);
mlp->addLayer<Dense>(10, Activation::tanh);
```

Accuracy: 82.02%
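The accuracy figure is presumably the percentage of test images whose predicted digit (the index of the largest network output) matches the true label. A hedged sketch of such a computation (the function is illustrative, not the library's validation code):

```cpp
#include <algorithm>
#include <vector>

// Classification accuracy in percent: the predicted class is the index of the
// maximum output value, compared against the integer-valued label.
float accuracy(const std::vector<std::vector<float>> &outputs,
               const std::vector<float> &labels) {
    if (outputs.empty()) return 0.0f;
    int correct = 0;
    for (size_t i = 0; i < outputs.size(); ++i) {
        int predicted = static_cast<int>(
            std::max_element(outputs[i].begin(), outputs[i].end()) -
            outputs[i].begin());
        if (predicted == static_cast<int>(labels[i])) {
            ++correct;
        }
    }
    return 100.0f * correct / static_cast<float>(outputs.size());
}
```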
- Multi-layer perceptron: http://blog.refu.co/?p=931
- Commonly used activation functions: https://en.wikipedia.org/wiki/Activation_function
- Tensorflow/Keras: https://www.tensorflow.org/guide/keras
- MNIST example 1: https://towardsdatascience.com/image-classification-in-10-minutes-with-mnist-dataset-54c35b77a38d
- MNIST example 2: https://www.digitalocean.com/community/tutorials/how-to-build-a-neural-network-to-recognize-handwritten-digits-with-tensorflow#prerequisites
- MNIST database: http://yann.lecun.com/exdb/mnist/
- Read MNIST dataset in C++: http://eric-yuan.me/cpp-read-mnist/