This project is an implementation of the neural network found in miniMNIST-c, in Python, with NumPy.
It is a minimal neural network for classifying handwritten digits from the MNIST dataset, and the entire implementation is 87 lines of code according to cloc.
Unlike miniMNIST-c, this project makes use of one library: NumPy.
NumPy is used for its powerful N-dimensional arrays which are extremely fast and allow the entire network to be vectorised and trained rapidly.
It also makes translating the mathematics behind a simple feed-forward neural network into Python much easier, as it provides useful functions such as the matrix dot product, sampling from the normal distribution, and the argmax function.
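As a rough illustration of how those NumPy primitives map onto the network (this is not the exact code from `main.py`; the shapes and variable names below are placeholders):

```python
import numpy as np

# Placeholder shapes: 784 input pixels and 256 hidden neurons
# (not necessarily the sizes used in main.py).
weights = np.random.normal(0.0, 0.01, size=(784, 256))  # normal distribution for weight initialisation
batch = np.random.rand(32, 784)                          # a dummy batch of 32 flattened images

hidden = np.dot(batch, weights)        # matrix dot product: one vectorised layer forward pass
predicted = np.argmax(hidden, axis=1)  # argmax: highest-scoring class index for each image
```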
- Two-layer Neural Network (Input -> Hidden -> Output)
- ReLU activation for the Hidden Layer and SoftMax activation for the Output Layer
- Cross-entropy Loss function (Log Loss)
- Stochastic Gradient Descent (SGD) optimizer
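The features above correspond to a training step along the lines of the following sketch. This is an illustrative NumPy version of the same ideas, not the code from `main.py`; the layer sizes, initialisation, and variable names are assumptions.

```python
import numpy as np

# Assumed sizes and learning rate, for illustration only
INPUT, HIDDEN, OUTPUT, LR = 784, 256, 10, 0.01

rng = np.random.default_rng(0)
W1 = rng.normal(0.0, 0.01, (INPUT, HIDDEN))
b1 = np.zeros(HIDDEN)
W2 = rng.normal(0.0, 0.01, (HIDDEN, OUTPUT))
b2 = np.zeros(OUTPUT)

def train_step(x, y_onehot):
    """One SGD step on a mini-batch: forward pass, cross-entropy loss, backward pass, update."""
    # Forward pass: Input -> Hidden (ReLU) -> Output (SoftMax)
    z1 = x @ W1 + b1
    h = np.maximum(z1, 0.0)                       # ReLU activation
    logits = h @ W2 + b2
    exp = np.exp(logits - logits.max(axis=1, keepdims=True))
    probs = exp / exp.sum(axis=1, keepdims=True)  # SoftMax activation

    # Cross-entropy (log) loss, averaged over the batch
    loss = -np.mean(np.sum(y_onehot * np.log(probs + 1e-12), axis=1))

    # Backward pass (gradient of softmax + cross-entropy is probs - targets)
    n = x.shape[0]
    d_logits = (probs - y_onehot) / n
    dW2 = h.T @ d_logits
    db2 = d_logits.sum(axis=0)
    d_h = d_logits @ W2.T
    d_z1 = d_h * (z1 > 0)                         # ReLU derivative
    dW1 = x.T @ d_z1
    db1 = d_z1.sum(axis=0)

    # Plain (momentum-free) Stochastic Gradient Descent update
    for param, grad in ((W1, dW1), (b1, db1), (W2, dW2), (b2, db2)):
        param -= LR * grad

    return loss
```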
- The code for `miniMNIST-python` is commented to an almost extreme degree, as this implementation was developed for a workshop presented by CompSoc (University of Galway's Computer Society) on 2024/10/16
- Many of the optimisations made to `miniMNIST-c` since its initial release have not been implemented in `miniMNIST-python`
- Does not implement the Momentum-based variation of Stochastic Gradient Descent
- Utilises the `t10k` testing dataset rather than taking a slice of the `MNIST` training dataset for testing
- Python 3.12
- NumPy
- MNIST dataset files:
  - `train-images.idx3-ubyte`
  - `train-labels.idx1-ubyte`
  - `t10k-images.idx3-ubyte`
  - `t10k-labels.idx1-ubyte`
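The IDX files listed above are plain binary: a small header followed by raw pixel or label bytes (a 16-byte header for images, an 8-byte header for labels). If you want to see how they can be parsed with NumPy alone, a sketch along these lines works; this is not necessarily how `main.py` reads them, and the function names are hypothetical:

```python
import numpy as np

def load_images(path):
    # IDX image files: 16-byte header (magic, count, rows, cols), then uint8 pixels
    with open(path, "rb") as f:
        data = np.frombuffer(f.read(), dtype=np.uint8, offset=16)
    return data.reshape(-1, 28 * 28).astype(np.float64) / 255.0  # flatten and scale to [0, 1]

def load_labels(path):
    # IDX label files: 8-byte header (magic, count), then one uint8 label per image
    with open(path, "rb") as f:
        return np.frombuffer(f.read(), dtype=np.uint8, offset=8)

train_images = load_images("train-images.idx3-ubyte")
train_labels = load_labels("train-labels.idx1-ubyte")
```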
- Place the MNIST dataset files in the same directory as `main.py` (the root of the project)
- Install NumPy: `pip install numpy`
- Execute the program with Python 3.12: `python main.py`

The script will train the neural network and output the accuracy and average loss for each training epoch.
The constants at the top of `main.py` can be adjusted to change the behaviour of the network, namely:

- `HIDDEN_SIZE`: The number of neurons in the Hidden Layer
- `LEARNING_RATE`: The learning rate for Stochastic Gradient Descent
- `EPOCHS`: The number of training epochs
- `BATCH_SIZE`: The batch size for training (in this implementation it must be a number which divides cleanly into 60,000)
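For reference, these constants form a simple block at the top of `main.py`; the values below are placeholders for illustration, not necessarily the ones shipped in the repository:

```python
HIDDEN_SIZE = 256     # number of neurons in the hidden layer
LEARNING_RATE = 0.01  # step size for SGD updates
EPOCHS = 10           # passes over the training data
BATCH_SIZE = 100      # must divide cleanly into the 60,000 training images
```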
This project is open-source and available under the MIT License.