We created a Conda environment for this project. To run it, make sure you have the required Python packages installed (see the setup commands below).
In the directory `/ImageProcessing`: run `script.py`, which detects characters from an image by running `image_processing.py`.
The input is a JPG/PNG image, which is processed to detect individual characters/letters.
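As a rough illustration of the detection step (this is a sketch using OpenCV thresholding and contours, with placeholder parameters, not the exact logic of `image_processing.py`):

```python
import cv2

# Minimal sketch of character detection (hypothetical parameters,
# not the exact logic in image_processing.py).
image = cv2.imread("input.jpg", cv2.IMREAD_GRAYSCALE)

# Binarize: dark ink on light paper becomes white on black.
_, binary = cv2.threshold(image, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)

# Treat each external contour as one character candidate.
contours, _ = cv2.findContours(binary, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
boxes = [cv2.boundingRect(c) for c in contours]

# Mark detected characters with red rectangles (as DEV_MODE does).
color = cv2.cvtColor(image, cv2.COLOR_GRAY2BGR)
for (x, y, w, h) in boxes:
    cv2.rectangle(color, (x, y), (x + w, y + h), (0, 0, 255), 2)
cv2.imwrite("detected.jpg", color)
```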
At the top of `image_processing.py` you can configure whether to apply the shadow filter (the `SHADOW` variable) and change other parameters. You can also set `DEV_MODE` to `True` or `False`. Setting it to `True` opens some of the intermediate images from the processing steps and the resulting image with the detected characters marked by red rectangles.
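For orientation, the configuration block described above can be pictured like this (only `SHADOW` and `DEV_MODE` are named in this README; the values are placeholders):

```python
# Sketch of the configuration flags described above (only SHADOW and
# DEV_MODE are mentioned in this README; the values are placeholders).
SHADOW = True      # apply the shadow filter before detection
DEV_MODE = False   # True: show intermediate images and the red-rectangle result
```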
The detected characters are stored in `ImageProcessing/output/chars` as JPG files. All the images are then transformed (reshaped by padding) to the same resolution.
After all the images of detected characters have been padded to a uniform resolution and stored in `ImageProcessing/output/padded_chars`, they are resized to the desired shape `(28, 28)` and stored in `ImageProcessing/output/resized/`.
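A minimal sketch of the padding and resizing step, using the directory names above (the white-padding choice and OpenCV calls are assumptions, not the exact implementation):

```python
import os
import cv2
import numpy as np

CHARS_DIR = "ImageProcessing/output/chars"
PADDED_DIR = "ImageProcessing/output/padded_chars"
RESIZED_DIR = "ImageProcessing/output/resized"
os.makedirs(PADDED_DIR, exist_ok=True)
os.makedirs(RESIZED_DIR, exist_ok=True)

files = sorted(f for f in os.listdir(CHARS_DIR) if f.endswith(".jpg"))
images = [cv2.imread(os.path.join(CHARS_DIR, f), cv2.IMREAD_GRAYSCALE) for f in files]

# Pad every character image to the same (maximal) resolution.
max_h = max(img.shape[0] for img in images)
max_w = max(img.shape[1] for img in images)
for f, img in zip(files, images):
    pad_h, pad_w = max_h - img.shape[0], max_w - img.shape[1]
    padded = np.pad(img, ((0, pad_h), (0, pad_w)), constant_values=255)  # pad with white
    cv2.imwrite(os.path.join(PADDED_DIR, f), padded)
    # Resize the padded image down to the 28x28 shape used for training.
    resized = cv2.resize(padded, (28, 28), interpolation=cv2.INTER_AREA)
    cv2.imwrite(os.path.join(RESIZED_DIR, f), resized)
```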
The NumPy array of shape `(n_samples, 28, 28)` corresponding to all the images is saved as a `.npy` file in `data/training`, together with the EMNIST `.mat` dataset that we found online.
The resizing is necessary, as it drastically decreases the size of the `.npy` file.
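Stacking the resized characters into a single `(n_samples, 28, 28)` array and saving it could look like this (the output file name is a placeholder):

```python
import os
import cv2
import numpy as np

RESIZED_DIR = "ImageProcessing/output/resized"
files = sorted(f for f in os.listdir(RESIZED_DIR) if f.endswith(".jpg"))

# Stack all 28x28 character images into one (n_samples, 28, 28) array.
data = np.stack([cv2.imread(os.path.join(RESIZED_DIR, f), cv2.IMREAD_GRAYSCALE)
                 for f in files])
np.save("data/training/chars.npy", data)  # output file name is a placeholder
print(data.shape)  # (n_samples, 28, 28)
```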
A separate dataset is created for each word in the input image, so every input image now yields several datasets. These are stored in `data/test`.
The idea is then to move all these datasets into another folder, `data/training/words`, to create a larger training dataset with each word kept separate (this is done manually).
You can load and plot a random character, both from our own dataset and from the EMNIST dataset, by running the notebook `pre_training.ipynb`. The EMNIST dataset was downloaded here.
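A minimal sketch of such a notebook cell, assuming the `.npy` file produced above and the standard EMNIST Matlab format (the exact file names and the `.mat` structure are assumptions):

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.io import loadmat

# Plot a random character from our own dataset (file name is an assumption).
own = np.load("data/training/chars.npy")
plt.imshow(own[np.random.randint(len(own))], cmap="gray")
plt.title("Random character from our dataset")
plt.show()

# Plot a random character from the EMNIST .mat file; the 'dataset' key and the
# column-major 28x28 layout follow the common EMNIST Matlab format, but may
# differ depending on the download.
emnist = loadmat("data/training/emnist-letters.mat")
images = emnist["dataset"]["train"][0, 0]["images"][0, 0]
sample = images[np.random.randint(images.shape[0])].reshape(28, 28, order="F")
plt.imshow(sample, cmap="gray")
plt.title("Random character from EMNIST")
plt.show()
```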
We create a larger dataset of our own handwritten letters by concatenating multiple datasets created with individual images taken with the OV7675 camera.
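Concatenating the individual `.npy` datasets (assumed to live in `data/training/words`) into one larger array is a simple `np.concatenate` along the first axis:

```python
import glob
import numpy as np

# Concatenate all per-image/per-word datasets into one big training array.
# The folder follows the layout described above; the output name is a placeholder.
paths = sorted(glob.glob("data/training/words/*.npy"))
combined = np.concatenate([np.load(p) for p in paths], axis=0)
print(combined.shape)  # (total_samples, 28, 28)
np.save("data/training/combined_chars.npy", combined)
```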
Run `port.py <port>` with the provided port (`74A1`) as the argument. Philip's port: `/dev/cu.usbmodem1421101` (might change).
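`port.py` itself is not shown here; a minimal sketch of reading camera bytes from the given serial port with `pyserial` (baud rate and frame size are assumptions) could look like:

```python
import sys
import serial

# Open the serial port passed on the command line, e.g.
#   python port.py /dev/cu.usbmodem1421101
# The baud rate and read size are assumptions, not the values used by port.py.
port = sys.argv[1]
with serial.Serial(port, baudrate=115200, timeout=5) as ser:
    data = ser.read(320 * 240)  # read roughly one frame's worth of bytes
    with open("capture.bin", "wb") as f:
        f.write(data)
```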
Run the bash script `run.sh` to start the "pipeline": it takes an image, saves it, processes it, and stores the detected characters in the dataset `chars.npy`.
To make the file executable (macOS):
- Append this line at the top of the file: `#!/usr/bin/env bash`
- Change the file permissions: `chmod +x run.sh`
Set up the Conda environment and register it as a Jupyter kernel:
- `conda create --name <env-name>`
- `conda activate <env-name>`
- `ipython kernel install --user --name=<env-name>`
- Find the path to the library on Linux: `home/user/arduino/src`
Install the required packages:
- `conda install -c conda-forge opencv` or `pip install opencv-python`
- `pip install Pillow`
- `pip install scipy`
- `pip install scikit-image`
- `pip install pyserial`
- `conda install tensorflow tensorflow-macos tensorflow-metal`