Multi-Agent Image Recognition System for Mathematical Expressions appearing in natural images

Daniel Ogwok

Back

Multi-Agent Image Recognition System for Mathematical Expressions appearing in natural images

Thesis

Open access

Multi-Agent Image Recognition System for Mathematical Expressions appearing in natural images

Daniel Ogwok

Master of Science (MSc), University of Johannesburg

2021

Handle:

https://hdl.handle.net/10210/498907

Abstract

Image processing - Digital techniques

Computer Graphics

Images have been used to express and convey information for so many years. Over the years, the human visual cortex has adapted through the environments that we have lived in. This has paved way for cognitive modelling of the visual cortex, where computer models have been developed to carry out visual information processing functions. This dissertation presents a system that reads in digital images of mathematical expressions with noisy backgrounds, and then applies agents to the various stages of image recognition, identifying the characters in the mathematical expression. The model is called Natural Image Mathematical Expression Recognition Model (NIMER). The NIMER model applies both supervised and unsupervised learning methods to the process of recognition. NIMER follows the classic two step recognition process, which is segmentation and classification, applying multiple agents and ensemble learning at each of the stages. The segmentation stage is composed of Region-based Convolutional Neural Network (R-CNN), Minimum Spanning Tree (MST) and Connected Components Labelling (CCL) agents. The MST and CCL agents apply a form of unsupervised learning similar to clustering in order to segment images, and R-CNN uses supervised learning. The classification stage is made up of Convolutional Neural Network (CNN), K-Nearest Neighbour (KNN) and Support Vector Machine (SVM) agents which all use a supervised form of learning. The NIMER model presents ensemble results at each stage that are better than the individual agent results. M.Sc. (Computer Science)

Files and links (1)

pdf

Ogwok_Daniel Wtm.pdf6.93 MBDownload View

Open Access

Metrics

22 File views/ downloads

28 Record Views

Details

Title: Multi-Agent Image Recognition System for Mathematical Expressions appearing in natural images
Creators - without role: Daniel Ogwok
Contributors - without role: E.M. Ehlers
Awarding Institution: University of Johannesburg; Master of Science (MSc)
Theses and Dissertations: Master of Science (MSc), University of Johannesburg
Identifiers: 9910584807691
Copyright: University of Johannesburg
Academic Unit: Academy Computer Science and Software Engineering
Resource Type: Thesis