Enhancing azo dye synthesis through small dataset machine learning : a computational approach in chemistry

Claudius  du Plessis

Back

Thesis

Open access

Enhancing azo dye synthesis through small dataset machine learning : a computational approach in chemistry

Claudius du Plessis

Master of Science (MSc), University of Johannesburg

2025

Handle:

https://hdl.handle.net/10210/517006

Abstract

Deep learning (Machine learning)

Azo dyes.

The advent of the Fourth Industrial Revolution highlighted the need for greater efficiency and productivity across all systems and aspects of our lives, a demand machine learning techniques are poised to meet. This dissertation explores the application of machine learning techniques to small datasets, specifically within the domain of azo dye synthesis in chemistry. As traditional optimisation methods face challenges when applied to large datasets, machine learning can be a viable alternative for extracting meaningful insights from limited data. The research study set out to develop predictive models that can accurately forecast outcomes in azo dye synthesis, a critical process in the chemical industry that is labour-intensive and environmentally detrimental. The difficulty in addressing this problem stems from the limited datasets available for training machine learning models to make such predictions. The study employs a design science research methodology to guide the exploration of small dataset machine learning approaches, selecting suitable techniques for azo dye synthesis. A small dataset of 119 entries is utilised in an attempt to predict the colours synthesised. It investigates various machine learning models, including Support Vector Machines, Random Forests, and Gradient Boosting Machines, assessing their performance through metrics such as Mean Absolute Error and Root Mean Square Error. The findings demonstrate that while small dataset machine learning holds promise, the quality and breadth of data are crucial for achieving accurate predictions. This dissertation offers a valuable contribution to the field by providing a benchmark for machine learning applications in chemistry and proposing a baseline approach for handling small datasets. Future research is suggested to enhance model performance through alternative optimisation techniques and expanded datasets. Overall, this research underscores the potential for machine learning to transform chemical processes by improving efficiency and sustainability, while also highlighting the need for comprehensive datasets to fully realise these benefits.

Files and links (1)

pdf

C_du Plessis9.11 MBDownload View

Open Access

Metrics

7 File views/ downloads

26 Record Views

Details

Title: Enhancing azo dye synthesis through small dataset machine learning : a computational approach in chemistry
Creators - without role: Claudius du Plessis
Contributors - without role: Wai Sze Leung Prof.
Khutso Lebea Mr.
Reinout Meijboom Prof.
Awarding Institution: University of Johannesburg; Master of Science (MSc)
Theses and Dissertations: Master of Science (MSc), University of Johannesburg
Identifiers: 9957405007691
Copyright: University of Johannesburg
Academic Unit: University of Johannesburg; Faculty of Science; Academy Computer Science and Software Engineering
Language: English
Resource Type: Thesis