An explainable ensemble machine learning approach for multi-domain, multiclass sentiment analysis in Amazon product reviews

Kamogelo Mokgwatjane; Thulane Paepae


Journal article

Open access

2025

Handle:

https://hdl.handle.net/10210/518610

Abstract

Sentiment analysis

Machine learning

Ensemble learning

user-generated content volumes and class imbalances hinder accurate multiclass predictions and model interpretability. This study introduces a novel explainable ensemble learning framework for multiclass sentiment analysis (SA) with three classes (positive, neutral, negative) across three Amazon product domains: appliances, groceries, and clothing. The framework integrates diverse supervised classifiers in a stacking ensemble. SHapley Additive exPlanations (SHAP) are employed not only to elucidate feature contributions but also to rank and interpret the individual impacts of base classifiers on ensemble predictions. This is a pioneering application in domain-specific SA: it enables global insights into model dynamics and base model selection, addressing gaps in prior studies that relied on local explanations such as LIME (Local Interpretable Model-agnostic Explanations). Evaluated using imbalance-sensitive metrics (weighted and macro F1-score, Matthews Correlation Coefficient (MCC), Cohen's Kappa, and Geometric Mean (G-Mean)), the ensemble surpasses the individual classifiers and achieves higher macro F1 and G-Mean than the transformer-based ALBERT model, while ALBERT excels in weighted F1, MCC, and Cohen's Kappa. Extra Trees notably excelled in the G-Mean for minority classes. SHAP analysis uncovers domain-specific drivers and base model roles, enhancing transparency. The results underscore the framework's efficacy in delivering robust performance and actionable insights for trust modelling, automated analytics, and personalized recommendations. This work lays the groundwork for extensions to low-resource domains, multimodal data, and finer rating scales, advancing interpretable SA in e-commerce.
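As an illustration of why the abstract reports both weighted and macro scores, the following minimal Python sketch computes the five imbalance-sensitive metrics it names on a small, made-up three-class example. It is not the authors' evaluation code. The function names and the toy labels are hypothetical, and the G-Mean is assumed here to be the geometric mean of per-class recalls, which may differ from the definition used in the article.

```python
from collections import Counter
from math import prod, sqrt


def _counts(y_true, y_pred):
    """Return sorted labels and a Counter of (true, predicted) pairs."""
    labels = sorted(set(y_true) | set(y_pred))
    return labels, Counter(zip(y_true, y_pred))


def per_class(y_true, y_pred):
    """Support, recall, and F1 for each class."""
    labels, pairs = _counts(y_true, y_pred)
    stats = {}
    for c in labels:
        tp = pairs[(c, c)]
        fn = sum(pairs[(c, p)] for p in labels if p != c)
        fp = sum(pairs[(t, c)] for t in labels if t != c)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        stats[c] = {"support": tp + fn, "recall": recall, "f1": f1}
    return stats


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1: every class counts equally."""
    s = per_class(y_true, y_pred)
    return sum(v["f1"] for v in s.values()) / len(s)


def weighted_f1(y_true, y_pred):
    """Support-weighted mean of per-class F1: dominated by large classes."""
    s = per_class(y_true, y_pred)
    total = sum(v["support"] for v in s.values())
    return sum(v["f1"] * v["support"] for v in s.values()) / total


def g_mean(y_true, y_pred):
    """Geometric mean of per-class recalls (assumed definition)."""
    s = per_class(y_true, y_pred)
    return prod(v["recall"] for v in s.values()) ** (1 / len(s))


def _marginals(y_true, y_pred):
    """Correct count, sample count, and per-class true/predicted totals."""
    labels, pairs = _counts(y_true, y_pred)
    n = len(y_true)
    correct = sum(pairs[(c, c)] for c in labels)
    t = Counter(y_true)
    p = Counter(y_pred)
    return labels, n, correct, t, p


def mcc(y_true, y_pred):
    """Multiclass Matthews Correlation Coefficient."""
    labels, n, correct, t, p = _marginals(y_true, y_pred)
    num = correct * n - sum(t[c] * p[c] for c in labels)
    den = sqrt((n * n - sum(p[c] ** 2 for c in labels))
               * (n * n - sum(t[c] ** 2 for c in labels)))
    return num / den if den else 0.0


def cohen_kappa(y_true, y_pred):
    """Cohen's Kappa: observed agreement corrected for chance agreement."""
    labels, n, correct, t, p = _marginals(y_true, y_pred)
    p_o = correct / n
    p_e = sum(t[c] * p[c] for c in labels) / (n * n)
    return (p_o - p_e) / (1 - p_e) if p_e != 1 else 0.0


if __name__ == "__main__":
    # Toy, imbalanced labels invented for illustration only.
    y_true = ["pos"] * 6 + ["neu"] * 2 + ["neg"] * 2
    y_pred = ["pos"] * 5 + ["neu", "neu", "pos", "neg", "neg"]
    for fn in (weighted_f1, macro_f1, g_mean, mcc, cohen_kappa):
        print(f"{fn.__name__}: {fn(y_true, y_pred):.3f}")
```

On this toy input the weighted F1 is higher than the macro F1, because the majority class is predicted well while the minority "neu" class is not. This is the kind of gap that lets one model lead on weighted F1 and another lead on macro F1 and G-Mean.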

Details

Title: An explainable ensemble machine learning approach for multi-domain, multiclass sentiment analysis in Amazon product reviews
Creators: Kamogelo Mokgwatjane; Thulane Paepae
Identifiers: 9959606507691
Academic Unit: University of Johannesburg; Faculty of Science; Department of Mathematics and Applied Mathematics
Language: English
Resource Type: Journal article