The news in black and white : word embeddings quantify racism in South African news
Journal article   Open access   Peer reviewed

Nnaemeka Ohamadike, Kevin Durrheim and Mpho Primus
EPJ Data Science, Vol. 14(1), p. 83
01/12/2025
Handle: https://hdl.handle.net/10210/519239
PMID: 41323296

Abstract

Subjects: Mathematical Methods in Social Sciences; Mathematics, Interdisciplinary Applications; Science & Technology; Social Sciences, Mathematical Methods; Mathematics; Physical Sciences; Social Sciences
Does race bias manifest in South African news, and how can computational methods like word embeddings reveal it? After apartheid's end in 1994, South Africa implemented policies to address racial and economic divides and transform institutions and structures, including the news media. This study introduces a computational approach to quantify race bias in South African news using neural embeddings. We trained word2vec word embeddings on COVID-19 vaccination news articles from 76 South African news sources. These large-scale embeddings are unbiased by design but can detect and reveal hidden biases in language. We found consistent race bias in the coverage of socioeconomic phenomena, while health results were weaker, mixed and likely corpus-dependent. COVID-19 may have also amplified associations between "Black" and unhealthy terms in news coverage. Our methodology complements traditional qualitative techniques and allows for a more objective and representative way of investigating racism in South African news. Findings are validated through multiple methods, including human ratings, and have implications for South African news and this research field.
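
As a rough illustration of how embedding associations can be quantified, the sketch below compares how close two group terms sit to a set of attribute terms using cosine similarity. It is not the authors' method: the abstract does not specify their bias measure, and the function names (`cosine`, `mean_similarity`, `association_gap`) and the two-dimensional toy vectors are hypothetical stand-ins for vectors that would come from a trained word2vec model.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def mean_similarity(word_vec, attribute_vecs):
    """Average cosine similarity of one word to a list of attribute words."""
    return sum(cosine(word_vec, a) for a in attribute_vecs) / len(attribute_vecs)

def association_gap(target_a, target_b, attribute_vecs):
    """Positive values mean target_a sits closer to the attributes than target_b."""
    return (mean_similarity(target_a, attribute_vecs)
            - mean_similarity(target_b, attribute_vecs))

# Toy 2-D vectors (assumption: real vectors would be looked up in a trained model).
toy = {
    "group_a": [1.0, 0.0],
    "group_b": [0.0, 1.0],
    "attr_1": [1.0, 0.0],
    "attr_2": [1.0, 1.0],
}

gap = association_gap(
    toy["group_a"], toy["group_b"], [toy["attr_1"], toy["attr_2"]]
)
print(round(gap, 3))
```

A gap near zero would suggest that both group terms are about equally associated with the attribute set. Larger absolute values indicate a stronger asymmetry in the corpus the embeddings were trained on.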
Full text (PDF, 3.97 MB): CC BY-NC-ND 4.0, Open Access
DOI: https://doi.org/10.1140/epjds/s13688-025-00594-2 (Published, version of record; open access)
