Cross-encoder-based semantic evaluation of extractive and generative question answering in low-resourced african languages

doi:10.3390/ technologies13030119

Back

Cross-encoder-based semantic evaluation of extractive and generative question answering in low-resourced african languages

Journal article

Open access

Cross-encoder-based semantic evaluation of extractive and generative question answering in low-resourced african languages

2025

DOI: https://doi.org/10.3390/ technologies13030119

Handle:

https://hdl.handle.net/10210/514932

Abstract

cross-lingual question answering

extractive question answering

large language models

Efficient language analysis techniques and models are crucial in the artificial intelligence age for enhancing cross-lingual question answering. Transfer learning with state-of-the-art models has been beneficial in this regard, but the performance of lowresource African languages with morphologically rich grammatical structures and unique typologies has shown deficiencies linkable to evaluation techniques and scarce training data. To enhance the former, this paper proposes an evaluation pipeline leveraging the semantic answer similarity method enhanced with automatic answer annotation. The pipeline uses the Language-agnostic BERT Sentence Embedding model integrated with an adapted vector measure to perform cross-lingual text analysis after answer prediction. Experimental results from the multilingual-T5 and AfroXLMR models on nine languages of the AfriQA dataset surpassed existing benchmarks deploying string-based methods for question answer evaluation. The results are also superior to the F1-score-based GPT4 and Llama-2 performances on the same downstream task. The automatic answer annotation technique effectively reduced the labelling time while maintaining a high performance. Thus, the proposed pipeline is more efficient than the prevailing string-based F1 and Exact Match metrics in mixed answer type question–answer evaluations, and it is a more natural performance estimator for models targeting real-world deployment.

Files and links (1)

pdf

GetDocument (19)3.42 MBDownload View

Open Access

Metrics

1 Record Views

Details

Title: Cross-encoder-based semantic evaluation of extractive and generative question answering in low-resourced african languages
Contributors - without role: Funebi Francis Ijebu
Yuanchao Liu
Chengjie Sun
Nobert Jere
Ibomoiye Domor Ibomoiye
Identifiers: 9954298907691
Academic Unit: University of Johannesburg
Language: English
Resource Type: Journal article