Main Article Content

Improving the Computational Morphological Analysis of a Swahili Corpus for Lexicographic Purposes


G De Pauw
G-M de Schryver

Abstract

Abstract: Computational morphological analysis is an important first step in the automatic treatment of natural language and a useful lexicographic tool. This article describes a corpus-based
approach to the morphological analysis of Swahili. We particularly focus our discussion on its ability to retrieve lemmas for word forms and evaluate it as a tool for corpus-based dictionary
compilation.

Keywords: LEXICOGRAPHY, MORPHOLOGY, CORPUS ANNOTATION, LEMMATIZATION, MACHINE LEARNING, SWAHILI (KISWAHILI)

Journal Identifiers


eISSN: 2224-0039
print ISSN: 1684-4904