Spoken language corpora for the nine official African languages of South Africa
Abstract
normative vs. non-normative approaches to corpus planning. We then give an
outline of the design of a spoken language corpus for the nine official African
languages of South Africa. We consider issues such as representativity and
sampling (urban–rural, dialects, gender, social class and activities),
transcription standards and conventions, as well as the problems arising from
widespread loans, code-switching and other forms of language mixing
characteristic of spoken language. Finally, we summarise the status of the
project at present and plans for the future.
Southern African Linguistics and Applied Language Studies 2003, 21(4): 189–201