2011-06-21

Meeting the Challenge: Edmond Kachale on Chichewa

Chichewa is a Bantu language spoken in four countries in southern Africa; namely Malawi, Mozambique, Zambia, and Zimbabwe. It is widely spoken in Malawi where at one point it had official status as a national language. Currently, it is no longer a national language, but it is a popular lingua franca with 60% of the population having full command of the language and 75% able to understand it. In Zambia, it is the third most popular language, spoken mostly in the Eastern Province and in Lusaka, and it is among the seven official African languages.  In Malawi, the language is taught in primary and secondary schools but never at tertiary levels, except for those specialising in African languages, where students learn the structure of the language (through the medium of English). 

Chichewa, like other Bantu languages, has a very complicated verbal system; a single verb root can take on thousands or even millions of different forms, depending on such things as tense, agreement with subject and object nouns, and a number of other features.   This makes even simple computer applications like spell checkers exceedingly challenging.  For speakers of similarly-complicated languages, one of the takeaway messages from the interview below is that tackling these challenges is possible, and even one person can make great strides.

Edmond Kachale is a software developer by profession, based in Malawi, and has been involved in developing natural language processing tools for Chichewa for several years now.   When I began working on Chichewa in around 2004 there were virtually no resources for the language, not even a good electronic word list.  Now, thanks to Edmond's efforts, there are some advanced resources such as a part-of-speech tagger (ChicPOS), tools for morphological analysis and generation (ChicMorph, AffixGen), and a program for visualizing phrase structure (ChiVisualize).  He has also been involved in translating important software packages and web sites into Chichewa, including the Google search interface.  Edmond tweets in Chichewa and English as @ceekays, and also writes the blog Edmond pa Kanjedza, where you'll find descriptions and screenshots of some of the software just mentioned. 

KPS: Chichewa has an interesting history in terms of its status as an official language in Malawi.   Please tell us a bit about that.

Edmond Kachale
EK: Indeed, Chichewa was previously declared by the late Hastings Kamuzu Banda, the First Head of State and Government, as the national language because the president himself was also a Chewa. But with the coming in of the democracy and the change of government administration from Kamuzu Banda to Bakili Muluzi, Chichewa lost its status as a national language on grounds that it was discriminatory to other indigenous languages. It became one of the official lingua francas in Malawi, together with Chitumbuka, Chiyao, Chilhomwe and Chisena. Even the current Malawian Constitution highlights this. Of course, Chichewa remains the only indigenous language that is taught in schools, though some linguists and educationalists are fighting for the other indigenous languages outlined above to be taught in (primary) schools too. I have been following two policies that are addressing the issue of indigenous languages in schools: 'Language in Education' Policy (LIE) and 'Language Across the Curriculum' Policy (LAC). The first policy seeks to guide on which language should used for what subject matter or which language should be taught as a subject. On the other hand, LAC advocates for use of a familiar language as medium of instruction in [primary] schools. Unfortunately, none have been adopted yet.

The then Chichewa Board was dissolved and replaced by the Centre of Language Studies (CLS). CLS was established to reflect the change in the status of Chichewa language as a national language. It was mandated to conduct research mainly on indigenous languages. Of course, right now the Chichewa Board has made a comeback and is sometimes referred by a new name, Chichewa Heritage Foundation. In addition, it has realigned its duties to focus on promotion and preservation of the other unifying cultural factors of the Chewa than the language itself. They focus on dances, Kulamba Ceremony (Paying Homage to the Paramount Chief, Kalonga Gawa Undi (implying His Lordship Gawa Undi) currently resident in Zambia) and other customs. Interestingly, the foundation/board pays less attention to the issue of language development itself. 

KPS: Chichewa (or "Chicheŵa"!) is written with just one diacritical mark, the w-with-circumflex (ŵ), but as I understand it, there has been some disagreement over its use in spelling.  What is the history of this letter and where do things stand now?

EK: Kamuzu Banda was said to be an authority (encyclopedia) of Chichewa.  Even though he used to speak in English, when his [Chichewa] interpreter would make a mistake he would correct him right away.  He was given an honorary Professorship of Chichewa by the University of Malawi for setting most of the rules of Chichewa grammar, including emphasizing the use of the bilabial affricative ŵ.[Around the time of his departure from power,] ŵ was removed from the writing/alphabet system.  Currently, it is still fighting for its way in with the "linguistic courts" (i.e. linguistic forums). It is also trying to enter via the back door into the Chitumbuka orthographical system. I have also observed that primary school books written in Chichewa still have ŵ, while Chichewa language and grammar books used in secondary/high schools do not have it. A grammatical confusion!!
 
KPS: What opportunities are there to use the language online?  Is internet connectivity or access to computers an issue for your community? How about translations of software and websites?

EK: Like other Bantu languages, usage of Chichewa in computing technologies is very marginal. However, there are a number of opportunities to use Chichewa online. Malawi has an agro-based type of economy. Most farmers live in remote areas and do not understand English, which is Malawi's official business language. This is reason enough to have software (like word processors and spreadsheets) and online content in Chichewa.

Currently, there are a few websites with Chichewa articles. There are also a few Wikipedia entries on the Chichewa/Chinyanja portal. In addition, the language is used on Google. There are four homepages that are enjoying this: Google Malawi, Google Mozambique, Google Zambia and Google Zimbabwe.

There is also one website that has a bilingual online dictionary. It offers a Chichewa to English (and vice versa) dictionary. It also provides other Chichewa resources though at a limited scale, but the online dictionary is their great artwork. These resources are available to paid registered members only.

Access to computers is a very big issue in my society. Close to 60% of the society are poor and cannot afford buying computers. In addition, most of these people live in rural areas where issues of computer access, internet connectivity and power supply are big problems. Thus, the computer and the internet have not been fully embraced in my society.

Malawian girls playing netball
There are efforts to develop localised software in Chichewa. Currently, there are spell-checking plugins for OpenOffice.org and Mozilla Firefox, though the latter has not yet found its way to the official Mozilla Addons page yet. There were once some talks to localise the OpenOffice.org Office package and the Mozilla Firefox web browser, and they were backed up by Government authorities, but now they have proved futile as there has been no progress at all. However there are still people who are interested in furthering this initiative. There was also another intervention to develop an online Bantu dictionary by one of the local language authorities, but the idea died in its premature stages.

KPS: Many speakers of indigenous and minority languages are reluctant to use their languages online, because of difficulties with keyboard input, or because they don't know terminology for talking about computing, or simply because they learned computing in a language like English or French.  Are any of these issues relevant for speakers of your language?  What is the general attitude toward using the language online?   

EK: I think I can agree that these sentiments are very common among most less popular languages like ours.  Indeed most Malawians feel ashamed to associate themselves with the language. A question of the usage of a language in Malawi is determined by factors such as status and attitudes towards what language is indigenous vis-à-vis what language provides more economic opportunities. Generally, local languages are associated with illiteracy and poverty. As such, there are a few people that are often comfortable to express themselves in the vernacular. I remember having outlined similar arguments in one paper that was published online by OSISA.
Lake Malawi

Of course, in addition to these arguments I have heard most people saying Chichewa has a complex writing system, Chichewa has less scientific terminologies and other arguments as you have rightly outlined them. But to the contrary too, people have often times blamed me (among other frequent users of Chichewa online) for using deep words in expressing myself, especially when it comes to computing  terms. So I do not understand where the issue of “Chichewa being shallow” comes from.

KPS: I mentioned above that many indigenous languages lack computing terminology.  Is this an issue for your language?  How is/was terminology developed?

EK: Yes, for sure! Our language is another victim of “lack of computing terminology syndrome”. I remember one linguist, Prof. Pascal Kishindo, also observing the same that scientific and technological terminology in Chichewa in a disordered state. Currently, the media and other corporate stakeholders are left on their own to deal with the plethora of new foreign scientific terminology. The media (TV, radios and newspapers) sometimes mislead people with wrong spellings and meaningless terms, often they are English-based paraphrased loanwords like "kompuyta" for computer, and "pulinta/printa" for printer.

Frankly, I am not a fan of official standardized terminologies. I have often felt that standardization limits language enhancement through development of terminological synonyms. Thus, standardized terminologies limit communities from developing new words for the same term. I have always believed that the communities should be left at liberty to develop terms on their own,  thereby enriching the language database. This is how words like "email", "laptop" and "netizen" found their way into the English vocabulary. What we need is a body that will only be collecting and documenting such terms, and disseminate them by publishing new dictionaries or public gazettes at least annually.

Malawian children playing
Despite the current state of affairs, I am not happy with the way the media deal with new terms. They are lazy in generating pure Chichewa terminologies for the fast growing technology usage. Now, it is becoming a challenge for Chichewa speakers to manage the constantly rising communicative complexity induced by these paraphrased loanwords in their communities. Paraphrased loanwords do not make use of the inherent terminology “generators” of the language. In addition, these loanwords often break grammatical rules. For example, in Chichewa the word printa (printer) is wrong in several aspects,  some of which include the following: (i) the consonant combination pr is non-existent, (ii) the plural form maprinta breaks the formal classification structure of nouns in not only Chichewa but also the entire Bantu noun classification system, as it traverses from a singular of class 9 to form a plural of class 6 instead of class 10, which is ungrammatical and semantically senseless!

Of course, there is the Chichewa Board but it is more interested in promoting and preserving cultural customs and traditions of the Chewa tribe than in the development of the language itself. In addition, there is the Centre for Language Studies, an academic research centre in the University of Malawi, which deals very much with issues of language development, from standardization of writing system (orthography) through to development of terminologies. Chichewa has been one of the language receiving more attention at the Centre to extent that they were able to produce Mtanthauzira-Mawu, a monolingual Chichewa dictionary. However, of late the Centre has been receiving less attention due to funding issues and “technical” restructuring within the University.

KPS: Are there other special challenges your community faces in terms of developing technology for the language and/or communicating online?

EK: There are several challenges. Issues of dialect differences are a common denominator, I think, across most indigenous languages. These are sometimes accelerated by prejudice and exasperation from tribes that claim to ancestrally own the indigenous languages. Within Malawi, people from various regions speak differing dialects with differences in spelling systems. The situation is worse when one crosses Malawian borders to Mozambique, Zambia or Zimbabwe where the Chichewa that is spoken there is completely divergent from the standard one in both grammatical and semantical structures.
Tea plantations

National politics and “cold tribal wars” also play a part in impinging development of Chichewa. When one starts issues of development or promotion of indigenous languages in Malawi, they are likely to be unfruitful as they end up in emotional damages deep-cutting into some sort of tribal competitions over language dominance and inborn prejudice over other indigenous languages. This is a very big problem even in Zambia to the extent that the government there had to declare that English is the national language as well as official mode of communication.

Another challenge is that there is less interest from foreign investors in ICT projects for the language itself. In contrast to the situation with other popular African languages like Kiswahili and Zulu, there is less tendency to assume that indigenous Malawian languages will be used on some level, especially that which is economically-oriented. I have tried contacting Microsoft on the possibilities of developing localised systems, but it is almost two years since I wrote them; all I got was “I have forwarded your request to right authorities”.  I have never heard of any efforts from Apple on development of technology for Chichewa. With John Duffel, a friend of mine, I once proposed to Facebook to add Chichewa to the list of translations to allow us localise facebook.com, just as other friends have done. But our efforts have led us nowhere as they have not responded yet since our proposal two years ago.

However, I should commend Google for its positive intervention. We are now proud that we have a Chichewa version of Google Web Search as explained above. In addition, there are talks on extending the localization project to other significant applications like Gmail. Using Google Technology User Group (GTUGs) set up in Malawi, Google is also working with local developers in trying to promote development of applications using its API that will have local usage.

KPS: Are young people using the language online?  Do you think social media sites like facebook and twitter are helping encourage language use by younger speakers?

EK: Yes, to some extent people are using Chichewa online, especially the youth. In addition, as I alluded to earlier, there are a few websites that publish articles in Chichewa. Social networks also encourage usage of the language online. Of course, language use on social forums has led to excessive growth of code-switching between English and Chichewa phrases, leading into the development a new Internet language altogether. For example, someone may say "Ndikupanga apudeti pa Fesibuku" to mean "I will update you on Facebook" or they can also say "Ndinayesa kuchigugula" to mean "I tried to google for it".

Mulanje Mountain, highest mountain in Malawi
KPS: What is your vision for your language in ten years, both in general terms and in terms of software/online use?   

EK: This question a bit difficult to answer affirmatively because there are so many factors that can affect the usage of the language in ICT.  But in general, if the situation on the ground does not change, ten years will come like a tick of a second. The Centre for Language Studies needs a special intervention and the Chichewa Board needs a different approach for the language to develop and have rich resources. I hope we need a multi-sectoral approach; both the government and private investors should take the issue of  language use and development earnestly.  Especially the government's intervention is very significant. I observed that due to political influence on natural development in Africa, some issues like those that concern language development cannot move without government intervention.

In terms of online content, I think we can do better than what we already have. Currently, much web content with intended local application, even concerning Malawians themselves and originating from Malawi, tend to use languages understood internationally, especially English. In addition, content on the Internet should also incorporate the subject matter that reflects the culture and the needs of the Malawian nation.

From the software perspective, there is a need for many Malawian developers to start working on localised systems. Otherwise we are way far behind, and if this continues like this, we will just be watching as the technological era passes us by. I have always believed that localization is another way of preserving and enhancing language. If we can take local content (such as games and systems) online, and spicing them with localised content, that will be a great stride. With high illiteracy levels, we may also want to take advantage of other forms of technology to preserve our ideas. For example, working on animations for most of the popular folk tales will do us more good, making technology more exciting and appropriate to our community. Of course, I also recognise that some few local developers have already started but we need more and more developers to join the bandwagon.

11 comments:

  1. What sort of Chichewa (or Chinyanja?) would you call this?

    http://www.smo.uhi.ac.uk/sengoidelc/donncha/tm/ilteangach/?teanga=nya

    ReplyDelete
  2. This is standard (contemporary) Chichewa that is well accepted. It is taught in schools.

    ReplyDelete
  3. Interesting comment about grassroots terminology development. I guess it makes a difference, it being a majority language. It's very frustrating for users of a language like Scots Gaelic (where at best 1.5% of the population speak the language) to be faced with multiple words for the same concept when it comes to new technology.

    ReplyDelete
  4. @ Edmond - Thanks!

    @ Michael - Scottish Gaelic needs the equivalent of focal.ie, which has been and continues to be very successful in providing standard terminology in Irish. But then, focal.ie draws on a much larger corpus of technical dictionaries and technical translation than Scottish Gaelic has at its disposal. It's a shame that Scottish Gaelic can't simply piggy-back on the Irish efforts.

    ReplyDelete
  5. @Michael Is variation more of a problem for learners or fluent speakers? Do we know it causes problems for learners?

    You might like this page, which shows the state of Irish computing terminology circa the mid-1990's: http://www.smo.uhi.ac.uk/gaeilge/focloiri/abhair/riomhaire.html

    These are words that were chosen by the people "on the ground" using Irish on email lists like Gaelic-L. It would be interesting to count up how many of these are now the official terms (mercifully many are no longer with us, like "útamáladóir" for "browser")

    ReplyDelete
  6. @Dennis We have www.faclair.com which is a hybrid, it lists variants but also allows users to vote on their preferred terms. It's getting better, mainly through yours truly cutting back on sleep (only wimps sleep every night, real men just flush their RAM :b.

    @Kevin both, though native speakers more so I'd say based on anecdotal evidence. For learners, it's annoying because there's 4 words for "export" and with neither can they be 100% someone will understand them. Natives tend to be reluctant users of new terminology anyway, some cognitive thing I think.

    And I must get útamáladóir on a tshirt, LOL what the heck!

    ReplyDelete
  7. www.faclair.com - Obair de do chuid féin, bail ó dhia ort! Bhain mé triail as “toothpaste" (taos fiacla), “insecticide” (feithidicíd) agus “calorie” (calra). Fuair mé an tríú ceann. Cad é an Ghàidhlig ar an dá cheann eile?

    Nár lagaí do lámh ar an méarchlár i lár na hoíche!
    :-)

    ReplyDelete
  8. tha beàrnan ann fhathast, tha fhios a'm. Chuir mi ris "uachdar fhiaclan" ach bha puinnsean meanbh-fhrìdean ann mu thràth!

    that aside, anyone know why I'm getting a carrot rather than my gravatar/wordpress logo?

    ReplyDelete
  9. akerbeltzalba said...
    that aside, anyone know why I'm getting a carrot rather than my gravatar/wordpress logo?


    To solve your problem, you may wish to read Why no picture in my blog comment? on http://www.sabahbloggers.com/.

    Cheers

    ReplyDelete
  10. keep it up Edmond always proud of you

    ReplyDelete