Chichewa, like other Bantu languages, has a very complicated verbal system; a single verb root can take on thousands or even millions of different forms, depending on such things as tense, agreement with subject and object nouns, and a number of other features. This makes even simple computer applications like spell checkers exceedingly challenging. For speakers of similarly-complicated languages, one of the takeaway messages from the interview below is that tackling these challenges is possible, and even one person can make great strides.
Edmond Kachale is a software developer by profession, based in Malawi, and has been involved in developing natural language processing tools for Chichewa for several years now. When I began working on Chichewa in around 2004 there were virtually no resources for the language, not even a good electronic word list. Now, thanks to Edmond's efforts, there are some advanced resources such as a part-of-speech tagger (ChicPOS), tools for morphological analysis and generation (ChicMorph, AffixGen), and a program for visualizing phrase structure (ChiVisualize). He has also been involved in translating important software packages and web sites into Chichewa, including the Google search interface. Edmond tweets in Chichewa and English as @ceekays, and also writes the blog Edmond pa Kanjedza, where you'll find descriptions and screenshots of some of the software just mentioned.
KPS: Chichewa has an interesting history in terms of its status as an official language in Malawi. Please tell us a bit about that.
The then Chichewa Board was dissolved and replaced by the Centre of Language Studies (CLS). CLS was established to reflect the change in the status of Chichewa language as a national language. It was mandated to conduct research mainly on indigenous languages. Of course, right now the Chichewa Board has made a comeback and is sometimes referred by a new name, Chichewa Heritage Foundation. In addition, it has realigned its duties to focus on promotion and preservation of the other unifying cultural factors of the Chewa than the language itself. They focus on dances, Kulamba Ceremony (Paying Homage to the Paramount Chief, Kalonga Gawa Undi (implying His Lordship Gawa Undi) currently resident in Zambia) and other customs. Interestingly, the foundation/board pays less attention to the issue of language development itself.
KPS: Chichewa (or "Chicheŵa"!) is written with just one diacritical mark, the w-with-circumflex (ŵ), but as I understand it, there has been some disagreement over its use in spelling. What is the history of this letter and where do things stand now?
EK: Kamuzu Banda was said to be an authority (encyclopedia) of Chichewa. Even though he used to speak in English, when his [Chichewa] interpreter would make a mistake he would correct him right away. He was given an honorary Professorship of Chichewa by the University of Malawi for setting most of the rules of Chichewa grammar, including emphasizing the use of the bilabial affricative ŵ.[Around the time of his departure from power,] ŵ was removed from the writing/alphabet system. Currently, it is still fighting for its way in with the "linguistic courts" (i.e. linguistic forums). It is also trying to enter via the back door into the Chitumbuka orthographical system. I have also observed that primary school books written in Chichewa still have ŵ, while Chichewa language and grammar books used in secondary/high schools do not have it. A grammatical confusion!!
KPS: What opportunities are there to use the language online? Is internet connectivity or access to computers an issue for your community? How about translations of software and websites?
EK: Like other Bantu languages, usage of Chichewa in computing technologies is very marginal. However, there are a number of opportunities to use Chichewa online. Malawi has an agro-based type of economy. Most farmers live in remote areas and do not understand English, which is Malawi's official business language. This is reason enough to have software (like word processors and spreadsheets) and online content in Chichewa.
Currently, there are a few websites with Chichewa articles. There are also a few Wikipedia entries on the Chichewa/Chinyanja portal. In addition, the language is used on Google. There are four homepages that are enjoying this: Google Malawi, Google Mozambique, Google Zambia and Google Zimbabwe.
There is also one website that has a bilingual online dictionary. It offers a Chichewa to English (and vice versa) dictionary. It also provides other Chichewa resources though at a limited scale, but the online dictionary is their great artwork. These resources are available to paid registered members only.
Access to computers is a very big issue in my society. Close to 60% of the society are poor and cannot afford buying computers. In addition, most of these people live in rural areas where issues of computer access, internet connectivity and power supply are big problems. Thus, the computer and the internet have not been fully embraced in my society.
|Malawian girls playing netball|
KPS: Many speakers of indigenous and minority languages are reluctant to use their languages online, because of difficulties with keyboard input, or because they don't know terminology for talking about computing, or simply because they learned computing in a language like English or French. Are any of these issues relevant for speakers of your language? What is the general attitude toward using the language online?
EK: I think I can agree that these sentiments are very common among most less popular languages like ours. Indeed most Malawians feel ashamed to associate themselves with the language. A question of the usage of a language in Malawi is determined by factors such as status and attitudes towards what language is indigenous vis-à-vis what language provides more economic opportunities. Generally, local languages are associated with illiteracy and poverty. As such, there are a few people that are often comfortable to express themselves in the vernacular. I remember having outlined similar arguments in one paper that was published online by OSISA.
Of course, in addition to these arguments I have heard most people saying Chichewa has a complex writing system, Chichewa has less scientific terminologies and other arguments as you have rightly outlined them. But to the contrary too, people have often times blamed me (among other frequent users of Chichewa online) for using deep words in expressing myself, especially when it comes to computing terms. So I do not understand where the issue of “Chichewa being shallow” comes from.
KPS: I mentioned above that many indigenous languages lack computing terminology. Is this an issue for your language? How is/was terminology developed?
EK: Yes, for sure! Our language is another victim of “lack of computing terminology syndrome”. I remember one linguist, Prof. Pascal Kishindo, also observing the same that scientific and technological terminology in Chichewa in a disordered state. Currently, the media and other corporate stakeholders are left on their own to deal with the plethora of new foreign scientific terminology. The media (TV, radios and newspapers) sometimes mislead people with wrong spellings and meaningless terms, often they are English-based paraphrased loanwords like "kompuyta" for computer, and "pulinta/printa" for printer.
Frankly, I am not a fan of official standardized terminologies. I have often felt that standardization limits language enhancement through development of terminological synonyms. Thus, standardized terminologies limit communities from developing new words for the same term. I have always believed that the communities should be left at liberty to develop terms on their own, thereby enriching the language database. This is how words like "email", "laptop" and "netizen" found their way into the English vocabulary. What we need is a body that will only be collecting and documenting such terms, and disseminate them by publishing new dictionaries or public gazettes at least annually.
|Malawian children playing|
Of course, there is the Chichewa Board but it is more interested in promoting and preserving cultural customs and traditions of the Chewa tribe than in the development of the language itself. In addition, there is the Centre for Language Studies, an academic research centre in the University of Malawi, which deals very much with issues of language development, from standardization of writing system (orthography) through to development of terminologies. Chichewa has been one of the language receiving more attention at the Centre to extent that they were able to produce Mtanthauzira-Mawu, a monolingual Chichewa dictionary. However, of late the Centre has been receiving less attention due to funding issues and “technical” restructuring within the University.
KPS: Are there other special challenges your community faces in terms of developing technology for the language and/or communicating online?
EK: There are several challenges. Issues of dialect differences are a common denominator, I think, across most indigenous languages. These are sometimes accelerated by prejudice and exasperation from tribes that claim to ancestrally own the indigenous languages. Within Malawi, people from various regions speak differing dialects with differences in spelling systems. The situation is worse when one crosses Malawian borders to Mozambique, Zambia or Zimbabwe where the Chichewa that is spoken there is completely divergent from the standard one in both grammatical and semantical structures.
National politics and “cold tribal wars” also play a part in impinging development of Chichewa. When one starts issues of development or promotion of indigenous languages in Malawi, they are likely to be unfruitful as they end up in emotional damages deep-cutting into some sort of tribal competitions over language dominance and inborn prejudice over other indigenous languages. This is a very big problem even in Zambia to the extent that the government there had to declare that English is the national language as well as official mode of communication.
Another challenge is that there is less interest from foreign investors in ICT projects for the language itself. In contrast to the situation with other popular African languages like Kiswahili and Zulu, there is less tendency to assume that indigenous Malawian languages will be used on some level, especially that which is economically-oriented. I have tried contacting Microsoft on the possibilities of developing localised systems, but it is almost two years since I wrote them; all I got was “I have forwarded your request to right authorities”. I have never heard of any efforts from Apple on development of technology for Chichewa. With John Duffel, a friend of mine, I once proposed to Facebook to add Chichewa to the list of translations to allow us localise facebook.com, just as other friends have done. But our efforts have led us nowhere as they have not responded yet since our proposal two years ago.
However, I should commend Google for its positive intervention. We are now proud that we have a Chichewa version of Google Web Search as explained above. In addition, there are talks on extending the localization project to other significant applications like Gmail. Using Google Technology User Group (GTUGs) set up in Malawi, Google is also working with local developers in trying to promote development of applications using its API that will have local usage.
KPS: Are young people using the language online? Do you think social media sites like facebook and twitter are helping encourage language use by younger speakers?
EK: Yes, to some extent people are using Chichewa online, especially the youth. In addition, as I alluded to earlier, there are a few websites that publish articles in Chichewa. Social networks also encourage usage of the language online. Of course, language use on social forums has led to excessive growth of code-switching between English and Chichewa phrases, leading into the development a new Internet language altogether. For example, someone may say "Ndikupanga apudeti pa Fesibuku" to mean "I will update you on Facebook" or they can also say "Ndinayesa kuchigugula" to mean "I tried to google for it".
|Mulanje Mountain, highest mountain in Malawi|
EK: This question a bit difficult to answer affirmatively because there are so many factors that can affect the usage of the language in ICT. But in general, if the situation on the ground does not change, ten years will come like a tick of a second. The Centre for Language Studies needs a special intervention and the Chichewa Board needs a different approach for the language to develop and have rich resources. I hope we need a multi-sectoral approach; both the government and private investors should take the issue of language use and development earnestly. Especially the government's intervention is very significant. I observed that due to political influence on natural development in Africa, some issues like those that concern language development cannot move without government intervention.
In terms of online content, I think we can do better than what we already have. Currently, much web content with intended local application, even concerning Malawians themselves and originating from Malawi, tend to use languages understood internationally, especially English. In addition, content on the Internet should also incorporate the subject matter that reflects the culture and the needs of the Malawian nation.
From the software perspective, there is a need for many Malawian developers to start working on localised systems. Otherwise we are way far behind, and if this continues like this, we will just be watching as the technological era passes us by. I have always believed that localization is another way of preserving and enhancing language. If we can take local content (such as games and systems) online, and spicing them with localised content, that will be a great stride. With high illiteracy levels, we may also want to take advantage of other forms of technology to preserve our ideas. For example, working on animations for most of the popular folk tales will do us more good, making technology more exciting and appropriate to our community. Of course, I also recognise that some few local developers have already started but we need more and more developers to join the bandwagon.
What sort of Chichewa (or Chinyanja?) would you call this?ReplyDelete
This is standard (contemporary) Chichewa that is well accepted. It is taught in schools.ReplyDelete
Interesting comment about grassroots terminology development. I guess it makes a difference, it being a majority language. It's very frustrating for users of a language like Scots Gaelic (where at best 1.5% of the population speak the language) to be faced with multiple words for the same concept when it comes to new technology.ReplyDelete
@ Edmond - Thanks!ReplyDelete
@ Michael - Scottish Gaelic needs the equivalent of focal.ie, which has been and continues to be very successful in providing standard terminology in Irish. But then, focal.ie draws on a much larger corpus of technical dictionaries and technical translation than Scottish Gaelic has at its disposal. It's a shame that Scottish Gaelic can't simply piggy-back on the Irish efforts.
@Michael Is variation more of a problem for learners or fluent speakers? Do we know it causes problems for learners?ReplyDelete
You might like this page, which shows the state of Irish computing terminology circa the mid-1990's: http://www.smo.uhi.ac.uk/gaeilge/focloiri/abhair/riomhaire.html
These are words that were chosen by the people "on the ground" using Irish on email lists like Gaelic-L. It would be interesting to count up how many of these are now the official terms (mercifully many are no longer with us, like "útamáladóir" for "browser")
@Dennis We have www.faclair.com which is a hybrid, it lists variants but also allows users to vote on their preferred terms. It's getting better, mainly through yours truly cutting back on sleep (only wimps sleep every night, real men just flush their RAM :b.ReplyDelete
@Kevin both, though native speakers more so I'd say based on anecdotal evidence. For learners, it's annoying because there's 4 words for "export" and with neither can they be 100% someone will understand them. Natives tend to be reluctant users of new terminology anyway, some cognitive thing I think.
And I must get útamáladóir on a tshirt, LOL what the heck!
www.faclair.com - Obair de do chuid féin, bail ó dhia ort! Bhain mé triail as “toothpaste" (taos fiacla), “insecticide” (feithidicíd) agus “calorie” (calra). Fuair mé an tríú ceann. Cad é an Ghàidhlig ar an dá cheann eile?ReplyDelete
Nár lagaí do lámh ar an méarchlár i lár na hoíche!
tha beàrnan ann fhathast, tha fhios a'm. Chuir mi ris "uachdar fhiaclan" ach bha puinnsean meanbh-fhrìdean ann mu thràth!ReplyDelete
that aside, anyone know why I'm getting a carrot rather than my gravatar/wordpress logo?
that aside, anyone know why I'm getting a carrot rather than my gravatar/wordpress logo?
To solve your problem, you may wish to read Why no picture in my blog comment? on http://www.sabahbloggers.com/.
keep it up Edmond always proud of youReplyDelete
@Edmond lets talkReplyDelete