2012-10-15

Facebook in your language

It's been a long time since I posted anything here.  The Indigenous Tweets project is still going strong, and the number of languages we're tracking on Twitter continues to grow - we added the 138th and 139th languages (Inari and South Saami) to the site a couple of weeks ago.  Last week, the team at Twitter was nice enough to feature Indigenous Tweets on their "Twitter Stories" site; you can read that piece here.

Since January, I've spent a lot of time working on another project aimed at encouraging indigenous language groups to use their languages in social media.  What we're trying to do is produce translations of Facebook's interface (the menus, navigation, etc.) into as many languages as possible.

You may be aware that Facebook has a nice system in place that allows volunteers to translate the site into about 100 different languages, including a number of languages that we care about here, like Irish, Cherokee, Northern Sámi, and Aymara.  This is about the same as the number of language teams currently translating Mozilla Firefox (105) and somewhat less than the number of languages the Google search interface is available in (150). 

The trouble is, neither Facebook nor Google has added any new languages to their translation systems for quite a while.  In the case of Google, this is stated explicitly in their translation FAQ: "Right now, we're unable to support more languages in GIYL".  We haven't been able to reach anyone at Facebook about this, but we've heard second-hand that they have had problems with spam translations and poor quality from some of the smaller translation teams.  Whatever the reason, there are hundreds of language groups out there actively using Facebook to communicate in their language, but who are forced to use the site in English, Spanish, etc.  This flies in the face of Facebook's stated aim to "make Facebook available in every language across the world".

To solve this problem for his own language of Secwepemctsín, the late Neskie Manuel came up with a clever solution using a technology called Greasemonkey.  His code acts as a kind of "overlay" that runs in your web browser; as you navigate pages on Facebook, they are sent across the network to you in English, but then can be translated on the fly in your browser.
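To make the "overlay" idea concrete, here is a minimal sketch of the lookup step such a script might perform.  This is not Neskie's actual code: the names `TranslationTable`, `TextNodeLike`, `translateMessage` and `translateNodes` are hypothetical, and the Irish strings are illustrative renderings rather than entries from any real translation file.  The sketch works on plain objects standing in for the page's text nodes, so it can be run outside a browser.

```typescript
// Hypothetical sketch of an in-browser translation overlay's core logic.
// None of these names come from the original Greasemonkey script.

// Maps English interface strings to strings in the target language.
type TranslationTable = Record<string, string>;

// Minimal stand-in for a DOM text node (assumption: only nodeValue is needed).
interface TextNodeLike {
  nodeValue: string | null;
}

// Translate one interface string. Strings with no entry in the table are
// returned unchanged, so a partial translation still leaves a usable page.
function translateMessage(english: string, table: TranslationTable): string {
  const key = english.trim();
  if (!Object.prototype.hasOwnProperty.call(table, key)) {
    return english;
  }
  // Keep whatever whitespace surrounded the original message.
  const start = english.indexOf(key);
  return english.slice(0, start) + table[key] + english.slice(start + key.length);
}

// Rewrite every text node that has a translation; report how many changed.
function translateNodes(nodes: TextNodeLike[], table: TranslationTable): number {
  let changed = 0;
  nodes.forEach((node) => {
    if (node.nodeValue === null) {
      return;
    }
    const translated = translateMessage(node.nodeValue, table);
    if (translated !== node.nodeValue) {
      node.nodeValue = translated;
      changed += 1;
    }
  });
  return changed;
}

// Illustrative data only: three Irish renderings and a tiny fake "page".
const sampleTable: TranslationTable = {
  "Like": "Is maith liom",
  "Comment": "Trácht",
  "Share": "Roinn",
};
const pageText: TextNodeLike[] = [
  { nodeValue: " Like " },
  { nodeValue: "Comment" },
  { nodeValue: "Poke" }, // no entry in the table, so it stays in English
];
const changedCount = translateNodes(pageText, sampleTable);
console.log(changedCount, pageText.map((node) => node.nodeValue));
```

In a real userscript, the text nodes would presumably be collected from the live page (for instance with `document.createTreeWalker(document.body, NodeFilter.SHOW_TEXT)`) and the pass repeated whenever Facebook loads new content, but those details are assumptions on my part and not a description of how the existing scripts do it.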

At one level this is just a "hack", and even Neskie viewed it as a temporary workaround: "It would be good to be able to use the official Facebook Translations App, but Secwepemctsín isn’t listed. Until then, we can use this script."  Personally, I think it's a bigger, more important idea than that.  What it means is that any language group can undertake a translation without having to wait for Facebook's approval or permission, and the same approach works in theory for Google or other popular web sites that aren't open to translation.

I've been working on open source software translations for more than ten years, and have contributed to the Irish translations of Mozilla Firefox, LibreOffice, KDE, etc.  I've strongly advocated for an open source approach among indigenous language groups who are just starting out on software translation, because it means that the community itself can maintain control and ownership of their work, instead of having to rely on the goodwill of a big, for-profit corporation.  The trouble we're facing now, however, is that more and more of the software we use is "software as a service": Gmail instead of Mozilla Thunderbird, Google Docs instead of LibreOffice, etc., or social media sites like Twitter and Facebook.  This trend puts control of the online "linguistic landscape" firmly back in the hands of big corporations.  Neskie's approach gives us a way to maintain a measure of control over the language we choose to use online.

The response to this project has been overwhelming.  More than 60 different language groups have started translations, and we already have more than 30 that are in a usable state.  About two-thirds of these languages are endangered according to the UNESCO Atlas of the World's Languages in Danger, and in the majority of cases, I'm aware of no previous efforts to translate software into the language.

Doing a "complete" translation is quite easy.  Depending on how much terminology you have to make up, it can take as little as a couple of hours of work. I've picked out around 200 of the most common messages that appear on Facebook to be translated.  Of course this is only a small fraction of the entire site (which would be overwhelmingly large for a small language group to undertake), but by choosing these 200 messages carefully, we're able to achieve a convincing immersive experience in the target language with a minimum of effort.

There are a few technical terms needing translation (e.g. "Mobile Uploads", "email address", "Apps", "Cookies"), some site-specific jargon ("to like/unlike", "to poke someone", "status update"), and western concepts that have been difficult to render in some indigenous languages ("Privacy", "Advertising").   A useful technique for terminology creation is to see how other languages have dealt with a given concept.  To help with this, I've asked everyone who has contributed a new Facebook translation to also provide "back translations" of some of these tricky terms into English, in the hope that some of these might be helpful to new translators.   These back translations are stored on the project wiki, and we welcome additional contributions in any language.

I should also say that you don't need to translate all 200 messages if you don't want to.  For a language that is rarely, if ever, seen on the computer, I think there's great symbolic value in even a translation of just a few key words, for example "Like", "Unlike", "Comment", and "Share".

Would you like to try translating Facebook into your language?  Leave a comment below and I can send you detailed instructions!

15 comments:

  1. Yes it's a really annoying cycle we're going through, after having won at least some breathing space through the Open Source movement. In an ideal world, Firefox would join forces with LibreOffice and Co to produce online email, office and whatnot. But so far I've not heard anyone suggest that except for some muttering about some LibreOffice online project. Not sure what Mozilla is focussing on these days (apart from their new mobile OS)... from a localization point of view they seem to be adding endless developer toys but nothing much else... I hope I'm wrong.
    So I'm fully behind you there, we need to think outside the box, be creative and at least temporarily wrestle an advantage.
    Until someone cooks up even more technology which is even less accessible. I'll stop now before I repeat my blog rant about Siri and predictive texting in Gaelic/Irish/Manx ;)

  2. This comment has been removed by the author.

  3. Twitter is doing great work. Localization is a fantastic idea, and it brings much more popularity. Twitter should include some more languages where it's gaining huge popularity. For example: a Bengali version for millions of Bengali users in India and Bangladesh... One more thing: Twitter has not implemented Tailored Trends for Kolkata under India, nor for Bangladesh. Kolkata is one of the topmost metro cities in India. I can help Twitter with a Bengali version for India and Bangladesh. You can contact me.

  4. Oh! This would be an excellent project for integration with http://translatewiki.net ! I've been disappointed with Facebook too.

  5. We signed a petition to ask Facebook to include more languages but still nothing happened, to my knowledge. I am very much interested in translating Facebook as soon as there is a way to do that. My language is Fulah (ff) and we have completed major projects like Firefox...

  6. Hi Ibraahiima, I have your email from Mozilla mailing lists I think - I'll send you a message with some details on how to do this!

  7. I'd translate even 2 thousand messages, but the biggest FB script I'd found had only 700 :)

    I've done a script for Old Prussian (revived) on userscripts. Search: Piteris Sasnins

  8. Hi Piteris, that's incredible! Thanks for sending the link - I'm looking at the script now. I have some questions about Old Prussian - if you wouldn't mind answering them maybe you could send me an email? My address is here: http://borel.slu.edu/.

  9. I'd really like some detailed instructions on this - been trying to find out how this stuff works since I've heard only vague rumours tossed around about how to translate things like Facebook into minority languages. My networks with those working with Australian Aboriginal languages would be very interested to know.

  10. Hi Andre, it's easiest for me if we work together via email; can you send me a message and I can set things up for you to translate? My contact info is on my home page: http://borel.slu.edu/

  11. Hello Kevin,

    I would like to try this for Walloon. Would it be possible to get a PO file?
    You should have my address on Gmail.

    Thanks for this work.

  12. Hi Jean, I'll send you something through email.

  13. Hi Kevin

    Yes, I would like to translate my language, Bini (www.ethnologue.com/language/bin), so that it can be used as a medium to enhance the teaching and learning of the language. I got your contact from Jane Simpson at the ANU, who spoke well of your project. Look forward to working with you on this.
    Kingsley

    Replies
    1. Hi Kingsley, I'd be happy to send you instructions by email. You can write to me at "kscanne" at gmail.com. My other contact info is here: http://borel.slu.edu/.

  14. No worries. I will await your email
