Translate This: Google's Quest to End the Language Barrier

By Thomas Schulz

German Chancellor Angela Merkel and Israeli Prime Minister Benjamin Netanyahu seen in the reflection of the interpreter's booth at a Berlin press briefing in 2009. Zoom
AP

German Chancellor Angela Merkel and Israeli Prime Minister Benjamin Netanyahu seen in the reflection of the interpreter's booth at a Berlin press briefing in 2009.

Can the language barrier be breached? Google certainly thinks so: Under the leadership of a computer scientist from Germany, the company is making progress toward a universal translation tool. But competition is looming from Microsoft and Facebook.

When science-fiction writers envision the future of mankind, a number of ideas for improving the world repeatedly pop up. They include free, unlimited energy and spaceships traveling at the speed of light. And they include the creation of miniature computers that serve as universal translators, eliminating all language barriers.

The last of these dreams, at least, is something Google intends to make a reality. The man in charge of the project is a computer scientist from a small village near Erlangen, in southwestern Germany.

Franz Josef Och, 41, has long been focused on a single goal: to build the perfect translation computer, a machine that is so inconspicuous and fast "that you hardly notice it all, except as a whisper in your ear," a device that can promptly spit out any text, the content of a website or a conversation in any other language.

For Och, what could amount to trouble-free travel for tourists and a grim future for many interpreters is in fact much more: a path to artificial intelligence. Even as a teenager, he dreamed of machines that could simulate uniquely human attributes.

Och studied computer science and eventually earned a Ph.D., writing his dissertation on artificial intelligence. The Defense Advanced Research Projects Agency (DARPA), part of the United States Department of Defense, brought him to California. In 2004, Och received a call from Google co-founder Larry Page, who asked: Don't you want to scale up? Och was given his own department, Google Translate.

Exotic fantasies of the future are part of the Internet giant's business model, from the self-driving automobile to experiments on extending life expectancy. Although Google Translate has produced little to date, it is likely that no one has greater ambitions than Och and his team: They want to finally put an end to mankind's Babylonian mixture of languages.

Free of Charge

The German computer scientist and his team have already made substantial progress. Google Translate can now translate text back and forth between 71 languages, be it from English into German or from Icelandic into Japanese.

For instance, when a SPIEGEL article is entered into the program, the user can obtain a French translation within a fraction of a second. Italian websites can be displayed in German with the touch of a button. A puzzling sign at a Japanese train station merely has to be photographed with a smartphone; the Translate app automatically recognizes the text in the image and translates it.

The Google service was used about 200 million times last year. And, so far, it has been free of charge. But multinational companies, in particular, would attach a great deal of value to an automated translation service; Google could almost certainly charge a substantial amount of money for the tool in the future.

For now, however, the company's goal is to perfect the service, and its path leads through the smartphone. The Translate team has developed an app that transforms smartphones into a talking translation machine, with the ability to handle about two dozen languages so far.

The app works very well, as long as sentences are kept relatively simple. For instance, someone who wants to tell a taxi driver in Beijing that he urgently needs to get to a pharmacy simply has to speak into his smartphone in, for example, German, and it promptly repeats the sentence in Chinese, correctly but in a somewhat tinny voice: "Qing dai wo qu yijia yaodian."

Och feels that the application is still "slightly slow and awkward, because you have to press buttons." The quality of the translation is also inconsistent. But only a few years ago, people would have said he was crazy if he had predicted what Translate could do today.

'A Quantum Leap'

IBM laid the foundation for automated translation decades ago, but the project made no headway and was soon abandoned. For this reason, the first years at Google were "a pure research project" for Och and a handful of coworkers. Recently, however, they have made tremendous progress -- "a quantum leap," as Och calls it.

The Translate team, headquartered at Google's main campus in Silicon Valley, has grown considerably. It includes several German computer scientists, but not a single linguist. Och himself isn't exactly a talented linguist, either. On the contrary, he says, "I have trouble learning languages, and that's precisely the beauty of machine translation: The most important thing is to be good at math and statistics, and to be able to program."

Developing dictionaries, defining grammatical structures and all the rules that are normally fed into translation programs to mimic human translators hardly play a role for the Google team. In fact, rules often turn out to be too inflexible and are too much for the computer to handle. In English and in German, for example, adjectives precede the noun, whereas in French it's usually the other way around. As a result, a translation program quickly becomes "lost in translation."

Instead, Google emphasizes what it knows best: the sheer mass of data and statistics. "So what the system is basically doing (is) correlating existing translations and learning more or less on its own how to do that with billions and billions of words of text," says Och. "In the end, we compute probabilities of translation."

And the larger the mass of existing data, the better the system works. This is why the translation computer only became possible as a result of the Internet. The worldwide web offers an enormous wealth of existing translations.

Far from Perfect

The algorithms search through the clutter, gather data and learn along the way. For instance, when the program is tasked with translating a sentence from French into German, say, it searches for matching phrases that already exist and computes how to best compose a new sentence based on this information.

This often works well, but it is still far from perfect. Syntax, intonation and ambiguity remain a significant problem for automated translation programs. Often, the translations they produce are comprehensible, though barely, but are also a nightmare for linguists. The machine simply has no sense of aesthetics.

So does Google have to teach the program creativity to be truly successful? No, says Och, noting that it's really a question of "more effective learning." In other words, the computer needs to master the art of imitation.

The translations are generally better when, from the standpoint of the translation program, the languages are structurally similar in terms of grammar and word structure, and when there is a lot of data. The combination of English and Spanish works particularly well, whereas translating between English and Japanese isn't nearly as effective. "And, unfortunately, English and German happen to be among the difficult language pairs," says Och, "which really irks me."

But what exactly does Google plan to do with Translate? Does the company want to achieve a monopoly on human communication and supremacy over global interaction?

A Slice of Humanity

Och remains vague, saying that the business side of things isn't his department. But he is willing to say this much: "Machine translation makes a lot of information accessible for many people, and that makes many other things possible." The indirect benefit, he explains, is what's important. Google has taken a similar approach with its other products. Its search engine and email service are available at no charge, for now. But they also supply Google with hundreds of millions of users and their data.

At the same time, there is growing competition. A few weeks ago, Facebook acquired a small company that has developed a system for language entry and automated translation. The social network wants to use the system to facilitate communication among its users, of which there are more than a billion from upwards of 200 countries.

The European Union has developed its own automated translation program, called MT@EC, which is primarily designed to tackle bureaucratic jargon. Owing to budget cuts, the EU will likely have to eliminate 10 percent of its human translation workforce in the coming years.

Microsoft has also launched a translation project. Last fall, Rick Rashid, head of Microsoft Research's worldwide operations, unveiled the result at a conference in China. Rashid spoke in English, and a Microsoft computer simultaneously translated his words into Mandarin -- not in a tinny monotone, but in Rashid's own voice, assembled from previously recorded speech samples. At least it captured a slice of humanity.

Translated from the German by Christopher Sultan

Article...
  • For reasons of data protection and privacy, your IP address will only be stored if you are a registered user of Facebook and you are currently logged in to the service. For more detailed information, please click on the "i" symbol.
  • Post to other social networks

Comments
Discuss this issue with other readers!
13 total posts
Show all comments
    Page 1    
1. In the end, we compute probabilities
spon-facebook-685707152 09/13/2013
This is also how the Blackjack game was solved by statistically looking at the low value and high value of the remaining cards. More low value cards remaining increase the odds of house breaking...
2. machine translation
paul.postal 09/13/2013
So there is this great program to translate between 71 languages but the translation of this article from German to English is done by Christopher Sultan and not the great program. Maybe such articles should wait until people like Mr. Sultan are no longer needed.
3. Google translate?
riggbeck 09/13/2013
Google Translate is not up to the job, if English translations of foreign languages are anything to go by. Literal translations of idioms that produce unintentionally hilarious results.
4. ...
Newspeak 09/14/2013
Years ago there were simple pocket translators that already worked well for the most simple and practically needed phrases. I do not see how Google Translate has improved this situation that much. It is ok, well, for the layman. But before companies would get interested it should be able to manage highly technical speech and to a much more reliable outcome. It cannot replace a professional translator by any means in this context. Similar in politics. It seems very stupid to me, that the EU really aims to replace human translators by algorithmic ones as long as the results of translations are that poor. In diplomacy every nuance counts. Wrong translations may be able to even start a war. In fact, I would find it easier to enforce a common language like Esperanto or whatever (which nearly always failed in the past) than getting a perfect translator within the same amount of time.
5. Spanish
CAC1031 09/14/2013
I can attest that the Spanish-English translation is very good. It has improved before my eyes in the last few years. As the article suggested, it must be a matter of accumulating already translated web data as there must be many Spanish-English dual-language websites in the U.S. Hopefully, there will eventually be enough data available to make other combinations work as well.
Show all comments
    Page 1    
Keep track of the news

Stay informed with our free news services:

All news from SPIEGEL International
Twitter | RSS
All news from Europe section
RSS

SPIEGEL ONLINE 2013
All Rights Reserved
Reproduction only allowed with the permission of SPIEGELnet GmbH



From DER SPIEGEL

Franz Josef Och, 41, wants to eliminate the language barrier. Zoom
Jonathan Sprague/ DER SPIEGEL

Franz Josef Och, 41, wants to eliminate the language barrier.


European Partners
Presseurop

Politiken

Corriere della Sera

Garlasco Acquittals Overturned

Anti-Europe Fever


Facebook
Twitter