Author Archives: Google Translate Official Blogger

Ten years of Google Translate

Ten years ago, we launched Google Translate. Our goal was to break language barriers and to make the world more accessible. Since then we’ve grown from supporting two languages to 103, and from hundreds of users to hundreds of millions. And just like anyone’s first 10 years, we’ve learned to see and understand, talk, listen, have a conversation, write, and lean on friends for help.

But what we're most inspired by is how Google Translate connects people in communities around the world, in ways we never could have imagined—like two farmers with a shared passion for tomato farming, a couple discovering they're pregnant in a foreign country, and a young immigrant on his way to soccer stardom.

Here’s a look at Google Translate today, 10 years in:

1. Google Translate helps people make connections.
Translate can help people help each other, often in the most difficult of times. Recently we visited a community in Canada that is using Translate to break down barriers and make a refugee family feel more welcome:

2. There are more than 500 million of you using Google Translate.
The most common translations are between English and Spanish, Arabic, Russian, Portuguese and Indonesian.

3. Together we translate more than 100 billion words a day.

4. Translations reflect trends and events.
In addition to common phrases like “I love you,” we also see people looking for translations related to current events and trends. For instance, last year we saw a big spike in translations for the word "selfie,” and this past week, translations for "purple rain" spiked by more than 25,000 percent.

 5. You’re helping to make Google Translate better with Translate Community.
So far, 3.5 million people have made 90 million contributions through Translate Community, helping us improve and add new languages to Google Translate. A few properly translated sentences can make a huge difference when faced with a foreign language or country. By reviewing, validating and recommending translations, we’re able to improve the Google Translate on a daily basis.

6. Brazil uses Google Translate more than any other country.
Ninety-two percent of our translations come from outside of the United States, with Brazil topping the list.

7. You can see the world in your language.
Word Lens is your friend when reading menus, street signs and more. This feature in the Google Translate App lets you instantly see translations in 28 languages.

8. You can have a conversation no matter what language you speak.
In 2011, we first introduced the ability to have a bilingual conversation on Google Translate. The app will recognize which language is being spoken when you’re talking with someone, allowing you to have a natural conversation in 32 languages.

9. You don't need an Internet connection to connect.
Many countries don’t have reliable Internet, so it’s important to be able to translate on the go. You can instantly translate signs and menus offline with Word Lens on both Android and iOS, and translate typed text offline with Android.

10. There's always more to translate.
We’re excited and proud of what we’ve accomplished together over the last 10 years—but there’s lots more to do to break language barriers and help people communicate no matter where they’re from or what language they speak. Thank you for using Google Translate—here’s to another 10!

Translate Community, and Sietske Poepjes, help add Frisian to Google Translate

Last week, we introduced 13 new languages to Google Translate. As we mentioned, there are a number of factors that go into adding a new language: once it’s established that it’s a written language with a significant amount of translations available on the web, we use a combination of machine learning, licensed content and input from Translate Community. One language where Translate Community played an especially big role was Frisian.

Today, we’re speaking with Sietske Poepjes, a member of the Frisian community who recently helped to organize a Translate Community event. Thanks to Sietske and the community's support, we were able to get enough data to officially add Frisian to Google Translate. The interview gives an overview of what went into Sietske’s community effort and how you can get involved.

What’s your job and title? I'm a provincial deputy on language and education in the province of Fryslân. Fryslân is one of the twelve provinces in The Netherlands. It’s the only province with an official language other than Dutch, namely: Frisian.

Sietske Poepjes
Why is it important to add the Frisian language to Google Translate? Frisian is the second official language in the Netherlands and is the mother language of more than half of the population of the province of Fryslân. Most people in Fryslân are able to understand or speak the language, but not a lot of people are able to write it at a high level (only 15%). With most of our education taking place in Dutch, Google Translate can be used as a tool for those who can’t write the Frisian language sufficiently.

Besides, it’s important for lesser used languages like Frisian to be used digitally. Being one of the languages in Google Translate also enhances the visibility of the language and allows people throughout the world to translate to and from the language.

So you organized Frisian Google Translate Week last year -- what motivated you to organize the event?To add Frisian to Google Translate, we knew we needed a lot of data. Since there wasn’t sufficient material available both in the Frisian and English language, Translate Community could help. To get everyone involved, the province of Fryslân decided to organise a central week in which everyone is asked to participate. The idea was that working together in the same week, we would motivate people to contribute even more. This has definitely paid out; thousands of people participated, resulting in nearly a million translated words!

Translate Community event. 
How did you set up the event? Who did you work with?As the province of Fryslân, we coordinated the event with educational and scientific organisations and libraries, and received lots of support. The organisations invited employees, members, students or other interested people to come along and translate with them on a certain day/time. This turned out to work really well. One example: the Frisian department of the University of Groningen (which is outside of Fryslân) organised a reunion with former students and teachers and together they translated thousands of words.

We also organised an opening session which was the official start of the entire week and invited school children to help open the festival with their own song. At this opening session an introduction to Google Translate presentation was given by a Google Netherlands representative.

We provided hand-outs and made a training video on YouTube to guide people on how to navigate on Translate Community site and make contributions. We also created a commercial video (with famous Frisians), broadcast on the regional television “Omrop Fryslân”, which turned out to be very influential.

How did you motivate the participants?We relied on social media. We created our own Facebook event for people to join and asked all of the organisations involved to use their own social media to share our messages and calls for everyone to contribute. Participants could make a screenshot showing their number of contributions made, to share on Facebook. We gave the participant with the most contributions a Google Translate cake.

How did the Frisian week go? Any memorable moments?
The Frisian Google Translate Week became a huge success. A lot of Frisians participated, resulting in a whopping one million translated words at the end of the week. The total number of translations was revealed at a national festival for languages. It was amazing to see the amount of publicity we gained and to see that so many people were interested in our event. It was even broadcast in the Dutch news at prime time.

What’s the impact of the event? What are people’s reactions?
We saw the need for Google Translate, as we received a lot of feedback and questions from people who wanted to know when Frisian would be available in Google Translate.

The most impressive thing of the whole Frisian Google Translate Week is the commitment of all Frisians (in and outside the province of Fryslân). So many people participated and everyone felt the need to join and start translating. The Frisian community worked together to achieve a goal.

Have you worked on any follow up efforts?
Yes! We have organized a validation session. In this session, we have reached Frisian experts and gathered in the local provincial library to work together on validating the translations. It worked out really well, again the sense of community was very strong. And to thank everyone who participated in the Frisian Google Translate Week and the validation session, we organized a celebration party. It was a really nice party with a spectacular multilingual musical performance from the Frisian band ‘De Kast’. Their number-1 hit “De nije dei” (The new day) was performed with the lyrics in Dutch translated by Google Translate in the background. There were also secondary school pupils showing the use and work of Frisian in Google Translate to all guests.

The band De Kast, who performed songs in Frisian, Dutch and English with the meaning shown in other languages on the screen through Translate.
Any advice for future event organizers? We think it is very important to communicate about the value of Google Translate for the language, it improves the visibility of the language, and it offers speakers of the language a very helpful and easy-to-use digital tool. It shows the vitality of the language, which is especially important for small languages such as Frisian. Second, we think it is important to cooperate with the local organisations for a sense of community. And last but not least: we have made a nice event of it. It was great fun!

If you would like to help improve Google Translate and organize Translate Community events for your language, apply here.

Posted by Mengmeng Niu, Program Manager, Google Translate

From Amharic to Xhosa, introducing Translate in 13 new languages — now over 100 in total!

In 2006, we started with machine learning-based translations between English and Arabic, Chinese and Russian. Almost 10 years later, with today’s update, we now offer 103 languages that cover 99% of the online population.

The 13 new languages — Amharic, Corsican, Frisian, Kyrgyz, Hawaiian, Kurdish (Kurmanji), Luxembourgish, Samoan, Scots Gaelic, Shona, Sindhi, Pashto and Xhosa — help bring a combined 120 million new people to the billions who can already communicate with Translate all over the world.

So what goes into adding a new language? Beyond the basic criteria that it must be a written language, we also need a significant amount of translations in the new language to be available on the web. From there, we use a combination of machine learning, licensed content and Translate Community.

As we scan the Web for billions of already translated texts, we use machine learning to identify statistical patterns at enormous scale, so our machines can "learn" the language. But, as already existing documents can’t cover the breadth of a language, we also rely on people like you in Translate Community to help improve current Google Translate languages and add new ones, like Frisian and Kyrgyz. So far, over 3 million people have contributed approximately 200 million translated words.
Before you dive into translating, here are a few fun facts about the new languages:

  • Amharic (Ethiopia) is the second most widely spoken Semitic language after Arabic 
  • Corsican (Island of Corsica, France) is closely related to Italian and was Napoleon's first language 
  • Frisian (Netherlands and Germany) is the native language of over half the inhabitants of the Friesland province of the Netherlands 
  • Kyrgyz (Kyrgyzstan) is the language of the Epic of Manas, which is 20x longer than the Iliad and the Odyssey put together 
  • Hawaiian (Hawaii) has lent several words to the English language, such as ukulele and wiki
  • Kurdish (Kurmanji) (Turkey, Iraq, Iran and Syria) is written with Latin letters while the others two varieties of Kurdish are written with Arabic script 
  • Luxembourgish (Luxembourg) completes the list of official EU languages Translate covers
  • Samoan (Samoa and American Samoa) is written using only 14 letters 
  • Scots Gaelic (Scottish highlands, UK) was introduced by Irish settlers in the 4th century AD
  • Shona (Zimbabwe) is the most widely spoken of the hundreds of languages in the Bantu family
  • Sindhi (Pakistan and India) was the native language of Muhammad Ali Jinnah, the "Father of the Nation” of Pakistan 
  • Pashto (Afghanistan and Pakistan) is written in Perso-Arabic script with an additional 12 letters, for a total of 44 
  • Xhosa (South Africa) is the second most common native language in the country after Afrikaans and features three kinds of clicks, represented by the letters x, q and c
We’ve come a long way with over 100 languages, but we aren’t done yet. If you want to help, International Mother Language day — just around the corner on February 21 — is a great time to get involved in Translate Community. To start, just select the languages you speak; then choose to either translate phrases on your own or validate existing translations. Every contribution helps improve the quality of translation over time. You can also share feedback directly from Translate.Google.com, so as you try out the new languages, we’d love to hear your suggestions.

For each new language, we make our translations better over time, both by improving our algorithms and systems and by learning from your translations with Translate Community. Today's update will be rolling out over the coming days.

No matter what language you speak, we hope today’s update makes it easier to communicate with millions of new friends and break language barriers one conversation at a time.

Translate Community: Over one million people and 50 million contributions

Over the past year, more than one million people speaking 117 languages have made 50 million contributions through the Google Translate Community.

With those contributions we’ve launched 10 new languages, including Chichewa (Chinyanja) and Malayalam (മലയാളം), and been able to make improvements in how we speak dozens of other languages. Now almost 50% of the most common phrases typed in Google Translate come from translations provided by the Translate Community.

unnamed.png

Translate Community members come from all over the world and translate in many different ways - from translating on their own to hosting group events. This year, Bengali speakers worked together to host events throughout the country by partnering with schools and cultural groups. And Frisian speakers worked with their government to create a week of events dedicated to getting their language added to Google Translate.

This month, language lovers are participating in a Translatathon in India. With just a few more days to go, if you speak Hindi, Bengali, Telugu, Marathi, Tamil, Gujarati, Kannada, Malayalam or Punjabi, sign up today to help Google Translate deliver better translations in your community.

India’s second Translatathon needs you!

Sometimes language isn’t straightforward. Only a Hindi speaker could tell you that although ऊँट के मुँह में जीरा may literally mean ‘cumin seed in a camel's mouth’, it actually means ‘a drop in the ocean’ or something too insufficient to fulfill a need.

There are 22 official languages in India. And while Google Translate can help you with nine of them at the moment, languages that are under-represented on the Internet like Bengali, Telugu, and Tamil could use a little help. This is where people who are passionate about their native languages can use the Translate Community tool to make a big difference.

We’ve just kicked off our second translatathon in India, this time for nine languages — Hindi, Bengali, Telugu, Marathi, Tamil, Gujarati, Kannada, Malayalam and Punjabi. You can use Google Translate Community on your phone, laptop or computer. Just type, swipe or tap translations in the languages you speak. You have the option to either translate phrases directly, or validate existing translations.

Last year 20,000 people contributed over one million new Hindi translations, helping improve the overall quality of Hindi content online. We’re now including all the Indic languages that Google Translate is available in, and we look forward to seeing how people from around the world can help Google say जंगल में मोर नाचा किस ने देखा? or আপনার পায়ে কুড়ল মারা more accurately. Millions of people in India are coming online for the first time and most of them don’t speak English. Bringing more Indian language content online, and improving Indian language translation quality, will help them have a better experience on the Web. 

Validate phrases with the Google Translate Community tool














Once you join the translatathon, you can Translate and validate words and short phrases up until December 30. We will then reward the 50 most active and accurate contributors with an Android One phone*.

So why not stop by and say नमस्ते, নমস্কার, வணக்கம் and help India showcase the beauty and diversity of languages online. Register and participate at g.co/translatathon and thanks in advance for your help. You’re making the web better for everyone.

*Terms and conditions apply: https://goo.gl/kEAehI
Posted by Barak Turovsky, Product Lead, Google Translate

Fútbol, translated

We’re always amazed by the power of technology to connect people. Not long ago we heard a story involving the Google Translate app and a boy named Alberto who had recently moved from Spain to a small town in Northern Ireland, with little knowledge of English. When Alberto joined Portadown’s youth soccer club, his coaches Gary and Glen turned to Google Translate to communicate with Alberto and his mother, on and off the field. As they progressed from protección de la pelota to retroceso de bicicleta, Alberto grew to feel a part of the team. We loved this story (and wanted to share it with you) because what Gary and Glen did was so much bigger than translating sentences from one language into another. They didn’t just find a way to coach Alberto in football—they found a way to invite someone who was on the outside into their community.

¡Vamos, Alberto!

Two new Translate features coming your way

We’re all about breaking language barriers, whatever language you speak or device you use. So with that in mind, over the next week, we’ll be rolling out two new Google Translate app features— instantly translating both English and German to Arabic and easier multitasking for iPad users.

See the world in a new language with instant visual translation 
You can already have bilingual conversations from English or German to Arabic thanks to the conversation mode or text input in the Google Translate app. Today, we’re also adding the ability to translate printed text instantly between these languages.

To use instant visual translation, just open the app, click on the camera, and point it at the text you need to translate. You’ll see the text transform from one language to another in real-time on your screen. And the best part? There’s no Internet connection or cell phone data needed.
To try out Arabic with either English or German you'll be prompted to download a small (~2 MB) language pack.

Split View translations with the newest iPads
Starting today, customers using iPads supporting Split View will be able to use Google Translate along with the new feature. So if you’re sending an email or text and need to translate, you can see both apps at the same time. And it even works with text from online books or websites.
Whether you’re starting a new bi-lingual conversation on your iPad or using instant visual translation to find your way, Google Translate helps you see the world in your language. With today’s updates, we hope that we’re able to continue to help and give more translation options to the 500 million people using Google Translate to see over 100 billion words a day in their language.

Posted by Barak Turovsky, Product Lead, Google Translate

Translate text within apps thanks to the latest Android update

We face communication barriers every day. Switching back and forth between apps and screens to translate shouldn’t be another one. We’ve heard your feedback, and have worked with the Android team to make translating text, chats, and other app content a whole lot easier.

Beginning this week, you’ll be able to translate in 90 languages right from within some of your favorite apps like TripAdvisor, WhatsApp and LinkedIn.
Translating a TripAdvisor review from Portuguese
Composing a WhatsApp message in Russian 

This update works on any device running the newest version of Android’s operating system (Android 6.0, Marshmallow). To get started, you first need to have the Translate app downloaded on your Android phone. From there, just go to an app, like TripAdvisor or LinkedIn, and highlight and select the text you want to translate. This feature is already enabled in apps that use Android text selection behavior. Developers who created custom text selection behavior can also easily add the new feature.

More than 500 million people translate over 100 billion words a day on Google Translate. With updates like this one, plus features like conversation mode and instant camera translation, we’re making Translate available anywhere you need it. So when you’re chatting with a new colleague from halfway around the world, conversation mode is perfect. Wondering which subway sign says “exit” on your next global adventure? Instant camera translation has your back. And now, when you’re sending messages or checking out reviews on your phone, you can translate right from within the apps you’re using.

Posted by Barak Turovsky, Product Lead, Google Translate

Watch your language! 44 of them, actually.

More than 500 million people use Google Translate every month across web and mobile phones, translating more than 100 billion words every day around the globe. Now, we’re launching Google Translate on all Android Wear watches, too.

Translate is built into the latest Android Wear software update, so you can have bilingual conversations even if you don’t have Google Translate on your phone, or if you’re away from your phone but connected via Wi-Fi.

And it’s easy to use - just speak into your watch to see your conversation translated into any of 44 languages. Flip your wrist to show the translation to a friend. When they respond in their own language, flip your wrist back, and you’ll see in your language what they’ve just said. Google Translate will automatically recognize which of the two languages is being spoken, so once you tap to start the conversation, all you and your buddy need to do is keep talking naturally.
Google Translate covers 90 languages total (for text translation), and we are always working to expand the number of languages that work across various features.

How Google Translate squeezes deep learning onto a phone

Today we announced that the Google Translate app now does real-time visual translation of 20 more languages. So the next time you’re in Prague and can’t read a menu, we’ve got your back. But how are we able to recognize these new languages?

In short: deep neural nets. When the Word Lens team joined Google, we were excited for the opportunity to work with some of the leading researchers in deep learning. Neural nets have gotten a lot of attention in the last few years because they’ve set all kinds of records in image recognition. Five years ago, if you gave a computer an image of a cat or a dog, it had trouble telling which was which. Thanks to convolutional neural networks, not only can computers tell the difference between cats and dogs, they can even recognize different breeds of dogs. Yes, they’re good for more than just trippy art—if you're translating a foreign menu or sign with the latest version of Google's Translate app, you're now using a deep neural net. And the amazing part is it can all work on your phone, without an Internet connection. Here’s how.

Step by step
First, when a camera image comes in, the Google Translate app has to find the letters in the picture. It needs to weed out background objects like trees or cars, and pick up on the words we want translated. It looks at blobs of pixels that have similar color to each other that are also near other similar blobs of pixels. Those are possibly letters, and if they’re near each other, that makes a continuous line we should read.
Second, Translate has to recognize what each letter actually is. This is where deep learning comes in. We use a convolutional neural network, training it on letters and non-letters so it can learn what different letters look like.

But interestingly, if we train just on very “clean”-looking letters, we risk not understanding what real-life letters look like. Letters out in the real world are marred by reflections, dirt, smudges, and all kinds of weirdness. So we built our letter generator to create all kinds of fake “dirt” to convincingly mimic the noisiness of the real world—fake reflections, fake smudges, fake weirdness all around.

Why not just train on real-life photos of letters? Well, it’s tough to find enough examples in all the languages we need, and it’s harder to maintain the fine control over what examples we use when we’re aiming to train a really efficient, compact neural network. So it’s more effective to simulate the dirt.
Some of the “dirty” letters we use for training. Dirt, highlights, and rotation, but not too much because we don’t want to confuse our neural net.

The third step is to take those recognized letters, and look them up in a dictionary to get translations. Since every previous step could have failed in some way, the dictionary lookup needs to be approximate. That way, if we read an ‘S’ as a ‘5’, we’ll still be able to find the word ‘5uper’.

Finally, we render the translation on top of the original words in the same style as the original. We can do this because we’ve already found and read the letters in the image, so we know exactly where they are. We can look at the colors surrounding the letters and use that to erase the original letters. And then we can draw the translation on top using the original foreground color.

Crunching it down for mobile
Now, if we could do this visual translation in our data centers, it wouldn’t be too hard. But a lot of our users, especially those getting online for the very first time, have slow or intermittent network connections and smartphones starved for computing power. These low-end phones can be about 50 times slower than a good laptop—and a good laptop is already much slower than the data centers that typically run our image recognition systems. So how do we get visual translation on these phones, with no connection to the cloud, translating in real-time as the camera moves around?

We needed to develop a very small neural net, and put severe limits on how much we tried to teach it—in essence, put an upper bound on the density of information it handles. The challenge here was in creating the most effective training data. Since we’re generating our own training data, we put a lot of effort into including just the right data and nothing more. For instance, we want to be able to recognize a letter with a small amount of rotation, but not too much. If we overdo the rotation, the neural network will use too much of its information density on unimportant things. So we put effort into making tools that would give us a fast iteration time and good visualizations. Inside of a few minutes, we can change the algorithms for generating training data, generate it, retrain, and visualize. From there we can look at what kind of letters are failing and why. At one point, we were warping our training data too much, and ‘$’ started to be recognized as ‘S’. We were able to quickly identify that and adjust the warping parameters to fix the problem. It was like trying to paint a picture of letters that you’d see in real life with all their imperfections painted just perfectly.

To achieve real-time, we also heavily optimized and hand-tuned the math operations. That meant using the mobile processor’s SIMD instructions and tuning things like matrix multiplies to fit processing into all levels of cache memory.

In the end, we were able to get our networks to give us significantly better results while running about as fast as our old system—great for translating what you see around you on the fly. Sometimes new technology can seem very abstract, and it's not always obvious what the applications for things like convolutional neural nets could be. We think breaking down language barriers is one great use.