
Facebook Develops Machine Translation System for 100 Languages


Facebook has developed the first machine learning model that can translate between any two of 100 languages without going into English first.

Facebook says the new multilingual machine translation model was created to help its more than two billion users worldwide. The company is still testing the translation system -- which it calls M2M-100 -- and hopes to add it to different products in the future.

The social media service says it has made the system open source -- meaning its computer code will be freely available for others to copy or change.

Angela Fan, a research assistant at Facebook, explained the new machine translation model this week on one of the company's websites. She said its development represented a "milestone" in progress after years of "foundational work in machine translation."

Fan said the model produces better results than other machine learning systems that depend on English to help in the translation process. The other systems use it as an intermediate step -- like a bridge -- to translate between two non-English languages.

One example would be a translation from Chinese to French. Fan noted that many machine translation models begin by translating from Chinese to English first, and then from English to French. This is done "because English training data is the most widely available," she said. But such a method can lead to mistakes in translation.
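
The difference between the two methods can be sketched in a few lines of Python. This is only an illustration, not Facebook's code: the three small lookup tables are made-up stand-ins for trained translation models, and the function names are hypothetical. The example assumes a case in which two Chinese words share one English translation, so the difference in meaning is lost at the English step.

```python
# Toy lookup tables standing in for trained models (hypothetical data).
ZH_TO_EN = {"银行": "bank", "河岸": "bank"}    # money bank, river bank
EN_TO_FR = {"bank": "banque"}                  # English "bank" is ambiguous
ZH_TO_FR = {"银行": "banque", "河岸": "rive"}  # direct Chinese-to-French data


def pivot_translate(word):
    """Translate Chinese to French by way of English, the bridge language."""
    english = ZH_TO_EN[word]
    return EN_TO_FR[english]


def direct_translate(word):
    """Translate Chinese to French without going through English."""
    return ZH_TO_FR[word]


print(pivot_translate("河岸"))   # banque (the river bank became a money bank)
print(direct_translate("河岸"))  # rive
```

In this toy case both methods agree on 银行, but only the direct table keeps the meaning of 河岸. Real systems work on whole sentences, not single words, so the sketch shows the idea and nothing more.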

"Our model directly trains on Chinese to French data to better preserve meaning," Fan said. Facebook said the system outperformed English-centered systems on a widely used measure that uses data to rate the quality of machine translations.

Facebook says about two-thirds of its users communicate in a language other than English. The company already carries out an average of 20 billion translations every day on Facebook's News Feed. But it faces a huge test with many users publishing massive amounts of content in more than 160 languages.

The development team trained, or directed, the new model on a data set of 7.5 billion sentence pairs for 100 languages. In addition, the system was trained on a total of 2,200 language directions. Facebook said this is 10 times the number used in the best machine translation models in the past.
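
A little arithmetic puts the figure of 2,200 language directions in context. The sketch below assumes that a "language direction" means an ordered pair, so that Chinese to French and French to Chinese count as two directions. The function names are made up for illustration and do not come from Facebook.

```python
def all_directions(num_languages):
    """Count ordered (source, target) pairs among num_languages languages."""
    return num_languages * (num_languages - 1)


def english_centered_directions(num_languages):
    """Count directions that have English as the source or the target."""
    return 2 * (num_languages - 1)


print(all_directions(100))                   # 9900
print(english_centered_directions(100))      # 198
print(round(2200 / all_directions(100), 2))  # 0.22
```

Under that assumption, 2,200 directions would be about 22 percent of all possible pairings among 100 languages, and far more than the 198 directions that include English.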

One difficulty the team faced was trying to develop an effective machine translation system for language combinations that are not widely used. Facebook calls these "low-resource languages." The data used to create the new model was collected from content available on the internet. But there is limited internet data on low-resource languages.

To deal with this problem, Facebook said it used a method called back-translation. This method can create "synthetic translations" to increase the amount of data used to train on low-resource languages.
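
The general idea of back-translation can be sketched as follows. The story does not describe Facebook's exact steps, so this is only a rough picture of the technique: start with sentences that exist in just one language, translate them "backward" with an existing model, and pair each machine-made translation with its human-written original. The word table and the function names here are hypothetical stand-ins for a real translation model.

```python
# Toy French-to-English word table standing in for a trained reverse model.
FR_TO_EN = {"le": "the", "chat": "cat", "dort": "sleeps"}


def toy_reverse_model(sentence):
    """Translate word by word; unknown words are left unchanged."""
    return " ".join(FR_TO_EN.get(word, word) for word in sentence.split())


def back_translate(target_sentences, reverse_model):
    """Build synthetic (source, target) training pairs from target-only text."""
    return [(reverse_model(sentence), sentence) for sentence in target_sentences]


french_only = ["le chat dort"]
synthetic_pairs = back_translate(french_only, toy_reverse_model)
print(synthetic_pairs)  # [('the cat sleeps', 'le chat dort')]
```

Each synthetic pair can then be added to the data used to train a model in the forward direction, here English to French. The target side of every pair is real text written by people; only the source side is machine-made.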

For now, the company says, it plans to continue exploring new language research methods while working to improve the new model. No date has been set for launching the translation system on Facebook.

But Angela Fan said the new system marks an important step for Facebook, especially for the times we live in. "Breaking language barriers through machine language translation is one of the most important ways to bring people together, provide authoritative information on COVID-19, and keep them safe from harmful content," she said.

I'm Bryan Lynn.

Bryan Lynn wrote this story for VOA Learning English, based on reports from Facebook and Agence France-Presse. George Grow was the editor.



----------------

Words in This Story

translate – v. change written or spoken words from one language to another

code – n. a set of rules used to instruct computers how to behave or do things

milestone – n. an event that reaches never before seen levels

intermediate – adj. between two different stages in a process

preserve – v. keep something the same or prevent it from being damaged or destroyed

pair – n. two things that look the same and are used together

content – n. information contained in a piece of writing, a speech, a movie or on the internet

synthetic – adj. not made from substances or in the usual way

authoritative – adj. respected and considered to be accurate
