Quality translation and interpreting,
backed by service, technology and know-how

The Multilingual Development of the Internet

At a technical level, the Internet was not adept at supporting multiple languages initially, but the rapid growth of languages on the Internet has paved the way for a truly multilingual web. The increased use of languages online is being facilitated by swift changes that allow the use of language specific characters in all technical aspects of a website. The launch of Internationalised Domain Names (IDNs), which are domain names in local languages, was a major step forward that enabled the Internet to become truly multilingual.

Multilingual Internet

Historically, domain names contained American Standard Code for Information Interchange (ASCII) characters. This restricted the use of language specific characters and the domains had to be created  in English even when the content was available in a different language. However, two years ago, domain name addresses in local language scripts were launched by the Internet Corporation for Assigned  Names and Numbers (ICANN) to facilitate the growth of a number of various languages on the web. 

The introduction of IDNs is significant in expanding the user base of the Internet globally. The non-English speaking web population will continue to grow considerably over the next 10 years and unavailability of the Internet in local languages will hinder their regular use of the web. Countries like India and China, amongst others, have ambitious plans in place to provide online access to a large percent of its population, where the majority of their populations are non-English speakers and can only communicate in local languages. India is undergoing the process of acquiring IDN’s in seven  Indian  languages including Hindi, Bengali, Punjabi, Urdu and Tamil while a number of other countries have already acquired the IDNs that they had requested.

An internalized web population has paved the way for the on-going development of the web. IDNs will allow users to type domain names in languages that they are familiar with, and this will continue to enhance the localization of the web. For the business world, this means that it is more important than ever to research and provide localized sites, even on a technical level for higher success. The increased use of IDNs in the future will have a profound impact that reaches beyond the system that it is created for. Emerging economies are already discussing the social and economic possibilities with  the use of IDNs and how it can initiate an internet-led boom on a local level.

 

The language of research gears up for Web 3.0

There are exciting times ahead for language companies working in the market research sector as the industry gears up for Web 3.0 which uses technologies that can analyse the meaning of content and information.

Companies are increasing the number of languages they work with. Apple’s website, for example, has doubled the number of languages it supports to 24 in the past 12 months, while Facebook now supports more than 70 (up from 2 in under 2 years). It’s estimated that the global website of major multi-nationals now support on average 20 languages, up from a mere dozen, 5 years ago.

Now add the trend for the internet to better support non-latin scripts. Top-level domains for UAE, Saudi Arabia, Eqypt and Russia were the first to be introduced earlier this year. A further 18 countries are in the approval process including China, Korea and India. More are set to follow allowing researchers to recruit online from a pool of native speakers; a first in some markets.

The use of non-latin scripts in market research is likely to be a fast moving area driven in part by a local and regional demand for native versions of web pages and associated search patterns. In mid November, for example, registrations for the international Russia domain name in Cyrillic script opened and 100,000 registration requests were received in the first three hours and over half a million over the first 6 days.

The future growth in the internet points to a clear localisation bias as well as a shift away from English as the de-facto language of the web. “Right now, more than half of the content on the Internet is not in English” says John Yunker of Byte Level Research. “Ten years from now, the percentage of English content could easily drop below 25%”.

Fully-localised domain names and websites present an opportunity for companies to market and gain feedback on their products and services in new countries. Meta tags (the key identifiers for search terms embedded in web pages) will need to be translated for optimal search and localisation to make the site visible within the local country’s search results. Ben Taylor, Language Connect’s MD comments, “There’s a lot of feedback coming from the international travel, leisure and automotive sectors where the customer experience is paramount and where search terms might be very subjective or require precise localisation. Searching for feedback about experience at a particular hotel for example, might cover a dozen different terms for the words ‘bathroom’ or ‘cleanliness’. It doesn’t just apply to large corporates, SMEs use multilingual searching to extend their international footprint very cost effectively and achieve high search engine rankings in less competitive markets”.

So is the demand for instant, multilingual feedback going to overwhelm the research world? Will quality be substituted for quantity as people turn to machine translation software and web scraping tools?

“There is certainly a strong demand for open ended coding and analysis in the original language”, says Ben. “At the same time there is a strong requirement for really high quality interpreting services. Clients want specialist language skills, like the ability to use the same vocabulary as marketing-savvy teenagers, or the language of medical staff when working with sensitive topics with a patient in hospital. And it’s clear that technology is having a rapid impact on changing research methodologies; clients can offset the cost saving they achieve using in-language coding (saving on back translation costs) and invest any remaining budget in qualitative depth probe interviews which amplify the research findings”.

Web 3.0 is beginning to have a major impact on the structure of the market research and insight industry. There’s increasing demand from international companies to synthesise large volumes of search and research information across different languages and cultures, using technologies such as web scraping techniques to bring all the information together.

Language Connect will be discussing these and similar opportunities at the Insight exhibition in London in June 2011. Please contact us beforehand if you would like to discuss any commercial partnerships in advance.

www.languageconnect.net

Can you afford not to go global?

According to a recent report by IMRG Capgemini, the UK is the top ranking net exporter of e-commerce in the world- bringing in £2.80 in exports for every £1 imported. Hardly surprising when recent research shows that internet sales increased by 24% in the course of the last year alone; massive returns which reflect the growing trend to take business online.

The internet has earned its place as an intrinsic part of the economy according to Matt Brittin, the Managing Director of Google for the UK and Ireland, who commented-

“Now for the first time we can see how its adoption by British business has become a major contributor to the UK’s GDP and that the internet is a central pillar of the UK’s economy,”

These comments are encouraging, but they should also be a cause for more serious reflection. Industry cannot afford to ignore the fact that the internet now underpins the UK’s economy in a vital way; outperforming sectors such as restaurants, utilities, construction, and transportation. Another report by the Boston Consulting Group shows that companies selling online saw a yearly growth rate of 4% while those who did not offer a transactional service remained static or grew just a little. Now more than ever, businesses need to invest resources in growing an online strategy before they are overtaken by younger companies who were quicker to take the plunge.

So how are businesses making the move online so successfully? Well, they’re building powerful brand identities with the use of social media. 30% of UK companies now have a social media page and 25% have corporate blogs or use Twitter to update their customers.

In addition, let’s not forget that the internet is global. Out of the 100 million additional internet users each year, 72% don’t speak English- so it’s not just a question of an online presence, businesses are increasingly waking up to the fact that they need to speak the language of their target markets too. The average number of languages supported by large multi nationals has increased from 12 to 20 in 5 years. Facebook, for example now supports over 70 languages compared with just two languages when the site launched. Apple has doubled the number of languages available on their website in the last year- from 12 to 24.

Choosing which languages to invest in is also an important consideration. 28% of internet users are English speaking, which is a substantial figure, yes- but did you know that 22% of internet users speak Chinese? A massive and largely untapped market surely exists here for UK businesses.

Plus, let’s not forget basic accessibility, currently only 29% of UK businesses accept online payments, which seems low when we know that 30% have a social media page. With the profits already garnered from e-commerce being so high this shows amazing potential for the future as the number of sites offering an online checkout increases.

In the current economic climate e-commerce is proving to be a real beacon of hope, showing massive growth levels, and no signs of dying down for the foreseeable future as internet businesses are likely to double their economic contribution in the next five years.

Sources:

Boston Consulting Group: The Connected Kingdom, How the internet is transforming the UK economy: http://www.bcg.com/documents/file62983.pdf

Yunker, John: The Art of the Global Gateway, Strategies for Successful Multilingual Navigation, Byte Level Books, One internet, many languages, Page 10

IMRG/Capgemini Sales Index: October 2010

Net gets set for alphabet changes

Users of scripts other than that in which English is written will soon have web addresses in their own language.

The net regulator Icann has invited countries to ask for “internationalised domain names” in non-Latin characters.

Egypt and Saudi Arabia have announced their intentions to apply for the first Arabic domains.

Countries can also apply for domains in other scripts, such as Chinese. The first official international web addresses are expected in 2010.

“The IDN [International Domain Names] program will encompass close to one hundred thousand characters, opening up the internet to billions of potential users around the globe,” said Peter Dengate Thrush, chairman of the Internet Corporation for Assigned Names and Numbers (Icann).

The regulator has described the introduction of IDNs as the “biggest change” to the net “since it was invented 40 years ago”.

“Over half the internet users around the world don’t use a Latin-based script as their native language,” said Icann president Rod Beckstrom.

“IDNs are about making the internet more global and accessible for everyone.”

He said Icann had “already received six applications from around the world for three different scripts.

These included a bid from Egypt’s bid for .misr, meaning Egypt in Arabic, and one from Russia.

Plans for IDNs were approved at a meeting in June 2008. However, it has taken until now to make sure that the translation system needed to make it work functions correctly.

The changes have been made to the net’s Domain Name System, which acts like a phone book, translating easily understood domain names such as bbc.co.uk into strings of computer readable numbers known as IP addresses.

The tweaks will allow this system to recognise and translate the non-Latin characters.

Some countries, such as China and Thailand, have already introduced workarounds that allow computer users to enter web addresses in their own language.

However, these were not internationally approved and do not necessarily work on all computers.

Icann has now invited registrations from countries to apply for internationalised country codes, such as .uk or .us.

It said that people will be able to apply for an entire web address at a later date through the body which wins the right to control a nation’s internationalised country-code.

BBC News

Internet addresses set for change

The internet regulator has approved plans to allow non-Latin-script web addresses, in a move that is set to transform the online world.

The board of Icann voted at its annual meeting in Seoul to allow domain names in Arabic, Chinese and other scripts.

More than half of the 1.6 billion people who use the internet speak languages with non-Latin scripts.

It is being described as the biggest change to the way the internet works since it was created 40 years ago.

The first Internationalised Domain Names (IDNs) could be in use next year.

Plans for IDNs were first approved at a meeting in June 2008, but testing of the system has been going on for two years.

Technical upheaval

The move paves the way for the internet’s Domain Name System (DNS) to be changed so it can recognise and translate non-Latin characters.

The DNS acts like a phonebook, turning easily understood domain names into strings of computer-readable numbers, known as Internet Protocol (IP) addresses.

The Internet Corporation for Assigned Names and Numbers (Icann) said the “fantastically complicated technical feature” allowing IDNs would represent the “biggest change” to the coding that underlies the internet since it was invented four decades ago.

BBC technology correspondent Mark Gregory says in the early days of the internet, language posed no problem, as most web-surfers spoke English and those that did not usually wrote in languages based on the Latin alphabet.

But this is no longer true, adds our correspondent.

Icann said it would accept the first applications for IDNs by 16 November, with the first up and running by “mid-2010″.

It is likely the majority of early non-Latin net addresses to be approved will be in Chinese and Arabic script, followed by Russian.

Some countries, such as China and Thailand, have already introduced workarounds that allow computer users to enter web addresses in their own language.

However, these were not internationally approved and do not work on all computers.

Autonomy

Our correspondent says the point of the Icann vote was to create a universal internet address code that will work in any language and every place so all the world’s computers can connect with each other.

“Of the 1.6 billion internet users today worldwide, more than half use languages that have scripts that are not Latin-based,” said Icann president and CEO Rod Beckstrom earlier this week.

“So this change is very much necessary for not only half the world’s internet users today but more than half, probably, of the future users as the internet continues to spread.”

Icann, set up by the US government, was founded in 1998 to oversee the development of the net.

Last month, after years of criticism, the US government eased its control over the non-profit body.

It signed a new agreement that gave Icann autonomy for the first time. The agreement came into effect on 1 October and puts it under the scrutiny of the global “internet community”.

 BBC News

Tech giants rush Farsi versions

Search giant Google has stepped up work to release a tool that will translate Farsi into English and vice-versa.

The company told the BBC it was speeding up the project because of the huge interest in what was currently going on in Iran.

At the same time, the world’s biggest social networking site, Facebook, is launching a Persian version.

Both companies say they hope their efforts will improve access to information and communication.

“There is a huge amount of interest in events in Iran and people want to know what is going on inside and outside the country,” Google’s principal scientist, Franz Och, told the BBC.

“Providing access to information across language boundaries should be very helpful. It’s one more tool that Persian speakers can use to communicate directly to the world and vice-versa.”

Facebook announced that it had also accelerated work on its Farsi translation.

In a blog posting, the company noted that people around the world had been using Facebook to exchange information about the aftermath of the Iranian election, but that most of that had not been in English.

“Much of the content created and shared on Facebook related to these events has been in Persian – the native language of Iran – but the users have had to navigate the site in English or other languages,” it said.

As a result, Facebook has now made the entire site available in a test version of Persian, “so Persian speakers inside of Iran and around the world can begin using it in their native language”.

Challenges

Both companies stress privately that there is no political motivation in what they are doing, and that their main goal is to facilitate communication and the flow of information.

Google’s stated mission is to organise the world’s information and make it universally accessible and useful. To that end, it is already available in 41 other languages from Arabic to Spanish and from Danish to Vietnamese.

Message in Farsi saying access blocked, 25 May, 2009

Native speakers helped Facebook with its Farsi translation

Mr Och said he and his team had been working hard over the past few days to rush the Farsi service out. As a result, it was about 80% perfect.

“This is done using machine translation which, in general, is not as good as human translation and so, for some languages, the quality is mixed,” explained Mr Och.

“With Spanish to English we get a nice translation, but for others it’s much harder because we don’t have so much data from which we can build our systems.

“Farsi is one of those where the translation is not as good but we hope to make it better relatively soon.”

Likewise Facebook has said its translated site is not a polished product but “we felt it was important to help people communicate rather than wait”.

However the company said a lot of its success in being able to go live with what it has done is down to the more than 400 Persian speakers who submitted thousands of individual translations of the site.

Both companies are looking to perfect their work and said they welcomed help from Persian speakers.

Milestone

Much has been written about the role of the internet and social networking sites in spreading news, video, pictures and information about the post-election chaos.

YouTube

YouTube is showing video from the streets of Tehran

These latest moves to provide Farsi translations have been praised by the Personal Democracy Forum, which looks at how technological advances and global internet trends are reinventing politics, democracy, society and government.

“The ability to translate the information flow that is going on at the moment into Farsi will benefit everybody on both sides of this battle,” the Forum’s co-founder, Andrew Rasiej, told the BBC.

“What these efforts do is add to the bigger idea which is that as people are more and more connected, the basic human right of free speech spreads with it.

“This is another milestone in allowing human beings to communicate with each other and break down the barriers that might have prevented them from understanding each other due to language,” said Mr Rasiej.

“We hope we will have a positive impact,” said Google’s Mr Och.

BBC News

Found in translation: China’s volunteer online army

SHANGHAI, China – On Saturday at 10 a.m. it’s show time for Brenda Zhang and her subtitle team. They roll out of bed, meet each other online and chat, while their modems download the latest episode of “Prison Break,” which just aired half a world away on Friday night in America.

Chinese netizens have a growing appetite for online English content.

Once they have the show on their hard drives, the team spends the rest of the day creating subtitles for it in Chinese before putting it back online for other fans to watch.

Dozens of such groups exist in China. They are voluntary and are translating a mix of media, from books and magazines to games, TV shows and movies. The translated products are for an audience whose primary means of accessing foreign entertainment is the Internet.

The members of these online translations groups participate out of a desire to improve their English. For many there is also a passionate interest in overseas content and a desire to make it accessible to other Chinese people.

“This is a way to fulfill your life and do something you are interested in,” said Zhang, a 24-year-old who translates for a team that calls themselves “Showfa.”

“I think Chinese people need to know something different, to see how the foreigners think about life, think about love.”

Aside from the international content made available on the Web, largely by the translation groups, people in China have fewer opportunities to legally watch imported entertainment than in many developed countries.

While the viewing selections have been growing in recent years, Beijing still issues quotas for the number of international films allowed in cinemas each year, while the limited amount of content permitted to air on state-run TV is usually censored, poorly dubbed and not popular anymore.

Translation team members download TV shows through different methods: watching directly from a Web site or downloading from a translation group Web site. Clips can also be downloaded through BitTorrent — file-sharing sites such as Xunlei. Through this method, groups post their “seeds” on these sites, and people can download them.

The translation teams acknowledge that what they are doing is less than legal and say they do worry that someday they may be forced to quit (so far there are no reports of a group being shut down).

“We are living in this grey zone,” said Deping Wang, a former member of a team called 1000fr. “It is not legal, but at the same time, nobody can live without it. This is the dilemma.”

Many international media companies are concerned the groups are eroding their potential profit margins in China by illegally making the content available for free. Some, however, see the translation groups’ efforts as a conduit for cultivating a market that will be willing to buy content if it is allowed to be distributed through official avenues in the future.

“While there is still an issue on the translation right, we also take a broader view to look at its impacts on other parties such as it’s providing a platform for a Chinese audience who would otherwise not be able to understand or access The Economist content,” Henry Luk, Asia-Pacific regional director for The Economist Group, told CNN via e-mail.

Since 2006, a group calling itself the Eco Team has translated more than 8,000 articles from British newsmagazine The Economist. Members collaborate with the translations, posting articles on their message board to be collectively copy-edited. The final versions are published on Eco Weekly a biweekly publication that can be downloaded and printed at no cost. The Eco Team Web site has more than 60,000 registered users, according to the group.

“We want this magazine to be published in Chinese,” said translator Jerry Bai at a recent party held by some of the Eco Team members in Shanghai. “This is our only one wish.”

However, beyond the copyright issues that come up around the subtitle squads, Bingchun Meng, a media lecturer at the London School of Economics who is studying the teams, sees them as an opportunity to understand the social dynamics of virtual groups.

“These are fascinating communities,” Meng said. “You think that everything online is more egalitarian and anti-establishment, but what we see is the emergence of another kind of operation of power, but maybe the power doesn’t come from the traditional source but from different venues.”

Virtual communities

The translation teams consist of loosely structured yet highly coordinated, even hierarchical, virtual communities often consisting of hundreds of participants around the world. While some members occasionally meet each other offline, they mostly communicate through email, online forums or via instant messaging services, like QQ and MSN Messenger.

Within each team, there are leaders responsible for organizing the translation efforts of various sub-groups assigned to different TV shows and movies in English and also other languages such as German and Japanese. Leaders select the shows to subtitle by taking votes from Chinese “netizens” about the programs they want to see.

More experienced translators are given so-called “zero day” shows — highly popular programs, like “Prison Break,” that are recorded and uploaded by someone living abroad to a special FTP site immediately after the program. Meanwhile, back in China, a team is on standby, ready to download and subtitle the show as fast as they can for Chinese fans to watch online — and to see if they can beat rival translation gangs.

Competition can be fierce. To get ahead, some teams route content through servers in Thailand and Hong Kong to circumvent slow speeds that can stymie downloads between the United States and China.

“It can buy 10 minutes, and, you know, 10 minutes means a lot of things,” said Wang, the former member of 1000fr, one of the first translation teams in China.

There are around a half-dozen other subtitle groups that are well known, including YDY, or Yidianyuan (Garden of Eden), 1000fr’s main rival.

“These two groups fight together to see who is the fastest to translate into Chinese,” Wang said. “Two hours was one of our record translation times.”

Before someone can be a translator, he or she must first go through an application process and take a language test. Wang said she had to translate 300 words in 15 minutes to become part of 1000fr.

Teams also often have entire human resources departments responsible for recruiting new members, and there is never a shortage of applicants.

“A lot of people apply. A lot,” said Zhang, who is responsible for bringing in new recruits for Showfa’s Japanese subgroup. “There are a lot of TV series, and we need a lot of human resources.”

But just as quickly as new members come, they also go. Many of the translators are college students with plenty of time on their hands, often quitting once they graduate and have to get a job.

Zhang finished school several years ago but still manages to find time to spend with her subtitle group who she says are also her best friends. They also meet online every Tuesday morning to translate another popular American show, “24″. Yet now on Saturdays, Zhang and the translators will have to come up with a different routine. This season of Prison Break was its last.

CNN

Microsoft on Bing in China

To the Editor:

A June 8 online letter about Microsoft’s new search engine, Bing, suggests a negative connotation with the Bing brand in China. It is because we were aware of this one possible interpretation that we chose to select a Chinese character to pair up with Bing to form the brand.

This approach enables us to ensure appropriate pronunciation for Bing in China. Our new brand in China is pronounced “bee-ying,” its meaning derived from the last two characters of a Chinese proverb meaning “ask and you shall find.” The direct translation for “Bi-ying” is “certain to respond or ready to answer.”

Developing a global brand is no small feat. Bing is available in 53 countries and 34 languages. We underwent a comprehensive process to arrive at a Chinese brand for Bing, including customer research and consultation with Chinese-language specialists. We think we arrived at a good result.

Yusuf Mehdi
Senior Vice President
Microsoft
Redmond, Wash., June 8, 2009

NY Times

May 2012
M T W T F S S
« Apr    
 123456
78910111213
14151617181920
21222324252627
28293031