Wikipedia and the Internet language chasm

When talking about language diversity across the Internet, I like to include a visual that illustrates the language leaders of the Internet:

Language leaders of the Internet 2014

This chart is based on data from the 2014 Web Globalization Report Card. English (US) is not counted.

In it, you have Wikipedia at the top, supporting more than 280 languages.Wikipedia represents (for now) the high-water mark for linguistic diversity on a website. It’s a fascinating benchmark because people are not paid to create content; what you see reflects user initiative (as well as factors such as Internet and computer penetration).

I was interested to see this quote in Motherboard:

There are 533 proposals for Wikipedia languages in incubator stage, more than twice the number of actual Wikipedias, but Kornai estimates no more than a third of them will ever get the required minimum of at least five active users and get enough pages to make it onto Wikipedia proper.

So it’s feasible we could the see the number of languages on Wikipedia double in the years ahead — though the article stresses that languages are in fact dying as a result of the Internet (a topic for a future blog post).

To the left of Wikipedia we have Google Search with support for more than 140 languages. However, this number reflects only the Google Search interface; most Google services (such as YouTube and Gmail) support fewer than 60 languages.

Next you have global companies such as Toyota and DHL and Panasonic, which support roughly 41-42 languages on their websites.

For most companies, 40 languages is a goal they cannot even imagine reaching. The average number of languages supported by the websites in the Report Card is 28 — which reflects only the leading global companies and brands.

Average number of languages supported by leading global websites

Most companies are happy if they support five or more languages on their websites.

So what does this data mean? To me, it means that there is a profound gap between possible number of languages a website can support (Wikipedia) and the practical number of languages that most websites currently support. By practical, I’m referring to the limited budgets that companies commit to professional translation.

Now, to the far right of the chart is Google Translate — with support for roughly 80 languages. Now here is where things get interesting, because machine translation (warts and all) supports a vastly greater number of languages than the Fortune 500 (or 50 for that matter)

google_translate_2014

That’s not to say that companies shouldn’t continue to invest in professional translation — indeed they should.

But machine translation has a  disruptive role to play in helping to overcome the language chasm. 

iYouth: The battle over the next generation of tech consumers

I love these new Apple advertisements:

And…

But one thing I couldn’t help but wonder while watching them: Where are the older folks?

That is, where are the people of my demographic? Or my parents?

If you look closely you’ll see one or two people who may be 40 or 50 years or older, but overwhelmingly these are ads starring young people.

Apple does not want to be known as that company that is only popular with your parents.

A point that Samsung is happy to underscore with this ad:

Of course, Apple is hugely popular with your parents. And grandparents. And even great-grandparents.

Which is probably why Apple is so focused on the next generation of tech consumers.

And not just American tech consumers, but global tech consumers.