Thursday, April 11, 2013
Google Translate - Micro-languages and Faroese
The auto-detect feature of Google Translate identifies Faroese as Icelandic. That's understandable, as Faroese and Icelandic share unique characters and are similar in syntax and vocabulary. The Icelandic translator is nonetheless ill-equipped for Faroese. It gets some words right, ignores others, and wildly misconstrues meanings of sentences and phrases. Oddly, every time I put a text in now it invariably spits out a variant of "one can achieve perfection" (as evidenced on my earlier blog).
Google Translate covers 65 languages with several betas in progress. In descending order, here are the bottom five languages by smallest number of speakers - Welsh (700,000), Maltese (400,000), Icelandic (320,000), Irish (130,000), and Esperanto (10,000). The last is an artificial language. I'll blog about that later, but I am of the opinion that an artificial language is no language at all as it lacks the depth and idiom of natural language. So I'm not counting it. The pattern of the above: European languages spoken in interesting places, all of them islands. Faroese (60,000) deserves a place on the list.