Image by Olaf Tausch via WikiÂmeÂdia ComÂmons
We may not see warp driÂves any time soon, but anothÂer piece of Star Trek tech, the uniÂverÂsal transÂlaÂtor, may become a realÂiÂty in our lifeÂtime, if it hasn’t already. Machine learnÂing “has proven to be very comÂpeÂtent” when it comes to transÂlaÂtion, “so much so that the CEO of one of the world’s largest employÂers of human transÂlaÂtors has warned that many of them should be facÂing up the stark realÂiÂty of losÂing their job to a machine,” writes Bernard Marr at Forbes.
But the fact that AI can do things humans can doesÂn’t mean that it does those things well. One Google researcher put the case plainÂly in an interÂview with Wired: “PeoÂple naiveÂly believe that if you take deep learnÂing and… 1,000 times more data, a neurÂal net will be able to do anyÂthing a human being can do, but that’s just not true.” AI transÂlaÂtors have advanced sigÂnifÂiÂcantÂly in the past few years, with Google’s TransÂlaÂtotron proÂtoÂtype (yes, that’s its real name), promisÂing to interÂpret “tone and cadence.” Still, AI transÂlaÂtions are often stiltÂed, awkÂward, and occaÂsionÂalÂly incomÂpreÂhenÂsiÂble approxÂiÂmaÂtions that no human would come up with.
Does AI’s limÂiÂtaÂtions with livÂing lanÂguage hinÂder its abilÂiÂty to deciÂpher very long dead ones, whose orthogÂraÂphy, gramÂmar, and synÂtax have been comÂpleteÂly lost? Yuan Cao from Google’s AI lab and Jiaming Luo and RegiÂna BarziÂlay from MIT put machine learnÂing to the test when they develÂoped a “sysÂtem capaÂble of deciÂpherÂing lost lanÂguages.” They took a very difÂferÂent approach “from the stanÂdard machine transÂlaÂtion techÂniques,” reports the MIT TechÂnolÂoÂgy Review, using less data instead of more, a techÂnique they call “minÂiÂmum-cost flow.”
The researchers testÂed their transÂlaÂtion machine on both the 3500-year-old LinÂear B and UgaritÂic, an ancient form of Hebrew, both of which have already been deciÂphered by peoÂple. Still, the AI was “able to transÂlate both lanÂguages with remarkÂable accuÂraÂcy,” with a rate of 67.3% in the transÂlaÂtion of cogÂnates in LinÂear B. The far oldÂer Bronze Age Minoan script LinÂear A, howÂevÂer (see it at the top), “one of the earÂliÂest forms of writÂing ever disÂcovÂered… is conÂspicÂuÂous for its absence.” No human has yet been able to deciÂpher it.
A lost lanÂguage transÂlaÂtor machine that only works on lanÂguages that have already been transÂlatÂed (it needs preÂexÂistÂing data on the progÂenÂiÂtor lanÂguage to funcÂtion) may not seem parÂticÂuÂlarÂly useÂful. Then again, it could be one step in the direcÂtion of what the authors call the “autoÂmatÂic deciÂpherÂment of lost lanÂguages,” those that humans can’t already work out on their own. Read the paper “NeurÂal DeciÂpherÂment via MinÂiÂmum-Cost Flow: From UgaritÂic to LinÂear B” at arXÂiv.
via MIT TechÂnolÂoÂgy Review
RelatÂed ConÂtent:
Josh Jones is a writer and musiÂcian based in Durham, NC. FolÂlow him at @jdmagness