Major Breakthrough: Google Cracked the Code for Building AI in 400+ Languages
Ever wonder why ChatGPT speaks English better than, say, Swahili or Arabic?
It’s not an accident, or some special favorability of English in the training data; it’s math. AI companies have been flying blind when building models for non-English languages, guessing at how much data to use and which languages to train together.
Google’s research team just published ATLAS (paper), the largest public study on multilingual AI training. They ran 774 experiments across 400+ languages...