Baidu, China's closest equivalent to Google, has achieved the highest score at the General Language Understanding Evaluation (GLUE) AI competition. What's notable about Baidu's achievement is that it illustrates how AI research benefits from a diversity of contributors. MIT Technology Review explains:

GLUE is a widely accepted benchmark for how well an AI system understands human language. It consists of nine different tests for things like picking out the names of people and organizations in a sentence and figuring out what a pronoun like "it" refers to when there are multiple potential antecedents. A language model that scores highly on GLUE, therefore, can handle diverse reading comprehension tasks. Out of a full score of 100, the average person scores around 87 points. Baidu is now the first team to surpass 90 with its model, ERNIE.

Baidu's researchers had to develop a technique specifically for the Chinese language to build ERNIE (which stands for "Enhanced Representation through kNowledge IntEgration"). It just so happens, however, that the same technique makes it better at understanding English as well. [...] [T]he researchers trained ERNIE on a new version of masking that hides strings of characters rather than single ones. They also trained it to distinguish between meaningful and random strings so it could mask the right character combinations accordingly. As a result, ERNIE has a greater grasp of how words encode information in Chinese and is much more accurate at predicting the missing pieces. This proves useful for applications like translation and information retrieval from a text document.

The researchers very quickly discovered that this approach actually works better for English, too. Though not as often as Chinese, English similarly has strings of words that express a meaning different from the sum of their parts. Proper nouns like "Harry Potter" and expressions like "chip off the old block" cannot be meaningfully parsed by separating them into individual words.

The latest version of ERNIE uses several other training techniques as well. It considers the ordering of sentences and the distances between them, for example, to understand the logical progression of a paragraph. Most important, however, it uses a method called continuous training that allows it to train on new data and new tasks without it forgetting those it learned before. This allows it to get better and better at performing a broad range of tasks over time with minimal human interference.

Baidu actively uses ERNIE to give users more applicable search results, remove duplicate stories in its news feed, and improve its AI assistant Xiao Du's ability to accurately respond to requests. The researchers have described ERNIE's latest architecture in a paper that will be presented at the Association for the Advancement of Artificial Intelligence conference next year.
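
To make the masking idea described above concrete, here is a rough Python sketch contrasting token-by-token masking with masking whole multi-word units. It is only an illustration under stated assumptions, not Baidu's implementation: the function names, the `[MASK]` placeholder, the masking rate, and the hand-written phrase spans are all hypothetical, and a real system would get its spans from some phrase or entity detector.

```python
import random

MASK = "[MASK]"  # placeholder symbol, chosen here for illustration


def mask_tokens(tokens, rate, rng):
    """Baseline: hide individual tokens independently of one another."""
    return [MASK if rng.random() < rate else tok for tok in tokens]


def mask_phrases(tokens, phrase_spans, rate, rng):
    """Phrase-level masking: each multi-token unit is hidden or kept as a whole.

    phrase_spans is a list of (start, end) index pairs, end exclusive.
    They are assumed to come from an upstream phrase or entity detector.
    """
    masked = list(tokens)
    for start, end in phrase_spans:
        if rng.random() < rate:
            for i in range(start, end):
                masked[i] = MASK
    return masked


tokens = "Harry Potter is a chip off the old block".split()
# Hypothetical, hand-labelled spans: "Harry Potter" and "chip off the old block".
phrase_spans = [(0, 2), (4, 9)]

rng = random.Random(0)
print(mask_tokens(tokens, rate=0.5, rng=rng))
print(mask_phrases(tokens, phrase_spans, rate=0.5, rng=rng))
```

With the baseline, a model may see "Harry [MASK]" and only has to guess one word; with phrase-level masking, "Harry Potter" is either fully visible or fully hidden, so the model has to predict the unit from the surrounding context.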
Read more of this story at Slashdot.