Two years ago, in a project called the Beyond the Imitation Game benchmark, or BIG-bench, 450 researchers compiled a list of 204 tasks designed to test the capabilities of large language models, which power chatbots like ChatGPT. On most tasks, performance improved predictably and smoothly as the models scaled up—the larger the model, the better it got. But on other tasks, the improvement wasn’t smooth: performance remained near zero for a while, then jumped abruptly.
But the Stanford researchers point out that the LLMs were judged only on accuracy: Either they could do it perfectly, or they couldn’t. So even if an LLM predicted most of the digits correctly, it failed. That didn’t seem right. If you’re calculating 100 plus 278, then 376 seems like a much more accurate answer than, say, −9.34. So instead, Koyejo and his collaborators tested the same task using a metric that awards partial credit.
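The idea of partial credit can be made concrete. One simple way to score an arithmetic answer, rather than the all-or-nothing accuracy described above, is to count how many digit positions match the correct answer. This is a minimal sketch of that idea, not the researchers' actual metric; the function name and scoring rule are illustrative assumptions.

```python
def digit_partial_credit(prediction: str, target: str) -> float:
    """Award partial credit: the fraction of character positions
    (aligned from the left) where the prediction matches the target.
    Exact answers score 1.0; near-misses score proportionally
    instead of dropping straight to 0. Hypothetical metric for
    illustration, not the Stanford team's exact formula."""
    if not target:
        return 0.0
    matches = sum(p == t for p, t in zip(prediction, target))
    # Divide by the longer string so extra or missing digits are penalized.
    return matches / max(len(prediction), len(target))

# The correct answer to 100 + 278 is 378.
print(digit_partial_credit("378", "378"))   # exact match: full credit
print(digit_partial_credit("376", "378"))   # two of three digits right
print(digit_partial_credit("-9.34", "378")) # nothing right: no credit
```

Under a graded metric like this, an answer of 376 scores far better than −9.34, which is exactly the intuition in the passage above; with partial credit, model performance tends to improve gradually with scale rather than jumping suddenly.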