One of the key ingredients that made ChatGPT a ripsnorting success was an army of human trainers who gave the artificial intelligence model behind the bot guidance on what constitutes good and bad outputs. OpenAI now says that adding even more AI into the mix, to assist those human trainers, could help make AI helpers smarter and more reliable. In developing ChatGPT, OpenAI pioneered the use of reinforcement learning from human feedback, or RLHF.
“And as models continue to get better and better, we suspect that people will need more help,” McAleese says. The new technique is one of many now being developed to improve large language models and squeeze more abilities out of them. It is also part of an effort to ensure that AI behaves in acceptable ways even as it becomes more capable.