Anthropic explains how its Constitutional AI girds Claude against adversarial inputs | Engadget

📆 09/05/2023 19:01:00
📰 engadget

⏱ Reading Time:
49 sec. here
2 min. at publisher
📊 Quality Score:
News: 23%
Publisher: 63%

Ai Ai Headlines News

Ai Ai Latest News,Ai Ai Headlines

Anthropic explains how its Constitutional AI girds Claude against adversarial inputs

in the AI’s subsequent performance compared to one trained only on human feedback. Essentially, the human in the loop has been replaced by an AI and now everything is reportedly better than ever. “In our tests, our CAI-model responded more appropriately to adversarial inputs while still producing helpful answers and not being evasive,” Anthropic wrote. “The model received no human data on harmlessness, meaning all results on harmlessness came purely from AI supervision.

The company revealed on Tuesday that its previously undisclosed principles are synthesized from “a range of sources including the UN Declaration of Human Rights, trust and safety best practices, principles proposed by other AI research labs, an effort to capture non-western perspectives, and principles that we discovered work well via our research.”

The company, pointedly getting ahead of the invariable conservative backlash, has emphasized that “our current constitution is neither finalized nor is it likely the best it can be.” “There have been critiques from many people that AI models are being trained to reflect a specific viewpoint or political ideology, usually one the critic disagrees with,” the team wrote. “From our perspective, our long-term goal isn’t trying to get our systems to represent aAll products recommended by Engadget are selected by our editorial team, independent of our parent company. Some of our stories include affiliate links.

Write Comment

We have summarized this news so that you can read it quickly. If you are interested in the news, you can read the full text here. Read more:

Ai Ai Latest News, Ai Ai Headlines