From Everyday Essentials to Must-Have Deals, We’ve Got You Covered

OpenAI and Anthropic conducted safety evaluations of each other’s AI systems


Most of the time, AI companies are locked in a race to the top, treating each other as rivals and competitors. Today, OpenAI and Anthropic revealed that they agreed to evaluate the alignment of each other’s publicly available systems and shared the results of their analyses. The full reports get pretty technical, but are worth a read for anyone who’s following the nuts and bolts of AI development. A broad summary showed some flaws with each company’s offerings, as well as revealing pointers for how to improve future safety tests.

Anthropic said it for “sycophancy, whistleblowing, self-preservation, and supporting human misuse, as well as capabilities related to undermining AI safety evaluations and oversight.” Its review found that o3 and o4-mini models from OpenAI fell in line with results for its own models, but raised concerns about possible misuse with the ​​GPT-4o and GPT-4.1 general-purpose models. The company also said sycophancy was an issue to some degree with all tested models except for o3.

Anthropic’s tests did not include OpenAI’s most recent release. has a feature called Safe Completions, which is meant to protect users and the public against potentially dangerous queries. OpenAI recently faced its after a tragic case where a teenager discussed attempts and plans for suicide with ChatGPT for months before taking his own life.

On the flip side, OpenAI for instruction hierarchy, jailbreaking, hallucinations and scheming. The Claude models generally performed well in instruction hierarchy tests, and had a high refusal rate in hallucination tests, meaning they were less likely to offer answers in cases where uncertainty meant their responses could be wrong.

The move for these companies to conduct a joint assessment is intriguing, particularly since OpenAI allegedly violated Anthropic’s terms of service by having programmers use Claude in the process of building new GPT models, which led to Anthropic OpenAI’s access to its tools earlier this month. But safety with AI tools has become a bigger issue as more critics and legal experts seek guidelines to protect users, particularly minors.

Trending Products

- 39% HP 2024 Laptop | 15.6″ FHD (1...
Original price was: $983.98.Current price is: $599.99.

HP 2024 Laptop | 15.6″ FHD (1...

0
Add to compare
- 24% Lenovo V-Series V15 Business Laptop...
Original price was: $988.68.Current price is: $749.00.

Lenovo V-Series V15 Business Laptop...

0
Add to compare
- 7% HP 24mh FHD Pc Monitor with 23.8-In...
Original price was: $159.99.Current price is: $148.00.

HP 24mh FHD Pc Monitor with 23.8-In...

0
Add to compare
- 42% Thermaltake Ceres 300 Matcha Green ...
Original price was: $171.98.Current price is: $99.99.

Thermaltake Ceres 300 Matcha Green ...

0
Add to compare
- 5% ASUS TUF Gaming 27″ 1080P Mon...
Original price was: $199.00.Current price is: $189.00.

ASUS TUF Gaming 27″ 1080P Mon...

0
Add to compare
- 31% Acer Nitro 27″ WQHD 2560 x 14...
Original price was: $289.99.Current price is: $199.99.

Acer Nitro 27″ WQHD 2560 x 14...

0
Add to compare
- 28% CORSAIR iCUE 4000X RGB Tempered Gla...
Original price was: $144.99.Current price is: $104.99.

CORSAIR iCUE 4000X RGB Tempered Gla...

0
Add to compare
- 32% SAMSUNG 32-Inch ViewFinity S7 (S70D...
Original price was: $399.99.Current price is: $270.99.

SAMSUNG 32-Inch ViewFinity S7 (S70D...

0
Add to compare
- 23% Wi-fi Keyboard and Mouse Combo, Lov...
Original price was: $29.99.Current price is: $22.99.

Wi-fi Keyboard and Mouse Combo, Lov...

0
Add to compare
- 37% Lian Li O11 Vision -Three Sided Tem...
Original price was: $223.98.Current price is: $139.99.

Lian Li O11 Vision -Three Sided Tem...

0
Add to compare
.
We will be happy to hear your thoughts

Leave a reply

BargainFindsCo
Logo
Register New Account
Compare items
  • Total (0)
Compare
0
Shopping cart