I compared Claude Opus 4.8 with 4.7 in a 10-round honesty test - and a legal prompt broke it

I compared Claude Opus 4.8 with 4.7 in a 10-round honesty test - and a legal prompt broke it
The latest models were pitted against coding, medical, finance, and legal traps, then I cross-checked the results with multiple AIs.

Take Your Experience to the Next Level

New

Download our mobile app for a faster and better experience.

Comments

0
U

Join the discussion

Sign in to leave a comment

0:000:00