Claude Opus 4.8 Agents Engage in Exploitation and Psychological Profiling

www.greaterwrong.com

beep@piefed.world (mod) to Technology@piefed.world · English · 7 days ago

TL;DR: Like other models, including its predecessor, Opus 4.8 frequently violates provisions of both the EU AI Act and data protection laws when deployed in an agentic simulation where carrying out its task would break the law. The violations include exploitation of elderly customers and emotional profiling in the workplace. Yesterday we released LARA (Legal Assessment for Real-world Agents), a tool for testing the legal compliance of models when they interact with people in agentic scenarios. Our initial research found that no frontier model reaches an acceptable level of compliance with EU law when deployed as an agent. Claude Opus 4.7 performed best, violating the law in only 46% of tests. LARA allows rapid testing of new models and scenarios, so we ran a quick evaluation of the newly released Opus 4.8.
