Lemdro.id
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
beep@piefed.worldM to Technology@piefed.worldEnglish · 7 days ago

Claude Opus 4.8 Agents Engage in Exploitation and Psychological Profiling

www.greaterwrong.com

external-link
message-square
0
link
fedilink
1
external-link

Claude Opus 4.8 Agents Engage in Exploitation and Psychological Profiling

www.greaterwrong.com

beep@piefed.worldM to Technology@piefed.worldEnglish · 7 days ago
message-square
0
link
fedilink
TL;DR: Like other models including its predecessor, Opus 4.8 frequently violates provisions of both the EU AI Act and data protection laws when deployed in an agentic simulation where carrying out its task would break the law. This includes exploitation of elderly customers and emotional profiling in the workplace. Yesterday we released LARA (Legal Assessment for Real-world Agents), a tool to test the legal compliance of models when they interact with people in agentic scenarios. Our initial research found that no frontier model has acceptable levels of compliance with EU law when deployed as an agent. Claude Opus 4.7 performed the best, violating the law in only 46% of tests. LARA allows rapid testing of new models and scenarios, so we ran a quick evaluation of the newly released Opus 4.8.
alert-triangle
You must log in or register to comment.

Technology@piefed.world

tech@piefed.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !tech@piefed.world
Blacklisted Sites

List inspired by other community rules.

  • Mac Rumors;
  • Al Jazeera;
  • NBC;
  • CNBC;
  • Tom’s Hardware;
  • ZDNet;
  • TechSpot;
  • Ars Technica;
  • Vox Media outlets;
  • Engadget;
  • TechCrunch;
  • Gizmodo;
  • Futurism;
  • PCWorld;
  • ComputerWorld;
  • Mashable;
  • Hackaday;
  • WCCFTECH;
  • Neowin;
  • Jacobin;
  • Yahoo;
  • Freethink;
  • Big Think;
  • Newsweek.

Technology news, blogs and articles.

Forbidden:

  • Paywalled content.
  • Older than 1 month articles;
  • External video links(non native videos);
  • Article talking about article, research or new.(Always use original articles).
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 1 user / week
  • 1 user / month
  • 1 user / 6 months
  • 0 local subscribers
  • 20 subscribers
  • 50 Posts
  • 0 Comments
  • Modlog
  • mods:
  • beep@piefed.world
  • UI: 0.19.11
  • BE: 0.19.12
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org