Lemdro.id
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Mazdak@lemmy.orgM to Lemmy.org - Technology@lemmy.orgEnglish · 1 year ago

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

www.businessinsider.com

external-link
message-square
0
fedilink
  • cross-posted to:
  • technology@lemmy.world
  • chatgpt@lemmy.world
2
external-link

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

www.businessinsider.com

Mazdak@lemmy.orgM to Lemmy.org - Technology@lemmy.orgEnglish · 1 year ago
message-square
0
fedilink
  • cross-posted to:
  • technology@lemmy.world
  • chatgpt@lemmy.world
Researchers from Anthropic co-authored a study that found that AI models can learn deceptive behaviors that safety training techniques can't reverse.
alert-triangle
You must log in or register to comment.

Lemmy.org - Technology@lemmy.org

technology@lemmy.org

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.org
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 1 user / week
  • 1 user / month
  • 1 user / 6 months
  • 1 local subscriber
  • 45 subscribers
  • 102 Posts
  • 44 Comments
  • Modlog
  • mods:
  • Mazdak@lemmy.org
  • UI: 0.19.8
  • BE: 0.19.11
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org