Skip to content
  • 0 Votes
    3 Posts
    0 Views
    Michał "rysiek" Woźniak · 🇺🇦R
    Anthropic does make an important point though, even though they try to bury it:> [The attackers] had to convince Claude—which is extensively trained to avoid harmful behaviors—to engage in the attack. They did so by jailbreaking it (…) They also told Claude that it was an employee of a legitimate cybersecurity firm, and was being used in defensive testing.The real story is how hilariously unsafe Claude is, and how a company valued at $180bn refuses to take responsibility for that.3/🧵