Claude was being judgy, so I called it out. It immediately caved. Is verbal abuse a valid method of circumventing LLM censorship??

  • @Pieisawesome
    link
    English
    12 days ago

    Yes. Abuse towards LLMs works.

    My team has shared prompts and about 50% of them threaten some sort of harm

    • @lunar17OP
      link
      English
      71 day ago

      Yikes. I knew this tech would introduce new societal issues, but I can’t say this is one I foresaw.