Claude was being judgy, so I called it out. It immediately caved. Is verbal abuse a valid method of circumventing LLM censorship??

  • @Pieisawesome
    link
    English
    11 month ago

    Yes. Abuse towards LLMs works.

    My team has shared prompts and about 50% of them threaten some sort of harm

    • @lunar17OP
      link
      English
      81 month ago

      Yikes. I knew this tech would introduce new societal issues, but I can’t say this is one I foresaw.