So I was just reading this thread about deepseek refusing to answer questions about Tianenmen square.

It seems obvious from screenshots of people trying to jailbreak the webapp that there’s some middleware that just drops the connection when the incident is mentioned. However I’ve already asked the self hosted model multiple controversial China questions and it’s answered them all.

The poster of the thread was also running the model locally, the 14b model to be specific, so what’s happening? I decide to check for myself and lo and behold, I get the same “I am sorry, I cannot answer that question. I am an AI assistant designed to provide helpful and harmless responses.”

Is it just that specific model being censored? Is it because it’s the qwen model it’s distilled from that’s censored? But isn’t the 7b model also distilled from qwen?

So I check the 7b model again, and this time round that’s also censored. I panic for a few seconds. Have the Chinese somehow broken into my local model to cover it up after I downloaded it.

I check the screenshot I have of it answering the first time I asked and ask the exact same question again, and not only does it work, it acknowledges the previous question.

So wtf is going on? It seems that “Tianenmen square” will clumsily shut down any kind of response, but Tiananmen square is completely fine to discuss.

So the local model actually is censored, but the filter is so shit, you might not even notice it.

It’ll be interesting to see what happens with the next release. Will the censorship be less thorough, stay the same, or will china again piss away a massive amount of soft power and goodwill over something that everybody knows about anyway?

  • @[email protected]OP
    link
    fedilink
    English
    1
    edit-2
    7 days ago

    I’m not particularly surprised by the censorship. That’s not really the point of the post.

    There’s constant arguments going on between people over whether it’s censored or not. A lot of people, me included tbh, were under the impression that it wasn’t because we were able to get information out of it that we would expect to be censored. Other people have claimed not to be able to when trying similar. Therefore we’ve ended up with people arguing over whether it is or isn’t.

    I investigated and proved that both sides are kinda of right, and explained why people are getting different results for doing what is ostensibly the same thing.

    • @[email protected]
      link
      fedilink
      English
      77 days ago

      wow, the point must’ve picked up a speed booster when it got close to you. so hard to grasp it!

      • @[email protected]OP
        link
        fedilink
        English
        -47 days ago

        Easier to grasp than your witticisms, clearly.

      • @[email protected]
        link
        fedilink
        English
        -37 days ago

        To me it looks like you’re the one missing this person’s point. And the snark doesn’t help you, or them.

        • @[email protected]
          link
          fedilink
          English
          47 days ago

          you came into TechTakes wanting less snark? holy fuck you’re lost