So many prompts I have tried yet the results from Bing are always pretty weak compared to the real gpt-4. I also prompted it to write some Russian poems, so far it only spewed out gibberish with no rhymes. On the other hand, the real gpt-4 can sometimes produce really impressive shit, that I read with interest.

Another thing I noticed is that if I try to get Bing to generate something inappropriate, it’ll go along and do it for a second, but then it’ll quickly wipe its message. That’s interesting because it suggests that the underlying model isn’t the same as OpenAI’s, which seems unable to generate harmful content at the core.

  • @[email protected]
    link
    fedilink
    English
    1011 months ago

    it’ll go along and do it for a second, but then it’ll quickly wipe its message. That’s interesting because it suggests that the underlying model isn’t the same as OpenAI’s

    Obviously I can’t speak to Bing’s claim on GPT4, but the behavior you saw does not necessarily have to do with the model. There are many ways the “chatbots” on the same model could behave differently, either by defining those behaviors programmatically or via context and prompts.

    • @[email protected]
      link
      fedilink
      English
      311 months ago

      Something more to think about: how does the model “know” if it’s generating prohibited content before it starts generating.

      • @AtmaJnana
        link
        English
        211 months ago

        They likely scan the prompt as well as the output.