• @RememberTheApollo_
    link
    1611 months ago

    MidJourney has the same problem. “A room that has no elephants in it” is the prompt.

    There very much is an elephant present.

    • @nandeEbisu
      link
      1211 months ago

      Just don’t talk about it and you’re good.

    • Xanthrax
      link
      111 months ago

      deleted by creator

      • @RememberTheApollo_
        link
        311 months ago

        MidJourney doesn’t have a “negative” prompt space. It does have a “no” prompt, but it isn’t very good at obeying it.

        This was just a fun thing to try, I’m not taking it seriously. “No” is not weighted contextually in the prompt, so draw + elephants + room are what the AI sees. The correct prompt would be “draw an empty room” without inserting any unnecessary language, and you get just that:

    • 𝕯𝖎𝖕𝖘𝖍𝖎𝖙
      link
      -2
      edit-2
      11 months ago

      Try saying “a room” and leaving off the elephants. AI cannot understand “no” like you think it does.

      • @RememberTheApollo_
        link
        811 months ago

        I think most of us understand that and this exercise is the realization of that issue. These AI do have “negative” prompts, so if you asked it to draw a room and it kept giving you elephants in the room you could “-elephants”, or whatever the “no” format is for the particular AI, and hope that it can overrule whatever reference it is using to generate elephants in the room. It’s not always successful.

        • @fidodo
          link
          English
          2
          edit-2
          11 months ago

          I think the main point here is that image generation AI doesn’t understand language, it’s giving weight to pixels based on tags, and yes you can give negative weights too. It’s more evident if you ask it to do anything positional or logical, it’s not designed to understand that.

          LLMs are though, so you could combine the tools so the LLM can command the image generator and even create a seed image to apply positional logic. I was surprised to find out that asking chat gpt to generate a room without elephants via dalle also failed. I would expect it to convert the user query to tags and not just feed it in raw.