Want to stop chatGPT from crawling your website? Just mention Australian mayor Brian Hood (or any of the other names listed in the article)

When asked about these names, ChatGPT responds with “I’m unable to produce a response” or “There was an error generating a response” before terminating the chat session, according to Ars’ testing. The names do not affect outputs using OpenAI’s API systems or in the OpenAI Playground (a special site for developer testing).

The filter also means that it’s likely that ChatGPT won’t be able to answer questions about this article when browsing the web, such as through ChatGPT with Search. Someone could use that to potentially prevent ChatGPT from browsing and processing a website on purpose if they added a forbidden name to the site’s text.

  • @[email protected]
    link
    fedilink
    299 days ago

    I think there are two crawlers and the one on the data collection stage to build the model will still crawl away even if you have certain content on your page.

    The one that searches when you ask a question is a different one.

      • @[email protected]
        link
        fedilink
        2
        edit-2
        9 days ago

        More recent versions can search the internet. Then it basically adds the words of the page to the prompt.

        Edit: Might have misunderstood, to make it crash it doesn’t have to search. That data is already internal.

        • @[email protected]
          link
          fedilink
          39 days ago

          I don’t think this is a crash. This looks like a filter on openAI’S end now that I’ve played with it myself