The research from Purdue University, first spotted by news outlet Futurism, was presented earlier this month at the Computer-Human Interaction Conference in Hawaii. It examined ChatGPT's answers to 517 programming questions taken from Stack Overflow.

“Our analysis shows that 52% of ChatGPT answers contain incorrect information and 77% are verbose,” the new study explained. “Nonetheless, our user study participants still preferred ChatGPT answers 35% of the time due to their comprehensiveness and well-articulated language style.”

Disturbingly, programmers in the study didn’t always catch the mistakes being produced by the AI chatbot.

“However, they also overlooked the misinformation in the ChatGPT answers 39% of the time,” according to the study. “This implies the need to counter misinformation in ChatGPT answers to programming questions and raise awareness of the risks associated with seemingly correct answers.”

  • @Eheran
    7 months ago

    The study used GPT-3.5, not GPT-4.

    • @phoneymouse
      7 months ago

      4 produces inaccurate programming answers too

      • @Eheran
        7 months ago

        Obviously. But it is FAR better yet again.

        • @phoneymouse
          7 months ago

          Not really. I ask it questions all the time and it makes shit up.

          • @Eheran
            7 months ago

            Yes. But it is better than 3.5 without any doubt.