Zeyi Yang / MIT Technology Review: Some researchers say GPT-4o’s Chinese token-training data is polluted by spam and porn websites, likely due to inadequate data cleaning  —  Soon after OpenAI released GPT-4o on Monday, May 13, some Chinese speakers started to notice something seemed off about this newest version of the chatbot …

  • @breakingcups
    45 months ago

    It’s only an 86-billion-dollar company; you can’t expect them to have decent quality control before major releases.