• @[email protected]
    link
    fedilink
    69 months ago

    So I guess there are two paths of training data. Some company selling it explicitly, and the companies just scraping accessible data. Not that either is “good”, but at least with public data, you only have the AI company profiting.

    • Soatok DreamseekerOPM
      link
      fedilink
      69 months ago

      Yep. That’s why the two things I say Automattic MUST do to make things right are about proper consent controls for Automattic’s use of data and sale to AI vendors, but the third thing is a proposed proactive defense against scrapers.