‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says::Pressure grows on artificial intelligence firms over the content used to train their products

  • @BURN
    English
    5
    9 months ago

    Too bad

    Why do they have free rein to store and use copyrighted material as training data? AIs don’t learn as a human would, and comparisons can’t be made between the learning processes.

    • @[email protected]
      English
      1
      9 months ago

      They can be made. Imagine trying to hold any conversation without being able to reference popular culture.

    • @SCB
      English
      -1
      edit-2
      9 months ago

      Why do you have free rein to do the same?

      “AIs don’t learn as a human would, and comparisons can’t be made between the learning processes.”

      I think you’re going to have a hard time proving a financial distinction between them

      • @BURN
        English
        3
        9 months ago

        You don’t need to prove a financial difference. They are fundamentally different systems that function in different ways. They cannot be compared 1:1 and laws cannot be applied as a 1:1. New regulations need to be added around AI use of copyrighted material.

        • @SCB
          English
          0
          9 months ago

          I agree. For instance, it should be secured in law that you can train AI on anything, to avoid frivolous discussions like this.

          Output is what should be moderated by law.

          • @BURN
            English
            1
            9 months ago

            No

            Why are you entitled to use everyone else’s work? It should be secured in law that licensing applies to training data to avoid frivolous discussions like this. Then it’s an entirely opt-in solution, which works to the benefit of everyone except the people stealing data.

            Output doesn’t matter since it’s pretty well settled it’s not derivative work (as much as I disagree with that statement).

            • @SCB
              English
              2
              9 months ago

              “the people stealing data”

              No one is doing this

              “Output doesn’t matter since it’s pretty well settled it’s not derivative work”

              Cool, discussion over.

              • @BURN
                English
                0
                9 months ago

                It is stealing data. In order to train on it they have to store the data. That’s a copyright violation. There’s no way to interpret it as not stealing data.

                • @5too
                  English
                  0
                  9 months ago

                  It is not stealing. The data is still there. It is, at worst, a copyright violation.

                  • @BURN
                    English
                    2
                    9 months ago

                    Copyright violation is stealing