• @[email protected]
    109 months ago

    Rather than eliminating some of the training data, you could add more training data to create an even balance.

    • @kromem
      39 months ago

      Indeed - there’s a very good argument for using synthetic data to introduce diversity as long as you can avoid model collapse.
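
For anyone who wants to see the "add data until it's balanced" idea in concrete terms, here is a minimal sketch. It is not what either commenter described in detail: it assumes each example carries a group label, and it balances by randomly duplicating examples from the smaller groups (plain random oversampling) instead of collecting or generating genuinely new data. The function name `oversample_to_balance` and the `label_of` parameter are made up for illustration.

```python
import random


def oversample_to_balance(examples, label_of, seed=0):
    """Return a new list in which every group has as many examples as the largest one.

    Assumption: balance is reached by re-sampling existing examples with
    replacement, which is only a stand-in for adding real or synthetic data.
    """
    rng = random.Random(seed)

    # Bucket the examples by their group label.
    groups = {}
    for example in examples:
        groups.setdefault(label_of(example), []).append(example)

    # Every group is topped up to the size of the largest group.
    target = max(len(group) for group in groups.values())

    balanced = []
    for group in groups.values():
        balanced.extend(group)  # keep all original examples
        # Draw the missing examples with replacement (zero draws for the largest group).
        balanced.extend(rng.choices(group, k=target - len(group)))
    return balanced
```

Duplicating examples adds no new information, which is why the reply's point about synthetic data and model collapse matters: real or carefully generated data would take the place of the `rng.choices` step.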