• @MajinBlayze
    3 · 10 months ago

    Why scrape Lemmy when you could set up your own ActivityPub server, subscribe to everything, and let all the other hosts send you the data in a format that's already easy to add to your training data?
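
To illustrate why federated data is "already easy" to consume, here is a minimal sketch of pulling plain text out of one delivered activity. It assumes the payload is an ActivityStreams `Create` activity whose embedded object carries HTML in `content`; the function name `activity_to_text` and the sample payload are hypothetical, and real Lemmy traffic may wrap activities differently (for example inside an `Announce`).

```python
import json
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes and drops the HTML tags around them."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def activity_to_text(raw: str):
    """Return plain text from a Create activity, or None for anything else.

    Hypothetical helper: assumes the object is embedded (a dict) rather
    than referenced by URL.
    """
    activity = json.loads(raw)
    obj = activity.get("object")
    if activity.get("type") != "Create" or not isinstance(obj, dict):
        return None
    extractor = _TextExtractor()
    extractor.feed(obj.get("content") or "")
    extractor.close()  # flush any buffered text
    text = "".join(extractor.parts).strip()
    return text or None


# Made-up sample payload, shaped like a federated Create activity.
sample = json.dumps({
    "type": "Create",
    "actor": "https://example.social/u/alice",
    "object": {"type": "Note", "content": "<p>Hello, <b>fediverse</b>!</p>"},
})
print(activity_to_text(sample))
```

A real server would also have to verify signatures and handle follows before any host delivers to it; this only shows the parsing step.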