• @[email protected]
    18 months ago

    It’s all about cleaning datasets. For forecasting models, you need to occasionally remove certain historical data to increase accuracy.
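    As a rough illustration of that point, here is a minimal sketch using only the Python standard library. The sales figures, the `drop_anomalies` and `naive_forecast` helpers, and the 3.5 threshold are all made up for the example; real forecasting pipelines would use something more careful than a plain mean.

```python
from statistics import mean, median


def drop_anomalies(history, threshold=3.5):
    """Return history without points whose modified z-score exceeds threshold.

    Uses the median absolute deviation (MAD), which is less distorted by
    the outliers themselves than a mean/standard-deviation check would be.
    """
    med = median(history)
    mad = median(abs(x - med) for x in history)
    if mad == 0:
        # All points are (nearly) identical; nothing sensible to remove.
        return list(history)
    return [x for x in history if 0.6745 * abs(x - med) / mad <= threshold]


def naive_forecast(history):
    """Hypothetical stand-in for a real model: predict the historical mean."""
    return mean(history)


# Made-up monthly figures with one anomalous month (500).
history = [100, 102, 98, 101, 500, 99, 103, 97]
cleaned = drop_anomalies(history)

print(naive_forecast(history))  # forecast skewed by the anomalous month
print(naive_forecast(cleaned))  # forecast after removing it
```

    The same idea is what gets hard at internet scale: deciding what counts as an "anomaly" is easy for one numeric series and much less so for text across every discipline.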

    The same could work here, but it’s obviously at a significantly larger scale and crosses into every interest and discipline.

    I believe the solution is curated data models, with the top members of the applicable field determining validity, or a Stack Overflow model.

    We should basically have a “clean” copy of the internet that is always 3-6 months behind, since only quality data gets added to it.

    • @elshandra
      18 months ago

      > I believe the solution is curated data models, with the top members of the applicable field determining validity, or a Stack Overflow model.

      I think you’re on the right track here, but ultimately this approach will retain the same flaws.

      Personally, I believe the models should be open, and all interested parties should have varying degrees of influence over the accepted truth. That’s going to be complicated in itself.

      By limiting it to “trusted people”, you only have to corrupt enough of them, and eventually you end up with the same shitty problems, but with bots too.