User-sourced content moderation

cabbage · edit-2 10 months ago

User-sourced content moderation

Marino · 10 months ago

I like the idea but with a more gradual approach.

Instead of trusted and not trusted users (1/0, yes/no), something like levels (0 to 100). Users trust level can be growing with use and positive flags, and decrease with false positives or with time.

When a user mark a post as abuse, his/her trust level is added to the ‘abuse level’ of the post. As an example, when the post ‘abuse level’ reachs some threeshold, a warning to users can be shown, if level reachs higher, a moderator can be warned. If the level reachs even higher threeshold, post is hidden.

If a moderator reviews the post and find it is a positive mark, trust level of reporters is increased. If it’s a false positive, trust level of reporters is decreased.

This can make that a single user with a false positive won’t make the post hidden or tagged, unless is a user with a very high trust level. At the same time, a post can be marked as abuse if a few users with medium levels find it abusive.

Trust levels shouldn’t be so easy to reach and I think users shouldn’t know the exact level they are.

Just an idea :)

cabbage · 10 months ago

I like this spin on it!

I guess it would have to tie in with the existing report function - it doesn’t make much sense to have users report something as abuse if they don’t think it’s worth warning a moderator over. At that point it sounds like a downvote should be enough.

It could also be a challenge if it is taken too lightly - say someone posts something wildly controversial but within the boundaries of free speech, such a post about pineapple on pizza in a foodporn community. A large number of users might report it to moderators for more or less serious reasons, but it would be unfortunate if this caused the temporary removal of the post without moderator action.

It should probably be established in the reporting procedure not only that the user is credible, but also that the user actually believes that it is necessary to remove the post as opposed to other moderator action.

@Spiralvortexisalie · 10 months ago

I wonder how this would work across instances, especially as what might be seen as abusive in one instance may not be in another. Also could this be subject to poisoning, ie spinning up an instance to inflate account reputation on another instance or to mass report abuse from users that instance claims “reputable.” Making it something that is configurable with per instance granularity, aside from being tedious, might lead to situations where only the big instance users get a say, smaller ones can become trashed by a user who manages to become the most reputable solely for coming from a big instance, and/or being able to spam up their rep across instances while mods are asleep possibly exploiting time zone differences (ie build fake rep from lemmy.world at night, to spam a European or Asian instance that is in daylight hours).

Rimu · 10 months ago

I like this idea a lot!

The way Stackoverflow does things is very interesting, where people initially have access to few features and gradually get granted more as their reputation grows - https://stackoverflow.com/help/whats-reputation. It works for them.