Moderator’s Dilemma: The Risks of Partial Intervention

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention

If you ever end up moderating a forum, or just becoming deeply involved in the meta section of one like me, it is almost inevitable that you will become involved in disputes over exactly what one is and is not allowed to say on it. You may find these choices can be very difficult as, more often than not, there are no good options. You typically have two choices: (Note: This post touches on hot-button issues more than I’d like. I’ll probably come back later and edit it to avoid these)

Comment

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=WMjx8xuPvpX72xW4s

I think the bigger issue is trust. Do you trust the moderators to moderate fairly? Do you trust that when the moderators ban something for violating policy, their interpretation of the policy is correct? Do you trust that, when you disagree with the moderators, they know better than you? If a mod team has the trust of the community, it can use whatever moderation form it likes. If it doesn’t have that trust, then any form of moderation will be called out by some as being unfair. If this sounds like politics, that’s because it is. I don’t believe the establishment or dissipation of this trust has a lot to do with the specific form of moderation. I think it has more to do (at least in this community) with the moderators being transparent and self-critical about moderation decisions. It also requires people who are willing to stand fast in the face of public criticism without ignoring it or high-handedly proclaiming executive fiat. Policy can help here, but it is no substitute for personality. Focus on choosing people with good judgement, and the correct temperament and the policy takes care of itself. Conversely, choose people with bad judgement or the wrong kind of temperament, and no amount of formal policy will save you.

Comment

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=5LiQrfavNEopTrhg9

Is your point that there should be no formal policy in the first place? Once you do have such a formal policy, the best of judgment doesn’t necessarily help you if you can’t circumvent the constraints set by these policies.

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=8aWqg3oAySYSofKd4

This is a good perspective from off-site: https://​​www.shamusyoung.com/​​twentysidedtale/​​?p=19709

Comment

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=bSQfKN7Nxjn5xEXaM

Excellent article btw:"And then, in every sample, you’re likely to have a tiny minority of completely batshit crazy moron assholes… A lot of things can put someone into this last group. Maybe they’re performing for attention and don’t care how destructive they’re being. Maybe they have a bunch of pain in their lives and they’re trying to share it. Maybe they were raised in some messed-up abusive environment and aggressive hate is their normal. Maybe they just aren’t very good at communicating. It doesn’t matter. They’re broken, and as a moderator you don’t have the power to fix them. The problem is in their heart, and nothing you say or do can make them care about others. Most communities are built around the idea of getting these people to behave. This is a mistake. Broken people cannot be fixed by rules. If you make the rules loose, they will find weak spots and exploit them. If you make the rules tight and specific, they will rules-lawyer you to the brink of insanity. They will haggle over the specifics of the rules, and they will insist everyone be held to precisely the same standards. If you let someone else slide, the nut will condemn you as a hypocrite or accuse you of injustice. … Instead of making rules to compel crazies to behave – which can become a full-time enforcement project – I allow them to act out. And then I ban them. I want to know who the crazy people are, as fast as possible. The sooner they reveal their character, the sooner I can pull them out of the pool before they make a mess. This isn’t hard. Problem People are usually easy to spot. Now, in the context of an open system like this blog, "banning" doesn’t mean much. People can change personal details and come in as someone new. But so what? If someone assumes a new identity, they still have to pass the sanity test. They still have to behave like a human being. And if a banned person assumes a new identity and then behaves in a civilized manner? That’s not a flaw in enforcement. That’s mission accomplished."

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=ZPZ872Hi9h7BcvgsH

When these rules are written down, people will start to assume that if something isn’t in the rules, it must be allowed, particularly if it would have been easy to put it in the rules if they had wanted to. So more people may end up doing things that you don’t want. A related thing that can happen here is that the people who would otherwise do forbidden things, will now do the closest-possible permitted thing. Like, everyone likes saying numbers between 0 and 1. Most of your users like numbers close to 0, like 0.1 and 0.03. A few like numbers close to 1, like 0.95. Basically no one says numbers between 0.3 and 0.7. Team Low convinces you that high numbers are bad, and pushes you to ban people from saying them. You agree, and pick a fairly arbitrary cutoff, say 0.6. You figure it doesn’t matter much exactly where the cutoff is, there’s a whole range of numbers that are basically functionally identical because nobody uses them. But now you’ve got a lot of people saying 0.6, and Team Low isn’t very happy about that (0.6 is still pretty high), and Team High aren’t very happy either (you’ve banned their favourite numbers). It’s not just that any toy not banned will be assumed safe. It’s that toys will suddenly start edging as close as possible to the safety limits. If you say "food weights must be within 3% of what’s on the packaging", then they’ll be 3% below.

Comment

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=tEvCvDF2LnPrB66kt

LARPs (live-action roleplay) have a rule 7: "don’t take the piss." This allows enforcers to punish people for that kind of "closest-possible permitted thing." (an extension of the "don’t take the piss" rule is "if you have to ask if you’re taking the piss, you are"). If we port this back to the moderator’s dilemma, this would entail giving the moderator more power to use judgements or to be capricious (and an acceptance of that by the people posting). This gives great power to the moderators, and can easily be abused (that’s roughly how Chinese government censorship works); but if the moderators are reasonably trustworthy for that particular forum, and if they are known to be capricious, this can work very well.

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=TrxSg8Kezg6vWKTtS

If you say "food weights must be within 3% of what’s on the packaging", then they’ll be 3% below. I would guess that setting this sort of regulation takes this into account, and the process for devising it must be accordingly more complicated. The way the regulatory agency ought to get this number (note: I have no relevant background or experience, so this is all wild guessing) might look something like:

  • Estimate the cost to the supplier of determining the weight of a package as a function of average accuracy. For example, it may cost $0.01 per package to determine the weight with a standard deviation of 1%, $0.04 for std dev of 0.5%, etc.
  • ‎Estimate the cost to the consumer of inaccurate package weights (which could just be linear, as in a package which is 3% underweight may be 3% undervalue, but for large inaccuracies and certain products—like medicine—can be different).
  • ‎Find some balance between costs to the supplier (taking into account, perhaps, that such costs may be different for large suppliers with enough throughput to easily absorb the fixed cost and small suppliers which will face a daunting up-front cost) and the consumer (taking into account, perhaps, that some consumers may be more sensitive than others to inaccuracies). In such a case, the average inaccuracy wouldn’t be 3%; if the supplier had the capability to calculate their weights so exactly that every package was no more or less than 3% underweight, the regulatory agency would presumably insist on 0% inaccuracy. Instead, the supplier would pick an inaccuracy between 0% and 3% such that they would be certain that enough of their packages would be less than 3% inaccurate. Any supplier which did not do so wisely might get away with it for a time, but would eventually put out a sub-par product and presumably be penalized for it. Even a self-interested person will build a margin of error to ensure that they are never found to be noncompliant. These margins may be different for different suppliers—if the agency makes regulations with the intent of keeping barriers to entry low, it will allow a wider margin of error so as not to exclude potential suppliers without affordable access to super-precise measurement technology, allowing suppliers with it to choose an inaccuracy which is allowed by regulation but still safe for them. While more difficult, a remedy to this issue may be to additionally regulate on intent (e.g. an internal email saying "let’s aim for 2% inaccuracy, we can get away with it" would be evidence of wrongdoing) or known ability (e.g. holding suppliers with better measurement capability to a higher standard). To return to forum posting: rational trolls will not break every unwritten rule while perfectly respecting all of the written ones, because they likely can’t be precise enough to do so consistently. Irrational trolls will try, fail, and be punished for it. Some trolls are better at staying close to the line without crossing it than others; moderators may want to hold these folks to higher standards or simply add a provision to punish those they believe to be acting within the rules but without good faith.

Comment

This is a good comment. I wonder how much regulator agencies do do that kind of thinking. It’s the sort of "here’s an interesting problem someone had, and here’s the interesting solution" story that I’d be interested in reading about, and would weakly expect to reach high visibility in my social circles if someone wrote an article about it. So I guess this is weak evidence that they don’t do that? But I’m certainly not confident one way or the other, and I’m interested in learning more. You’re right about the forum trolls; the 0.6 line is likely to cull some of them, and permitting an amount of arbitrariness in moderation will help cull others. I think the thing I’m trying to point at is still a relevant thing to keep in mind, but it’s not as clear-cut as I described.

Comment

I wonder how much regulator agencies do do that kind of thinking.They might do (civil services are generally more careful than you’d expect), or it may emerge as they adjust their rules in response to what companies do.

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=MZxEXKxn7wWksLZBG

I find this easier to parse from a non-neutral perspective: If all bad comments are (currently) overtly bad, you might think we could ban overt bad comments and win at moderation. But in fact, once the ban is in effect, the bad commenters might switch to covert bad comments instead. The ban isn’t necessarily wrong, but this effect has to be considered in the cost-benefit analysis.

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=hbbqfBNk3TD4QNeWC

If you say "food weights must be within 3% of what’s on the packaging", then they’ll be 3% below.How about "within 3%, and with an average no lower than the printed weight"? That seems it would work for a lot of different levels of precision on the part of the company.

Comment

That’s the correct solution for food weights, but this is sort of beyond philh’s point, which is just that those you govern will adapt their behavior to the rules you put in place.

Comment

That may be enough: https://​​xkcd.com/​​810/​​

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=DAzLEf6sTxygutEPy

In US law*, restrictions on speech can be content-based (e.g. banning white supremacist content, even if it’s polite) or content-neutral (e.g. banning insults by anyone). I think it maps rather well onto what you’re describing and is a better dichotomy than libertarian vs. non-libertarian. • Source: https://​​lawshelf.com/​​courseware/​​entry/​​limitations-on-expression

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=qEgz3A9w39hjm5uBg

In this approach, you concede a need to at least occasionally intervene in a particular kind of dispute such as banning the white supremacists Another problem with this is what does one mean by "white supremacists"? The definition used by the people who most advocate banning them tends to include anyone who believes in certain statements about the differences between races that are almost certainly true. For example, how race correlates with IQ. This is especially a problem for a forum that wants to be "rational".

Comment

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=ADdfSHF4NxPStGxtF

Yeah. Two decisions must be made: Do I want to ban ideology X? Is comment Y an example of ideology X? So even if you could somehow reach a consensus about which ideologies are banned and which are not, users will continue their wars by arguing whether given comment is an example (or a "dog whistle" for) one of the banned ideologies. If you interpret it too literally, everything you banned will come back, only the users will avoid using certain keywords, sometimes openly mocking the moderation system. Interpreting it otherwise will force you to express opinions on thousand topics, and any decision will seem biased in favor of some side.

https://www.lesswrong.com/posts/7xGcyB7RNdfDe5vxL/moderator-s-dilemma-the-risks-of-partial-intervention?commentId=a5uSgKZNuygt5yHmH

Sorry for the downvote (because I’m guessing you weren’t trying to needlessly debate the specific example you mention), but it seems pretty obvious that any ‘formal’ policy (i.e. ‘laws’) will create an opportunity for ‘lawyering’, e.g. arguing about the definition of specific formal terms.