automod: action limits; create reports for interaction churn #465

bnewbold · 2023-12-07T14:27:57Z

The motivation here is to start auto-reporting in production based on specific rules. Specifically, this PR would start reporting on interactions churn (follow/unfollow), letting human mods confirm before taking action on an account.

Before we do that, need to de-duplicate reports. For example, if an account creates thousands of spammy posts, only want to report the account once. Generally want to prevent run-away rules from creating millions of reports, or doing thousands of automated account takedowns (for example).

This PR prevents re-reporting based on daily counters (fast checks), as well as double-checking against the mod service API just before filing a report (slower, but reliable).

This PR also adds "quotas" for mod actions, implemented using the counter system. This isn't perfect (the counter system itself might be buggy or broken (causing duplicate actions), but seems like a good start. If quotas are exceeded, automod will log and skip taking additional actions until the next day.

warpfork

Looks mostly good to me.

I'm getting a little twitchy at the sheer size of the PersistAccountActions func, though. I think I read it, and I think I understand it, and together with the tests I think it's probably correct... but it's getting harder to be confident, and taking longer to read. It might be good to break it up soon. I tried to offer a few inline thoughts about where, but they're very preliminary and it's up to you if you can make any use of them :)

automod/event.go

warpfork · 2023-12-10T02:53:52Z

automod/event.go

+			e.Increment("automod-quota", "takedown")
+		}
+	}
+
 	if newTakedown || len(newLabels) > 0 || len(newFlags) > 0 || len(newReports) > 0 {


(again in the "i'm not quite sure how I'd factor these things out, but, thinking out loud" vein...)

These four condition clauses seem like a good focal point for guiding a (?future) refactor.

When I was reviewing this, I noticed myself checking this line against all the other branches below to see if a slack notification would be accurately sent for exactly every event that would have at least one other persisted effect.

I think the answer is still "yes", as intended. It's just getting a bit harder to see as all the other instances of the same condition are drifting further apart. Maybe roughly a function per section below that's gated on each of these clauses? Or perhaps if we bundle all these values in a small struct, so their collective role is easier to see (and maybe add some logging methods to that, as a bonus? because I think "what updates we're about to push" is actually about what belongs in the slack message too)?

Also: I believe all the needsPurge = true assignments and eventual check of that bool can be replaced by... just checking 3 of these 4 again, directly, too.

there is a bit of a corner case with reports, where it might not be known that they were not created until the actual attempt to do so (because there is an existing report). in that case a slack message gets sent out, but no report is created, and the account metadata cache is not purged.

I guess we could treat the skip-record-creation-at-last-minute thing as a real corner-case and always purge the cache?

Yeah, I was wondering about that too.

I was wondering if maybe the heavier query for prior reports should actually move earlier? Because it's also potentially mildly confusing if the slack notification gets sent out saying a report to happen but it's actually not.

It isn't super great how it is, but it does enable running with pushing to slack without actual admin-token access to the mod service, which I think is helpful as a possible way to deploy/operate, both for internal folks and third parties. The idea there is folks can develop rules and just manually independently report them in-app, based on a slack channel. Maybe that is too jank to care about, but I was running that way for a bit.

bnewbold · 2023-12-11T08:08:41Z

I agree overall, these persist functions have gotten unwieldy and hard to understand, even for me. I'll look in to refactoring.

…de-dupe

bnewbold · 2023-12-15T04:45:50Z

I took at pass at breaking this function out and refactoring it. The control flow and behavior should be the same (still pretty complex), but hopefully clearer and more readable.

automod/circuit_breaker_test.go

automod/event.go

warpfork

Looking pretty darn good to me!

Co-authored-by: Eric Myhre <[email protected]>

bnewbold requested a review from warpfork December 8, 2023 11:03

warpfork reviewed Dec 10, 2023

View reviewed changes

bnewbold added 7 commits December 15, 2023 00:27

automod: avoid duplicate account reports

96e3053

automod: default+prefix for report comments

a85a670

automod: action-level circuit breakers

44cd4a4

automod: additional logging when taking actions

9c3b940

automod: report on high interaction churn

c7393f8

automod: tests for circuit breaking and counter-based account report …

34e25b0

…de-dupe

review suggestion from warpfork

0d4a31f

bnewbold force-pushed the bnewbold/automod-action-limits branch from 1a1224c to 0d4a31f Compare December 14, 2023 17:31

bnewbold added 4 commits December 15, 2023 01:11

automod: refactor account persist in to smaller functions

c8db9db

automod: more action persist refactoring

89fc543

engine: check for more event errors

90ffed8

countstore: a bit more mem test coverage (defensive)

d349d3e

warpfork reviewed Dec 15, 2023

View reviewed changes

automod/circuit_breaker_test.go Outdated Show resolved Hide resolved

warpfork reviewed Dec 15, 2023

View reviewed changes

automod/event.go Show resolved Hide resolved

warpfork approved these changes Dec 15, 2023

View reviewed changes

Update automod/circuit_breaker_test.go

371cde1

Co-authored-by: Eric Myhre <[email protected]>

bnewbold merged commit fb51430 into main Dec 15, 2023
7 checks passed

bnewbold deleted the bnewbold/automod-action-limits branch December 15, 2023 12:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

automod: action limits; create reports for interaction churn #465

automod: action limits; create reports for interaction churn #465

bnewbold commented Dec 7, 2023 •

edited

Loading

warpfork left a comment

warpfork Dec 10, 2023 •

edited

Loading

bnewbold Dec 15, 2023

warpfork Dec 15, 2023

bnewbold Dec 15, 2023

bnewbold commented Dec 11, 2023

bnewbold commented Dec 15, 2023

warpfork left a comment

automod: action limits; create reports for interaction churn #465

automod: action limits; create reports for interaction churn #465

Conversation

bnewbold commented Dec 7, 2023 • edited Loading

warpfork left a comment

Choose a reason for hiding this comment

warpfork Dec 10, 2023 • edited Loading

Choose a reason for hiding this comment

bnewbold Dec 15, 2023

Choose a reason for hiding this comment

warpfork Dec 15, 2023

Choose a reason for hiding this comment

bnewbold Dec 15, 2023

Choose a reason for hiding this comment

bnewbold commented Dec 11, 2023

bnewbold commented Dec 15, 2023

warpfork left a comment

Choose a reason for hiding this comment

bnewbold commented Dec 7, 2023 •

edited

Loading

warpfork Dec 10, 2023 •

edited

Loading