automod: identical reply rule #466

bnewbold · 2023-12-07T15:25:08Z

Two enabling features:

cheap consistent non-cryptographic-strength hashing of strings for use in counter keys (went with uint64 murmur3, which was already in dependency tree)
ability to increment a counter for a single time period, to control counter key space growth (for redis)

This initial version of the rule counts replies to any other user in the same bucket, not distinct-accounts-with-same-reply-text. I'm a little worried about redis memory growth if we have a HyperLogLog for each author+text combination (as opposed to simple counter int). Maybe the redis implementation is clever and efficient for the small-distinct case? Or maybe RAM is cheap enough?

This branch will conflict with #464. Plan to merge that one first, then i'll rebase this one.

bnewbold · 2023-12-08T11:09:26Z

can trigger this against prod data with:

go run ./cmd/hepa/ process-recent hackerdarkweb.bsky.social

warpfork · 2023-12-11T16:43:20Z

automod/rules/replies.go

+	// use a specific period (IncrementPeriod()) to reduce the number of counters (one per unique post text)
+	period := automod.PeriodDay
+	bucket := evt.Account.Identity.DID.String() + "/" + HashOfString(post.Text)
+	if evt.GetCount("reply-text", bucket, period) >= identicalReplyLimit {


Right now, this is proceeding to act immediately on a hash collision. Do you think it would be reasonable to do a more expensive check to see if things are actually identical?

I don't know how bad a false positive is here. Maybe if the threshhold for identical in sheer count is moderate, it's unlikely to trigger on reasonable real human behavior?

I think the naive false positive rate (for the 64bit variant of murmur3) is low enough to not worry about it and not do secondary network requests to check for exact matches.

This isn't a cryptographic hash, so attacks could be a concern for some rules.

Generally, I feel like all the counters we are using here should be treated as a bit fuzzy, at least for record-level counts. It is totally possible for events to get partially-processed (and partially persisted) and then re-processed again after a crash. I think the semantics and kinds of rules and actions we write are generally resilient to this: doing things like reporting for human review, or having large margins before taking fully-automated action.

warpfork · 2023-12-11T16:47:19Z

automod/rules/replies.go

+	period := automod.PeriodDay
+	bucket := evt.Account.Identity.DID.String() + "/" + HashOfString(post.Text)
+	if evt.GetCount("reply-text", bucket, period) >= identicalReplyLimit {
+		evt.AddAccountFlag("multi-identical-reply")


Not new to this PR, but I think some docs on the semantic distinctions between AccountLabels, AccountFlags, and AccountReports are needed somewhere (and then perhaps most methods touching them should have a quick pointer to that doc). The last one of the group is close to self-explanatory, but the first two are both just []string in the code and don't autounpack their meaning very readily.

mmm! added doc comments to all the fields for most of the RepoEvent variants

warpfork

I think this is an overall LGTM, given that it can be iterated on further if the collision rate were to turn out problematically high.

Will have some merge conflicts to resolve.

bnewbold · 2023-12-14T17:18:19Z

Merged with main (copied additions to countstore over to that package), and added a bunch of doc comments.

I'm not too concerned about distinct counter collisions, but can revisit.

This PR is currently rebased on top of #466, to demonstrate testing that rule. **UPDATE:** that PR merged, so now against `main` Adds a `hepa` command to "capture" the current state of a real-world account: currently some account metadata (identity, profile, etc), plus some recent post records. This gets serialized to JSON for easy dumping to file, like: ```shell go run ./cmd/hepa/ capture-recent atproto.com > automod/testdata/capture_atprotocom.json ``` Then, a test helper function which loads this file, and processes all the post records using an engine fixture. Combined, these fixtures make it easy to do test-driven-development of new rules. You find an account which recently sent spam or violated some policy, take a capture snapshot, set up a test case, and then write a rule which triggers and satisfies the test. Some notes: - tried moving the "test helpers" in to a sub-package (`indigo/automod/automodtest`) but hit a circular import, so left where it is - this won't work with all rule types, and some captures/rules may need additional mocking (eg, additional identities in the mock directory), but that should be fine - it usually isn't appropriate to capture real-world content in to public code. we can be careful about what we add in this repo (indigo); the "hackerdarkweb" example included in this PR seems fine to snapshot to me. the code does strip "Private" account metadata by default. - probably could use docs/comments. i'm not sure where best to put effort, feedback welcome!

bnewbold added 3 commits December 7, 2023 23:15

automod: helper to fast-hash strings

1f49fc7

countstore: ability to increment specific time periods (not all)

7489349

automod: rule to identity repeated reply with identical text

8d6486d

bnewbold requested a review from warpfork December 8, 2023 11:09

bnewbold mentioned this pull request Dec 9, 2023

automod: test capture framework #470

Merged

warpfork reviewed Dec 11, 2023

View reviewed changes

warpfork approved these changes Dec 13, 2023

View reviewed changes

bnewbold added 5 commits December 14, 2023 23:36

Merge branch 'main' into bnewbold/identical-reply

8df80a3

Will have some merge conflicts to resolve.

countstore: resolve merge conflicts

002a887

automod: add doc strings to event fields

b5b3175

codespell

32c2df3

automod: doc strings on event methods

c10dfce

bnewbold merged commit 7d7e1f1 into main Dec 14, 2023
7 checks passed

bnewbold deleted the bnewbold/identical-reply branch December 14, 2023 17:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

automod: identical reply rule #466

automod: identical reply rule #466

bnewbold commented Dec 7, 2023

bnewbold commented Dec 8, 2023

warpfork Dec 11, 2023

bnewbold Dec 14, 2023

warpfork Dec 11, 2023

bnewbold Dec 14, 2023

warpfork left a comment

bnewbold commented Dec 14, 2023

automod: identical reply rule #466

automod: identical reply rule #466

Conversation

bnewbold commented Dec 7, 2023

bnewbold commented Dec 8, 2023

warpfork Dec 11, 2023

Choose a reason for hiding this comment

bnewbold Dec 14, 2023

Choose a reason for hiding this comment

warpfork Dec 11, 2023

Choose a reason for hiding this comment

bnewbold Dec 14, 2023

Choose a reason for hiding this comment

warpfork left a comment

Choose a reason for hiding this comment

bnewbold commented Dec 14, 2023