Make the backfiller aware of revs and fill gaps #454
Conversation
Hrn... jobs don't recover from error states. Need to figure that one out.
return false, nil
case StateInProgress, StateEnqueued:
Why are we buffering ops on enqueued backfill jobs here? They won't have been checked out yet if they're still enqueued, so any events they get will be included in the checkout, right?
ah, that's true
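Since the conclusion above is that only in-progress jobs need buffering, here is a minimal sketch of what that per-state decision could look like. This is not the PR's actual code: the BufferOp helper, the bufferedOp struct, the rev field, and the StateComplete constant are assumptions layered on top of the StateInProgress/StateEnqueued names that do appear in the diff.

```go
// Sketch only, not the PR's code: placeholder types standing in for the
// backfill package's Gormjob so the state handling is runnable on its own.
package backfill

import (
	"fmt"
	"sync"

	"github.com/ipfs/go-cid"
	typegen "github.com/whyrusleeping/cbor-gen"
)

const (
	StateEnqueued   = "enqueued"
	StateInProgress = "in_progress"
	StateComplete   = "complete"
)

type bufferedOp struct {
	kind string
	path string
	rec  typegen.CBORMarshaler
	cid  *cid.Cid
}

type Gormjob struct {
	lk          sync.Mutex
	state       string
	rev         string // last event rev processed for this repo
	bufferedOps []*bufferedOp
}

// BufferOp decides, per job state, whether an incoming repo op has to be held
// until the backfill checkout finishes. It returns true when the op was buffered.
func (j *Gormjob) BufferOp(kind, path string, rec typegen.CBORMarshaler, c *cid.Cid) (bool, error) {
	j.lk.Lock()
	defer j.lk.Unlock()

	switch j.state {
	case StateComplete:
		// Backfill already finished; the caller can apply the op directly.
		return false, nil
	case StateEnqueued:
		// Not checked out yet, so the op will be captured by the checkout itself.
		return false, nil
	case StateInProgress:
		// Checkout is underway; hold the op for FlushBufferedOps.
		j.bufferedOps = append(j.bufferedOps, &bufferedOp{kind: kind, path: path, rec: rec, cid: c})
		return true, nil
	default:
		return false, fmt.Errorf("job in unhandled state: %q", j.state)
	}
}
```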
- func (j *Gormjob) FlushBufferedOps(ctx context.Context, fn func(kind, path string, rec *typegen.CBORMarshaler, cid *cid.Cid) error) error {
+ var ErrEventGap = fmt.Errorf("buffered event revs did not line up")
+ func (j *Gormjob) FlushBufferedOps(ctx context.Context, fn func(kind, path string, rec typegen.CBORMarshaler, cid *cid.Cid) error) error {
How does this work with deletes? IIRC the rec being a pointer worked with deletes because it'd be nil when deleting a record; does it just end up being an empty typegen.CBORMarshaler in this case?
typegen.CBORMarshaler is an interface, so it would just be nil
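Because typegen.CBORMarshaler is an interface, a delete can still be represented by passing nil for rec: the interface value itself is nil, much like the old nil *typegen.CBORMarshaler. A rough sketch reusing the placeholder types from the earlier snippet (the flush behavior and field names are assumptions, not the PR's implementation; context comes from the standard library):

```go
// Sketch only: drains buffered ops after the checkout completes. For delete
// ops, op.rec was stored as nil, so fn receives rec == nil and can treat it
// as a deletion, just as it did when the parameter was *typegen.CBORMarshaler.
func (j *Gormjob) FlushBufferedOps(ctx context.Context, fn func(kind, path string, rec typegen.CBORMarshaler, c *cid.Cid) error) error {
	j.lk.Lock()
	defer j.lk.Unlock()

	for _, op := range j.bufferedOps {
		if err := fn(op.kind, op.path, op.rec, op.cid); err != nil {
			return err
		}
	}
	j.bufferedOps = nil
	return nil
}
```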
Yeah, the way I typically handle this is by running a query to reset the job state on jobs that failed for a retryable reason, but it's not always clear which jobs are safe to retry etc.
need to figure something out there, maybe just keep a counter of how many times it's failed and do automatic retries up to a point?
would we do retries when we see more events from the repo, or somehow enqueue it again for a retry when it fails? would probably also want some kind of backoff in there.
i think we should automatically re-enqueue it after some duration
okay, we probably want two new columns then for the jobs, something like a
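One way the two columns could look, sketched with GORM since the job store is GORM-backed. The struct name, the retries / retry_after column names, and the backoff policy are all placeholders, not necessarily what the author had in mind; this assumes gorm.io/gorm and time are imported into the sketch package above.

```go
// Sketch only: hypothetical columns for bounded, delayed retries.
type GormDBJob struct {
	gorm.Model
	Repo       string `gorm:"unique"`
	State      string `gorm:"index"`
	Rev        string
	Retries    int        // how many times this job has failed so far
	RetryAfter *time.Time // earliest time the job may be picked up again
}

// markRetryable re-enqueues a failed job with a crude exponential backoff,
// giving up after maxRetries attempts.
func markRetryable(db *gorm.DB, repo string, attempt, maxRetries int) error {
	if attempt >= maxRetries {
		// Leave the job in its error state for manual inspection.
		return nil
	}
	next := time.Now().Add(time.Duration(1<<attempt) * time.Minute)
	return db.Model(&GormDBJob{}).
		Where("repo = ?", repo).
		Updates(map[string]interface{}{
			"state":       StateEnqueued,
			"retries":     attempt + 1,
			"retry_after": next,
		}).Error
}
```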
oh the other thing i haven't implemented yet is the gap fill handling logic, it spits out an error when we detect a gap and resets the job state, but when we actually go to process that we aren't filtering to just the gap records
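For that missing piece, a rough sketch of the filtering re-processing could do: once a gap job is re-enqueued, only records strictly newer than the rev the job already processed belong to the gap. This assumes revs are TID-like strings that order correctly under plain string comparison and reuses the placeholder Gormjob above; it is not the PR's eventual logic.

```go
// Sketch only: when handling a re-enqueued gap job, skip anything at or before
// the rev this job already saw, so only the gap records get applied.
func (j *Gormjob) shouldProcessGapRecord(recRev string) bool {
	j.lk.Lock()
	defer j.lk.Unlock()
	return recRev > j.rev
}
```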
cc8fe9a to be7d652