Reduce intra cluster conflicts #5371

rnewson · 2024-12-29T18:06:45Z

Overview

CouchDB issues all write requests in parallel without coordination, applying a quorum on the results of those independent actions. When updating a document concurrently this can lead to the introduction of a stored conflict if two different writes reach separate nodes first. This is undesirable.

This patch changes fabric_doc_update in the following ways;

Workers are no longer started immediately, but are given a unique reference each.
For each range in the write request, one node is chosen to "lead" the write decision (calculated as the lowest live node that hosts the shard range)
"Leader" workers are started.
Any doc update that receives "conflict" from a Leader is added to the reply dict W times and the doc updates are removed from the other (unstarted) workers. If that leaves the worker with nothing to do, it is removed entirely.

Testing recommendations

There is some existing coverage in the module itself but more testing is needed before this can be merged.

Related Issues or Pull Requests

Checklist

Code is written and works correctly
Changes are covered by tests
Any new configurable parameters are documented in rel/overlay/etc/default.ini
Documentation changes were made in the src/docs folder
Documentation changes were backported (separated PR) to affected branches

rnewson · 2025-01-01T11:12:05Z

noting need for a liveness check at the start (so the 'leader' is always a live node as of the invocation) and also to consider maintenance mode (receipt of that message from a 'leader' specifically)

This should prevent spurious intra-cluster conflicts most of the time. It is not true consistency, however. spurious conflicts are still possible whenever the nodes in the cluster disagree on the current live set of other nodes.

ensure we start followers if leader node is down (rexi_DOWN) or is in maintenance mode (rexi_EXIT).

rnewson added 2 commits December 28, 2024 16:22

introduce acc record

5743e5b

delay start of 'follower' shards

1a1c34a

rnewson force-pushed the reduce-intra-cluster-conflicts branch 2 times, most recently from b727716 to 04000f7 Compare December 29, 2024 20:56

reject write at leader if conflict

cd57656

This should prevent spurious intra-cluster conflicts most of the time. It is not true consistency, however. spurious conflicts are still possible whenever the nodes in the cluster disagree on the current live set of other nodes.

rnewson force-pushed the reduce-intra-cluster-conflicts branch from 038b996 to 507b4c6 Compare January 2, 2025 19:52

start followers on any error

577dc25

ensure we start followers if leader node is down (rexi_DOWN) or is in maintenance mode (rexi_EXIT).

rnewson force-pushed the reduce-intra-cluster-conflicts branch from 507b4c6 to 577dc25 Compare January 6, 2025 10:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce intra cluster conflicts #5371

Reduce intra cluster conflicts #5371

rnewson commented Dec 29, 2024

rnewson commented Jan 1, 2025

Reduce intra cluster conflicts #5371

Are you sure you want to change the base?

Reduce intra cluster conflicts #5371

Conversation

rnewson commented Dec 29, 2024

Overview

Testing recommendations

Related Issues or Pull Requests

Checklist

rnewson commented Jan 1, 2025