
Batch task queue user data persistence updates #7039

Open
wants to merge 15 commits into base: main

Conversation

@dnr (Member) commented Dec 30, 2024

What changed?

Multiple user data updates coming in for task queues in the same namespace within a short period of time get batched into a smaller number of persistence operations (transactions).

Since multiple updates are batched into one transaction, a conflict in one update can cause unrelated updates to fail. This is detected: a non-retryable error is returned for the conflicting update, and a retryable error is returned for the others.

Why?

With deployments, we sometimes have to update user data on multiple task queues at once (all in the same namespace), and on Cassandra these updates go through an LWT (lightweight transaction). This can cause a backlog, since the throughput of LWTs is fairly low.

This change allows batching of multiple updates into one persistence operation (an LWT on Cassandra or a transaction on SQL). The batching is transparent: updates that come in within a short period of time automatically get batched (in matching engine).

How did you test it?

  • unit test for batcher component
  • added some persistence tests for user data, including conflict behavior (there were none before)
  • existing tests for user data updates

Potential risks

  • small extra latency on all user data updates

@dnr dnr requested a review from a team as a code owner December 30, 2024 20:52
@ShahabT (Collaborator) left a comment

Generally lgtm, but someone else should review line-by-line.

if err != nil {
    if m.Db.IsDupEntryError(err) {
        return &persistence.ConditionFailedError{Msg: err.Error()}
    }
    return err
}
Contributor

With this type of error handling you will end up with a "partially updated" state, but you don't really know which ones passed (unless I'm missing something).
Is there a reason to stop on error? Or would it make sense to keep going and return an array of failed updates?

Member Author

It's a transaction, so you can't end up with things partially updated, right?

Contributor

It was not clear that all of this happens inside a transaction. Probably worth a comment.

// try to add more items. stop after a gap of MaxGap, a total time of MaxDelay, or
// MaxItems items.
maxWaitC, maxWaitT := s.clock.NewTimer(s.opts.MaxDelay)
loop:
Contributor

I'm not sure I understand all the possible implications, so I'll trust that this works and is covered by tests.

@stephanos (Contributor) left a comment

First half of my review. I'll review the stream_batcher next.

NamespaceID string
Updates     map[string]*SingleTaskQueueUserDataUpdate // key is task queue name
Contributor

Non-blocking: I usually name maps with ambiguous keys byXY, i.e. UpdatesByTaskQueue. Another option that seems reasonable is to add a new type taskQueueName.

Member Author

All these struct names are so long already... 😥

}

previous := make(map[string]any)
applied, iter, err := d.Session.MapExecuteBatchCAS(batch, previous)
Contributor

Retries/conflicts are not handled well yet: on a version conflict, all updates in the batch will fail and none will be retried, but the non-conflicting ones should be retried.

From a user perspective, is it acceptable to release it like that?

Member Author

It depends on what the user data change is. If it's initiated by a user RPC (versioning-1 or -2), the caller just gets an error and can re-run it, so that's probably fine. For versioning-3 it'll be initiated by a deployment workflow, either to register a new task queue with a deployment or to change the current deployment. The registration will get retried, so that's okay. For changing the current deployment, I'm not sure whether the error will get propagated back correctly or retried.

Basically the answer is: probably not. So I'll plan to fix it in this PR.

Member Author

Fixed now

// process batch
r := s.fn(items)

// send responses
Contributor

Maybe call out here (or somewhere else) that all batch items receive the same response. (Without the PR context, that would have been surprising.)

Member Author

The processor function is only called once per batch and returns one value... how could they receive anything other than the same response?

clk.AdvanceNext() // first Add
time.Sleep(time.Millisecond)
clk.AdvanceNext() // second Add
time.Sleep(time.Millisecond)
@stephanos (Contributor) commented Jan 10, 2025

tangential question: the time.Sleeps are necessary because of the use of context.Context, right? I wonder if there's also a "fake context" impl that we could connect with the timesource to make things like this fully deterministic.

Member Author

Nothing to do with Context; it's just that the other goroutines need to advance and get themselves blocked on the fake clock so that AdvanceNext does the right thing. (The fake clock already has a fake context, btw.)

The new synctest package in Go 1.24 fixes this by integrating with the runtime; I think it will make these tests a lot nicer.
