Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve query building routines of DML event queries, reducing time and allocations #1459

Merged
merged 4 commits into from
Oct 23, 2024

Conversation

danieljoos
Copy link
Contributor

@danieljoos danieljoos commented Oct 22, 2024

Description

This pull request adapts the query building routines for DML event queries.
Parts that never change throughout the migration will only be computed once.

Measurements

Tests with production load showed an increase in throughput of around 5-10%.
The tests have been conducted on separate testing replicas.

image

(🔵 blue: current version, 🔴 red: updated version)

The results of synthetic benchmarks can be found here: #1459 (comment)


In case this PR introduced Go code changes:

  • contributed code is using same conventions as original code
  • script/cibuild returns with no formatting errors, build errors or unit test errors.

@meiji163
Copy link
Contributor

meiji163 commented Oct 23, 2024

I believe the failing test case generated-columns-unique is unrelated. The error can be reproduced on the master branch by adding an UPDATE to the test case (#1461)

@danieljoos danieljoos force-pushed the danieljoos-dml-query-builders branch from 0c4db0e to 4608dcb Compare October 23, 2024 08:43
@danieljoos
Copy link
Contributor Author

danieljoos commented Oct 23, 2024

Find below the results of some synthetic benchmarks, using the Go benchmarking capabilities.
The benchmarks have been conducted on Codespace hosts.
Benchmark functions can be found in the details.

Benchmark Build DML Delete Query

Benchmark function
func BenchmarkApplierBuildDMLEventQuery_Delete(b *testing.B) {
	columns := sql.NewColumnList([]string{"id", "item_id", "name"})

	migrationContext := base.NewMigrationContext()
	migrationContext.DatabaseName = "test"
	migrationContext.OriginalTableName = "test"
	migrationContext.OriginalTableColumns = columns
	migrationContext.SharedColumns = columns
	migrationContext.MappedSharedColumns = columns
	migrationContext.UniqueKey = &sql.UniqueKey{
		Name:    "unique",
		Columns: *columns,
	}

	applier := NewApplier(migrationContext)
	applier.prepareQueries() // comment out in current version

	columnValues := sql.ToColumnValues([]interface{}{123, 456, "abc"})
	binlogEvent := &binlog.BinlogDMLEvent{
		DatabaseName:      "test",
		DML:               binlog.DeleteDML,
		WhereColumnValues: columnValues,
	}
	for range b.N {
		_ = applier.buildDMLEventQuery(binlogEvent)
	}
}

Current version

goos: linux
goarch: amd64
pkg: github.com/github/gh-ost/go/logic
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkApplierBuildDMLEventQuery_Delete-2   	  460914	      2994 ns/op	    1112 B/op	      45 allocs/op
PASS
ok  	github.com/github/gh-ost/go/logic	1.416s

Updated version

goos: linux
goarch: amd64
pkg: github.com/github/gh-ost/go/logic
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkApplierBuildDMLEventQuery_Delete-2   	 5844512	       186.2 ns/op	     128 B/op	       4 allocs/op
PASS
ok  	github.com/github/gh-ost/go/logic	1.306s

Benchmark Build DML Insert Query

Benchmark function
func BenchmarkApplierBuildDMLEventQuery_Insert(b *testing.B) {
	columns := sql.NewColumnList([]string{"id", "item_id"})

	migrationContext := base.NewMigrationContext()
	migrationContext.DatabaseName = "test"
	migrationContext.OriginalTableName = "test"
	migrationContext.OriginalTableColumns = columns
	migrationContext.SharedColumns = columns
	migrationContext.MappedSharedColumns = columns
	migrationContext.UniqueKey = &sql.UniqueKey{
		Name:    "unique",
		Columns: *columns,
	}

	applier := NewApplier(migrationContext)
	applier.prepareQueries() // comment out in current version
	columnValues := sql.ToColumnValues([]interface{}{1234, 5678})
	binlogEvent := &binlog.BinlogDMLEvent{
		DatabaseName:    "test",
		DML:             binlog.InsertDML,
		NewColumnValues: columnValues,
	}
	for range b.N {
		_ = applier.buildDMLEventQuery(binlogEvent)
	}
}

Current version

goos: linux
goarch: amd64
pkg: github.com/github/gh-ost/go/logic
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkApplierBuildDMLEventQuery_Insert-2   	  774986	      1713 ns/op	     600 B/op	      27 allocs/op
PASS
ok  	github.com/github/gh-ost/go/logic	1.353s

Updated version

goos: linux
goarch: amd64
pkg: github.com/github/gh-ost/go/logic
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkApplierBuildDMLEventQuery_Insert-2   	 9303057	       141.0 ns/op	     104 B/op	       3 allocs/op
PASS
ok  	github.com/github/gh-ost/go/logic	1.453s

Benchmark Build DML Update Query

Benchmark function
func BenchmarkApplierBuildDMLEventQuery_Update(b *testing.B) {
	columns := sql.NewColumnList([]string{"id", "item_id"})

	migrationContext := base.NewMigrationContext()
	migrationContext.DatabaseName = "test"
	migrationContext.OriginalTableName = "test"
	migrationContext.OriginalTableColumns = columns
	migrationContext.SharedColumns = columns
	migrationContext.MappedSharedColumns = columns
	migrationContext.UniqueKey = &sql.UniqueKey{
		Name:    "unique",
		Columns: *columns,
	}

	applier := NewApplier(migrationContext)
	applier.prepareQueries() // comment out in current version
	columnValues := sql.ToColumnValues([]interface{}{1234, 5678})
	binlogEvent := &binlog.BinlogDMLEvent{
		DatabaseName:      "test",
		DML:               binlog.UpdateDML,
		NewColumnValues:   columnValues,
		WhereColumnValues: columnValues,
	}
	for range b.N {
		_ = applier.buildDMLEventQuery(binlogEvent)
	}
}

Current version

goos: linux
goarch: amd64
pkg: github.com/github/gh-ost/go/logic
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkApplierBuildDMLEventQuery_Update-2   	  400082	      3062 ns/op	    1136 B/op	      52 allocs/op
PASS
ok  	github.com/github/gh-ost/go/logic	1.266s

Updated version

goos: linux
goarch: amd64
pkg: github.com/github/gh-ost/go/logic
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkApplierBuildDMLEventQuery_Update-2   	 3431607	       363.4 ns/op	     232 B/op	       6 allocs/op
PASS
ok  	github.com/github/gh-ost/go/logic	1.611s

@danieljoos danieljoos marked this pull request as ready for review October 23, 2024 11:45
@danieljoos danieljoos changed the title Use query builders for DML event queries Improve query building routines of DML event queries, reducing time and allocations Oct 23, 2024
@arthurschreiber arthurschreiber merged commit a834c00 into master Oct 23, 2024
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants