Skip to content

Django database backend for MariaDB Galera Cluster

License

Notifications You must be signed in to change notification settings

jur-clerkx/django-galera

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

django-galera

A Django application providing a database backend for MariaDB Galera Cluster.

Features

  • Read/Write Split: All writes will be routed to a single primary node, which greatly reduces deadlocks often seen with busy Django sites using Galera.
  • Loadbalancing: Randomly choose a secondary node for readonly queries to balance load over multiple database servers.
  • Optimistic transactions on secondary nodes: With Djangos ATOMIC_REQUESTS every request will be processed inside a transaction. Instead of connecting to the primary node once a transaction starts, django-galera will begin these transactions on a secondary node and only switches to the primary node once data is going to be changed.
  • Automatic and transparent failover: When a node fails, the backend will reconnect to a different node. After that a transaction replay will ensure data consistency, making a failure of both primary and secondary nodes transparent to the application.

User guide

Installation

You can install django-galera using pip:

$ python -m pip install django-galera

Configure as database backend

To start using django-galera, you'll have to change the ENGINE entry in your database configuration from django.db.backends.mysql to galera.backends.readwritesplit and add a NODES entry containing the nodes you want to use:

DATABASES = {
    'default': {
        'ATOMIC_REQUESTS': True,
        'ENGINE': 'galera.backends.readwritesplit',
        'PORT': 3306,
        'NAME': 'db_name',
        'USER': 'db_username',
        'PASSWORD': 'db_password',
        'OPTIONS' : {},
        'NODES': {
            'db1': {'HOST': '10.0.0.1'},
            'db2': {'HOST': '10.0.0.2'},
            'db3': {'HOST': '10.0.0.3'},
        }
    }
}

Backend specific options

Some features of django-galera can be configured to suit your needs by adding them to the OPTIONS entry of the database settings:

'OPTIONS' : {
    'failover_enable': True,
    'failover_history_limit': 1000,
    'optimistic_transactions': True,
    'wsrep_sync_after_write': True,
    'wsrep_sync_use_gtid': False,
}
Options
Option Type Default Description
failover_enable bool True Enable failover and transaction replay on another node when the current node fails.
failover_history_limit int 1000 Transaction replay keeps a list of every query and checksums of their results. In case of failure, they will be replayed on another node and the results compared to ensure data consistency. If there are more than this many entries in the list, failover and transaction replay will be disabled for the current transaction to prevent ever growing memory consumption.
optimistic_transactions bool True Enable optimistic transaction execution on secondary nodes, switching to primary node only once data is going to be changed. Depending on your application, you can disable this option if you have issues with data being changed by concurrent queries.
wsrep_sync_after_write bool True Although Galera allows replication to be almost instantaneous, it is still possible that changes written to the primary node have not yet been applied to the secondary node. If this option is set to True, django-galera will block until all changes have been written to the secondary node by making use of the variable wsrep_sync_wait.
wsrep_sync_use_gtid bool False Instead of using wsrep_sync_wait, django-galera can also utilize the more granular functions wsrep_last_written_gtid and wsrep_sync_wait_upto_gtid. As wsrep_last_written_gtid currently returns wrong values on MariaDB 10.5 and later, it is disabled by default and should not be used until MDEV-26359 is fixed.

Application and database on the same machine

In case the application and database server are running on the same machine, you can improve performance by having the application either connect to localhost using TCP/IP or using a socket. Setting the HOST entry in your database settings will make django-galera try this node first when choosing a secondary node. This can greatly improve performance by reducing network roundtrip time.

DATABASES = {
    'default': {
        # ...
        'HOST': 'localhost',
        # or 'HOST': '/var/run/mysqld/mysqld.sock',
        # ...
    }
}

Per node settings

Database settings like PORT, USER or PASSWORD can be changed per node by simply specifying them in the nodes settings:

'NODES': {
    'db1': {'HOST': '10.0.0.1', 'PORT': 3306, 'USERNAME': 'user1'},
    'db2': {'HOST': '10.0.0.2', 'PORT': 3307, 'USERNAME': 'user2'},
    'db3': {'HOST': '10.0.0.3', 'PORT': 3308, 'USERNAME': 'user3'},
}

Example configuration

This is an annotated example configuration for a 3-node cluster.

DATABASES = {
    'default': {
        'ATOMIC_REQUESTS': True,
        'ENGINE': 'galera.backends.readwritesplit',
        # if HOST is omitted, a random node will be used for secondary (readonly) access
        # HOST can still be set to prioritize a node (useful if app and db are running on the same machine)
        # 'HOST': 'localhost',                      # prefer a local node for faster access times
        # 'HOST': '/var/run/mysqld/mysqld.sock',    # also works with sockets
        'PORT': 3306,
        'NAME': 'db_name',
        'USER': 'db_username',
        'PASSWORD': 'db_password',
        'OPTIONS': {
            'failover_enable': True,  # enable transparent failover with transaction replay
            'failover_history_limit': 1000,  # disable replay for transactions reaching this limit (saves memory)
            'optimistic_transactions': True,  # enable optimistic transaction execution on secondary node
            'wsrep_sync_after_write': True,  # explicitly wait until writes from primary have been applied before reading from secondary
            'wsrep_sync_use_gtid': False,  # use WSREP_SYNC_UPTO_GTID for syncing secondary node (currently not recommended because of MariaDB issue MDEV-26359)
            # options are also attributes of django.db.connection and can be changed on the fly for the current connection
        },
        'NODES': {
            'db1': {'HOST': '10.0.0.1'},  # first node becomes primary and is preferred for read/write transactions
            'db2': {'HOST': '10.0.0.2'},  # following nodes are secondary nodes, used for readonly transactions
            'db3': {'HOST': '10.0.0.3'},
            # 'db4': {'HOST': '10.0.0.3', 'PORT': 3307},    # node settings inherit but can be overwritten
        }
    }
}

MIDDLEWARE = [
    # ...
    # Optional: the middleware repeats a request if a deadlock occurs
    # this middleware handles deadlocks caused by any view and other middleware coming after it
    'galera.middleware.GaleraMiddleware'
    # ...
]

About

Django database backend for MariaDB Galera Cluster

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%