This document shows you how to perform various maintenance tasks related to the Postgres database server used by Matrix.
Table of contents:
-
Getting a database terminal, for when you wish to execute SQL queries
-
Vacuuming PostgreSQL, for when you wish to run a Postgres VACUUM (optimizing disk space)
-
Backing up PostgreSQL, for when you wish to make a backup
-
Upgrading PostgreSQL, for upgrading to new major versions of PostgreSQL. Such manual upgrades are sometimes required.
-
Tuning PostgreSQL to make it run faster
You can use the /matrix/postgres/bin/cli
tool to get interactive terminal access (psql) to the PostgreSQL server.
If you are using an external Postgres server, the above tool will not be available.
By default, this tool puts you in the matrix
database, which contains nothing.
To see the available databases, run \list
(or just \l
).
To change to another database (for example synapse
), run \connect synapse
(or just \c synapse
).
You can then proceed to write queries. Example: SELECT COUNT(*) FROM users;
Be careful. Modifying the database directly (especially as services are running) is dangerous and may lead to irreversible database corruption. When in doubt, consider making a backup.
Deleting lots data from Postgres does not make it release disk space, until you perform a VACUUM
operation.
You can run different VACUUM
operations via the playbook, with the default preset being vacuum-complete
:
- (default)
vacuum-complete
: stops all services temporarily and runsVACUUM FULL VERBOSE ANALYZE
. vacuum-full
: stops all services temporarily and runsVACUUM FULL VERBOSE
vacuum
: runsVACUUM VERBOSE
without stopping any servicesvacuum-analyze
runsVACUUM VERBOSE ANALYZE
without stopping any servicesanalyze
runsANALYZE VERBOSE
without stopping any services (this is just ANALYZE without doing a vacuum, so it's faster)
Note: for the vacuum-complete
and vacuum-full
presets, you'll need plenty of available disk space in your Postgres data directory (usually /matrix/postgres/data
). These presets also stop all services (e.g. Synapse, etc.) while the vacuum operation is running.
Example playbook invocations:
just run-tags run-postgres-vacuum
: runs the defaultvacuum-complete
preset and restarts all servicesjust run-tags run-postgres-vacuum -e postgres_vacuum_preset=analyze
: runs theanalyze
preset with all services remaining operational at all times
To automatically make Postgres database backups on a fixed schedule, see Setting up postgres backup.
To make a one off back up of the current PostgreSQL database, make sure it's running and then execute a command like this on the server:
/usr/bin/docker exec \
--env-file=/matrix/postgres/env-postgres-psql \
matrix-postgres \
/usr/local/bin/pg_dumpall -h matrix-postgres \
| gzip -c \
> /matrix/postgres.sql.gz
If you are using an external Postgres server, the above command will not work, because neither the credentials file (/matrix/postgres/env-postgres-psql
), nor the matrix-postgres
container is available.
Restoring a backup made this way can be done by importing it.
Unless you are using an external Postgres server, this playbook initially installs Postgres for you.
Once installed, the playbook attempts to preserve the Postgres version it starts with. This is because newer Postgres versions cannot start with data generated by older Postgres versions.
Upgrades must be performed manually.
This playbook can upgrade your existing Postgres setup with the following command:
just run-tags upgrade-postgres
The old Postgres data directory is backed up automatically, by renaming it to /matrix/postgres/data-auto-upgrade-backup
. To rename to a different path, pass some extra flags to the command above, like this: --extra-vars="postgres_auto_upgrade_backup_data_path=/another/disk/matrix-postgres-before-upgrade"
The auto-upgrade-backup directory stays around forever, until you manually decide to delete it.
As part of the upgrade, the database is dumped to /tmp
, an upgraded and empty Postgres server is started, and then the dump is restored into the new server. To use a different directory for the dump, pass some extra flags to the command above, like this: --extra-vars="postgres_dump_dir=/directory/to/dump/here"
To save disk space in /tmp
, the dump file is gzipped on the fly at the expense of CPU usage. If you have plenty of space in /tmp
and would rather avoid gzipping, you can explicitly pass a dump filename which doesn't end in .gz
. Example: --extra-vars="postgres_dump_name=matrix-postgres-dump.sql"
All databases, roles, etc. on the Postgres server are migrated.
PostgreSQL can be tuned to make it run faster. This is done by passing extra arguments to the Postgres process.
The Postgres Ansible role already does some tuning by default, which matches the tuning logic done by websites like https://pgtune.leopard.in.ua/. You can manually influence some of the tuning variables. These parameters (variables) are injected via the postgres_postgres_process_extra_arguments_auto
variable.
Most users should be fine with the automatically-done tuning. However, you may wish to:
-
adjust the automatically-determined tuning parameters manually: change the values for the tuning variables defined in the Postgres role's default configuration file (see
postgres_max_connections
,postgres_data_storage
etc). These variables are ultimately passed to Postgres via apostgres_postgres_process_extra_arguments_auto
variable -
turn automatically-performed tuning off: override it like this:
postgres_postgres_process_extra_arguments_auto: []
-
add additional tuning parameters: define your additional Postgres configuration parameters in
postgres_postgres_process_extra_arguments_custom
. Seepostgres_postgres_process_extra_arguments_auto
defined in the Postgres role's default configuration file for inspiration