404
+ +Page not found
+ + +diff --git a/assets/documentation/1.28.0-rc1/404.html b/assets/documentation/1.28.0-rc1/404.html new file mode 100644 index 000000000..7c619bdb8 --- /dev/null +++ b/assets/documentation/1.28.0-rc1/404.html @@ -0,0 +1,351 @@ + + +
+ + + + +Page not found
+ + +Warning
+As of CloudNativePG 1.26, native Barman Cloud support is deprecated in +favor of the Barman Cloud Plugin. This page has been moved to the appendix +for reference purposes. While the native integration remains functional for +now, we strongly recommend beginning a gradual migration to the plugin-based +interface after appropriate testing. For guidance, see +Migrating from Built-in CloudNativePG Backup.
+CloudNativePG natively supports online/hot backup of PostgreSQL +clusters through continuous physical backup and WAL archiving on an object +store. This means that the database is always up (no downtime required) +and that Point In Time Recovery is available.
+The operator can orchestrate a continuous backup infrastructure
+that is based on the Barman Cloud tool. Instead
+of using the classical architecture with a Barman server, which
+backs up many PostgreSQL instances, the operator relies on the
+barman-cloud-wal-archive, barman-cloud-check-wal-archive,
+barman-cloud-backup, barman-cloud-backup-list, and
+barman-cloud-backup-delete tools. As a result, base backups will
+be tarballs. Both base backups and WAL files can be compressed
+and encrypted.
For this, it is required to use an image with barman-cli-cloud included.
+You can use the image ghcr.io/cloudnative-pg/postgresql for this scope,
+as it is composed of a community PostgreSQL image and the latest
+barman-cli-cloud package.
Important
+Always ensure that you are running the latest version of the operands +in your system to take advantage of the improvements introduced in +Barman cloud (as well as improve the security aspects of your cluster).
+Changes in Barman Cloud 3.16+ and Bucket Creation
+Starting with Barman Cloud 3.16, most Barman Cloud commands no longer
+automatically create the target bucket, assuming it already exists. Only the
+barman-cloud-check-wal-archive command creates the bucket now. Whenever this
+is not the first operation run on an empty bucket, CloudNativePG will throw an
+error. As a result, to ensure reliable, future-proof operations and avoid
+potential issues, we strongly recommend that you create and configure your
+object store bucket before creating a Cluster resource that references it.
A backup is performed from a primary or a designated primary instance in a
+Cluster (please refer to
+replica clusters
+for more information about designated primary instances), or alternatively
+on a standby.
If you are looking for a specific object store such as +AWS S3, +Microsoft Azure Blob Storage, +Google Cloud Storage, or a compatible +provider, please refer to Appendix C - Common object stores for backups.
+WAL archiving is the process that feeds a WAL archive +in CloudNativePG.
+The WAL archive is defined in the .spec.backup.barmanObjectStore stanza of
+a Cluster resource.
Info
+Please refer to BarmanObjectStoreConfiguration
+in the barman-cloud API for a full list of options.
If required, you can choose to compress WAL files as soon as they +are uploaded and/or encrypt them:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ [...]
+ wal:
+ compression: gzip
+ encryption: AES256
+
+You can configure the encryption directly in your bucket, and the operator +will use it unless you override it in the cluster configuration.
+PostgreSQL implements a sequential archiving scheme, where the
+archive_command will be executed sequentially for every WAL
+segment to be archived.
Important
+By default, CloudNativePG sets archive_timeout to 5min, ensuring
+that WAL files, even in case of low workloads, are closed and archived
+at least every 5 minutes, providing a deterministic time-based value for
+your Recovery Point Objective (RPO). Even though you change the value
+of the archive_timeout setting in the PostgreSQL configuration,
+our experience suggests that the default value set by the operator is
+suitable for most use cases.
When the bandwidth between the PostgreSQL instance and the object +store allows archiving more than one WAL file in parallel, you +can use the parallel WAL archiving feature of the instance manager +like in the following example:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ [...]
+ wal:
+ compression: gzip
+ maxParallel: 8
+ encryption: AES256
+
+In the previous example, the instance manager optimizes the WAL +archiving process by archiving in parallel at most eight ready +WALs, including the one requested by PostgreSQL.
+When PostgreSQL will request the archiving of a WAL that has +already been archived by the instance manager as an optimization, +that archival request will be just dismissed with a positive status.
+CloudNativePG can manage the automated deletion of backup files from +the backup object store, using retention policies based on the recovery +window.
+Internally, the retention policy feature uses barman-cloud-backup-delete
+with --retention-policy “RECOVERY WINDOW OF {{ retention policy value }} {{ retention policy unit }}”.
For example, you can define your backups with a retention policy of 30 days as +follows:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ destinationPath: "<destination path here>"
+ s3Credentials:
+ accessKeyId:
+ name: aws-creds
+ key: ACCESS_KEY_ID
+ secretAccessKey:
+ name: aws-creds
+ key: ACCESS_SECRET_KEY
+ retentionPolicy: "30d"
+
+There's more ...
+The recovery window retention policy is focused on the concept of
+Point of Recoverability (PoR), a moving point in time determined by
+current time - recovery window. The first valid backup is the first
+available backup before PoR (in reverse chronological order).
+CloudNativePG must ensure that we can recover the cluster at
+any point in time between PoR and the latest successfully archived WAL
+file, starting from the first valid backup. Base backups that are older
+than the first valid backup will be marked as obsolete and permanently
+removed after the next backup is completed.
CloudNativePG by default archives backups and WAL files in an
+uncompressed fashion. However, it also supports the following compression
+algorithms via barman-cloud-backup (for backups) and
+barman-cloud-wal-archive (for WAL files):
The compression settings for backups and WALs are independent. See the +DataBackupConfiguration and +WALBackupConfiguration sections in +the barman-cloud API reference.
+It is important to note that archival time, restore time, and size change +between the algorithms, so the compression algorithm should be chosen according +to your use case.
+The Barman team has performed an evaluation of the performance of the supported +algorithms for Barman Cloud. The following table summarizes a scenario where a +backup is taken on a local MinIO deployment. The Barman GitHub project includes +a deeper analysis.
+| Compression | +Backup Time (ms) | +Restore Time (ms) | +Uncompressed size (MB) | +Compressed size (MB) | +Approx ratio | +
|---|---|---|---|---|---|
| None | +10927 | +7553 | +395 | +395 | +1:1 | +
| bzip2 | +25404 | +13886 | +395 | +67 | +5.9:1 | +
| gzip | +116281 | +3077 | +395 | +91 | +4.3:1 | +
| snappy | +8134 | +8341 | +395 | +166 | +2.4:1 | +
Barman 2.18 introduces support for tagging backup resources when saving them in
+object stores via barman-cloud-backup and barman-cloud-wal-archive. As a
+result, if your PostgreSQL container image includes Barman with version 2.18 or
+higher, CloudNativePG enables you to specify tags as key-value pairs
+for backup objects, namely base backups, WAL files and history files.
You can use two properties in the .spec.backup.barmanObjectStore definition:
tags: key-value pair tags to be added to backup objects and archived WAL
+ file in the backup object storehistoryTags: key-value pair tags to be added to archived history files in
+ the backup object storeThe excerpt of a YAML manifest below provides an example of usage of this +feature:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ [...]
+ tags:
+ backupRetentionPolicy: "expire"
+ historyTags:
+ backupRetentionPolicy: "keep"
+
+You can append additional options to the barman-cloud-backup and barman-cloud-wal-archive commands by using
+the additionalCommandArgs property in the
+.spec.backup.barmanObjectStore.data and .spec.backup.barmanObjectStore.wal sections respectively.
+These properties are lists of strings that will be appended to the
+barman-cloud-backup and barman-cloud-wal-archive commands.
For example, you can use the --read-timeout=60 to customize the connection
+reading timeout.
For additional options supported by barman-cloud-backup and barman-cloud-wal-archive commands you can refer to the
+official barman documentation here.
If an option provided in additionalCommandArgs is already present in the
+declared options in its section (.spec.backup.barmanObjectStore.data or .spec.backup.barmanObjectStore.wal), the extra option will be
+ignored.
The following is an example of how to use this property:
+For backups:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ [...]
+ data:
+ additionalCommandArgs:
+ - "--min-chunk-size=5MB"
+ - "--read-timeout=60"
+
+For WAL files:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ [...]
+ wal:
+ additionalCommandArgs:
+ - "--max-concurrency=1"
+ - "--read-timeout=60"
+
+You can recover from a backup created by Barman Cloud and stored on a supported
+object store. After you define the external cluster, including all the required
+configuration in the barmanObjectStore section, you need to reference it in
+the .spec.recovery.source option.
This example defines a recovery object store in a blob container in Azure:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-restore
+spec:
+ [...]
+
+ superuserSecret:
+ name: superuser-secret
+
+ bootstrap:
+ recovery:
+ source: clusterBackup
+
+ externalClusters:
+ - name: clusterBackup
+ barmanObjectStore:
+ destinationPath: https://STORAGEACCOUNTNAME.blob.core.windows.net/CONTAINERNAME/
+ azureCredentials:
+ storageAccount:
+ name: recovery-object-store-secret
+ key: storage_account_name
+ storageKey:
+ name: recovery-object-store-secret
+ key: storage_account_key
+ wal:
+ maxParallel: 8
+
+The previous example assumes that the application database and its owning user
+are named app by default. If the PostgreSQL cluster being restored uses
+different names, you must specify these names before exiting the recovery phase,
+as documented in "Configure the application database".
Important
+By default, the recovery method strictly uses the name of the
+cluster in the externalClusters section as the name of the main folder
+of the backup data within the object store. This name is normally reserved
+for the name of the server. You can specify a different folder name
+using the barmanObjectStore.serverName property.
Note
+This example takes advantage of the parallel WAL restore feature, +dedicating up to 8 jobs to concurrently fetch the required WAL files from the +archive. This feature can appreciably reduce the recovery time. Make sure that +you plan ahead for this scenario and correctly tune the value of this parameter +for your environment. It will make a difference when you need it, and you will.
+Important
+Please refer to the official Kubernetes documentation for a list of all +the supported Container Storage Interface (CSI) drivers +that provide snapshotting capabilities.
+CloudNativePG is one of the first known cases of database operators that +directly leverages the Kubernetes native Volume Snapshot API for both +backup and recovery operations, in an entirely declarative way.
+Volume snapshotting was first introduced in
+Kubernetes 1.12 (2018) as alpha,
+promoted to beta in 1.17 (2019),
+and moved to GA in 1.20 (2020).
+It’s now stable, widely available, and standard, providing 3 custom resource
+definitions: VolumeSnapshot, VolumeSnapshotContent and
+VolumeSnapshotClass.
This Kubernetes feature defines a generic interface for:
+Kubernetes delegates the actual implementation to the underlying CSI drivers +(not all of them support volume snapshots). Normally, storage classes that +provide volume snapshotting support incremental and differential block level +backup in a transparent way for the application, which can delegate the +complexity and the independent management down the stack, including +cross-cluster availability of the snapshots.
+For Volume Snapshots to work with a CloudNativePG cluster, you need to ensure
+that each storage class used to dynamically provision the PostgreSQL volumes
+(namely, storage and walStorage sections) support volume snapshots.
Given that instructions vary from storage class to storage class, please +refer to the documentation of the specific storage class and related CSI +drivers you have deployed in your Kubernetes system.
+Normally, it is the VolumeSnapshotClass
+that is responsible to ensure that snapshots can be taken from persistent
+volumes of a given storage class, and managed as VolumeSnapshot and
+VolumeSnapshotContent resources.
Important
+It is your responsibility to verify with the third party vendor +that volume snapshots are supported. CloudNativePG only interacts +with the Kubernetes API on this matter, and we cannot support issues +at the storage level for each specific CSI driver.
+CloudNativePG allows you to configure a given Postgres cluster for Volume
+Snapshot backups through the backup.volumeSnapshot stanza.
Info
+Please refer to VolumeSnapshotConfiguration
+in the API reference for a full list of options.
A generic example with volume snapshots (assuming that PGDATA and WALs share +the same storage class) is the following:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: snapshot-cluster
+spec:
+ instances: 3
+
+ storage:
+ storageClass: @STORAGE_CLASS@
+ size: 10Gi
+ walStorage:
+ storageClass: @STORAGE_CLASS@
+ size: 10Gi
+
+ backup:
+ # Volume snapshot backups
+ volumeSnapshot:
+ className: @VOLUME_SNAPSHOT_CLASS_NAME@
+
+ plugins:
+ - name: barman-cloud.cloudnative-pg.io
+ isWALArchiver: true
+ parameters:
+ barmanObjectName: @OBJECTSTORE_NAME@
+
+As you can see, the backup section contains both the volumeSnapshot stanza
+(controlling physical base backups on volume snapshots) and the
+plugins one (controlling the WAL archive).
Info
+Once you have defined the plugin, you can decide to use
+both volume snapshot and plugin backup strategies simultaneously
+to take physical backups.
The volumeSnapshot.className option allows you to reference the default
+VolumeSnapshotClass object used for all the storage volumes you have
+defined in your PostgreSQL cluster.
Info
+In case you are using a different storage class for PGDATA and
+WAL files, you can specify a separate VolumeSnapshotClass for
+that volume through the walClassName option (which defaults to
+the same value as className).
Once a cluster is defined for volume snapshot backups, you need to define
+a ScheduledBackup resource that requests such backups on a periodic basis.
Warning
+As noted in the backup document, a cold snapshot explicitly +set to target the primary will result in the primary being fenced for +the duration of the backup, making the cluster read-only during this +period. For safety, in a cluster already containing fenced instances, a cold +snapshot is rejected.
+By default, CloudNativePG requests an online/hot backup on volume snapshots, using the +PostgreSQL defaults of the low-level API for base backups:
+Important
+The default values are suitable for most production environments. Hot +backups are consistent and can be used to perform snapshot recovery, as we +ensure WAL retention from the start of the backup through a temporary +replication slot. However, our recommendation is to rely on cold backups for +that purpose.
+You can explicitly change the default behavior through the following options in
+the .spec.backup.volumeSnapshot stanza of the Cluster resource:
online: accepting true (default) or false as a valueonlineConfiguration.immediateCheckpoint: whether you want to request an
+ immediate checkpoint before you start the backup procedure or not;
+ technically, it corresponds to the fast argument you pass to the
+ pg_backup_start/pg_start_backup() function in PostgreSQL, accepting
+ true (default) or falseonlineConfiguration.waitForArchive: whether you want to wait for the
+ archiver to process the last segment of the backup or not; technically, it
+ corresponds to the wait_for_archive argument you pass to the
+ pg_backup_stop/pg_stop_backup() function in PostgreSQL, accepting true
+ (default) or falseIf you want to change the default behavior of your Postgres cluster to take
+cold backups by default, all you need to do is add the online: false option
+to your manifest, as follows:
# ...
+ backup:
+ volumeSnapshot:
+ online: false
+ # ...
+
+If you are instead requesting an immediate checkpoint as the default behavior, +you can add this section:
+ # ...
+ backup:
+ volumeSnapshot:
+ online: true
+ onlineConfiguration:
+ immediateCheckpoint: true
+ # ...
+
+You can change the default behavior defined in the cluster resource by setting
+different values for online and, if needed, onlineConfiguration in the Backup or ScheduledBackup objects.
For example, in case you want to issue an on-demand cold backup, you can
+create a Backup object with .spec.online: false:
apiVersion: postgresql.cnpg.io/v1
+kind: Backup
+metadata:
+ name: snapshot-cluster-cold-backup-example
+spec:
+ cluster:
+ name: snapshot-cluster
+ method: volumeSnapshot
+ online: false
+
+Similarly, for the ScheduledBackup:
+apiVersion: postgresql.cnpg.io/v1
+kind: ScheduledBackup
+metadata:
+ name: snapshot-cluster-cold-backup-example
+spec:
+ schedule: "0 0 0 * * *"
+ backupOwnerReference: self
+ cluster:
+ name: snapshot-cluster
+ method: volumeSnapshot
+ online: false
+
+By default, VolumeSnapshot objects created by CloudNativePG are retained after
+deleting the Backup object that originated them, or the Cluster they refer to.
+Such behavior is controlled by the .spec.backup.volumeSnapshot.snapshotOwnerReference
+option which accepts the following values:
none: no ownership is set, meaning that VolumeSnapshot objects persist
+ after the Backup and/or the Cluster resources are removedbackup: the VolumeSnapshot object is owned by the Backup resource that
+ originated it, and when the backup object is removed, the volume snapshot is
+ also removedcluster: the VolumeSnapshot object is owned by the Cluster resource that
+ is backed up, and when the Postgres cluster is removed, the volume snapshot is
+ also removedIn case a VolumeSnapshot is deleted, the deletionPolicy specified in the
+VolumeSnapshotContent is evaluated:
Retain, the VolumeSnapshotContent object is keptDelete, the VolumeSnapshotContent object is removed as wellWarning
+VolumeSnapshotContent objects do not keep all the information regarding the
+backup and the cluster they refer to (like the annotations and labels that
+are contained in the VolumeSnapshot object). Although possible, restoring
+from just this kind of object might not be straightforward. For this reason,
+our recommendation is to always backup the VolumeSnapshot definitions,
+even using a Kubernetes level data protection solution.
The value in VolumeSnapshotContent is determined by the deletionPolicy set
+in the corresponding VolumeSnapshotClass definition, which is
+referenced in the .spec.backup.volumeSnapshot.className option.
Please refer to the Kubernetes documentation on Volume Snapshot Classes +for details on this standard behavior.
+CloudNativePG supports backups using the volume snapshot method. In some +environments, volume snapshots may encounter temporary issues that can be +retried.
+The backup.cnpg.io/volumeSnapshotDeadline annotation defines how long
+CloudNativePG should continue retrying recoverable errors before marking the
+backup as failed.
You can add the backup.cnpg.io/volumeSnapshotDeadline annotation to both
+Backup and ScheduledBackup resources. For ScheduledBackup resources, this
+annotation is automatically inherited by any Backup resources created from
+the schedule.
If not specified, the default retry deadline is 10 minutes.
+When a retryable error occurs during a volume snapshot operation:
+CloudNativePG treats the following types of errors as retryable:
+You can add the annotation to a ScheduledBackup resource as follows:
apiVersion: postgresql.cnpg.io/v1
+kind: ScheduledBackup
+metadata:
+ name: daily-backup-schedule
+ annotations:
+ backup.cnpg.io/volumeSnapshotDeadline: "20"
+spec:
+ schedule: "0 0 * * *"
+ backupOwnerReference: self
+ method: volumeSnapshot
+ # other configuration...
+
+When you define a ScheduledBackup with the annotation, any Backup resources
+created from this schedule automatically inherit the specified timeout value.
In the following example, all backups created from the schedule will have a +30-minute timeout for retrying recoverable snapshot errors.
+apiVersion: postgresql.cnpg.io/v1
+kind: ScheduledBackup
+metadata:
+ name: weekly-backup
+ annotations:
+ backup.cnpg.io/volumeSnapshotDeadline: "30"
+spec:
+ schedule: "0 0 * * 0" # Weekly backup on Sunday
+ method: volumeSnapshot
+ cluster:
+ name: my-postgresql-cluster
+
+Alternatively, you can add the annotation directly to a Backup Resource:
apiVersion: postgresql.cnpg.io/v1
+kind: Backup
+metadata:
+ name: my-backup
+ annotations:
+ backup.cnpg.io/volumeSnapshotDeadline: "15"
+spec:
+ method: volumeSnapshot
+ # other backup configuration...
+
+The following example shows how to configure volume snapshot base backups on an
+EKS cluster on AWS using the ebs-sc storage class and the csi-aws-vsc
+volume snapshot class.
Important
+If you are interested in testing the example, please read +"Volume Snapshots" for the Amazon Elastic Block Store (EBS) CSI driver +for detailed instructions on the installation process for the storage class and the snapshot class.
+The following manifest creates a Cluster that is ready to be used for volume
+snapshots and that stores the WAL archive in a S3 bucket via IAM role for the
+Service Account (IRSA, see AWS S3):
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: hendrix
+spec:
+ instances: 3
+
+ storage:
+ storageClass: ebs-sc
+ size: 10Gi
+ walStorage:
+ storageClass: ebs-sc
+ size: 10Gi
+
+ backup:
+ volumeSnapshot:
+ className: csi-aws-vsc
+
+ plugins:
+ - name: barman-cloud.cloudnative-pg.io
+ isWALArchiver: true
+ parameters:
+ barmanObjectName: @OBJECTSTORE_NAME@
+
+ serviceAccountTemplate:
+ metadata:
+ annotations:
+ eks.amazonaws.com/role-arn: "@ARN@"
+---
+apiVersion: postgresql.cnpg.io/v1
+kind: ScheduledBackup
+metadata:
+ name: hendrix-vs-backup
+spec:
+ cluster:
+ name: hendrix
+ method: volumeSnapshot
+ schedule: '0 0 0 * * *'
+ backupOwnerReference: cluster
+ immediate: true
+
+The last resource defines daily volume snapshot backups at midnight, requesting +one immediately after the cluster is created.
+ +Warning
+As of CloudNativePG 1.26, native Barman Cloud support is deprecated in +favor of the Barman Cloud Plugin. While the native integration remains +functional for now, we strongly recommend beginning a gradual migration to +the plugin-based interface after appropriate testing. The Barman Cloud +Plugin documentation describes +how to use common object stores.
+You can store the backup files in any service that is supported +by the Barman Cloud infrastructure. That is:
+ +You can also use any compatible implementation of the supported services.
+The required setup depends on the chosen storage provider and is +discussed in the following sections.
+AWS Simple Storage Service (S3) is +a very popular object storage service offered by Amazon.
+As far as CloudNativePG backup is concerned, you can define the permissions to +store backups in S3 buckets in two ways:
+ACCESS_KEY_ID and ACCESS_SECRET_KEY credentialsYou will need the following information about your environment:
+ACCESS_KEY_ID: the ID of the access key that will be used
+ to upload files into S3
ACCESS_SECRET_KEY: the secret part of the access key mentioned above
ACCESS_SESSION_TOKEN: the optional session token, in case it is required
The access key used must have permission to upload files into +the bucket. Given that, you must create a Kubernetes secret with the +credentials, and you can do that with the following command:
+kubectl create secret generic aws-creds \
+ --from-literal=ACCESS_KEY_ID=<access key here> \
+ --from-literal=ACCESS_SECRET_KEY=<secret key here>
+# --from-literal=ACCESS_SESSION_TOKEN=<session token here> # if required
+
+The credentials will be stored inside Kubernetes and will be encrypted +if encryption at rest is configured in your installation.
+Once that secret has been created, you can configure your cluster like in +the following example:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ destinationPath: "<destination path here>"
+ s3Credentials:
+ accessKeyId:
+ name: aws-creds
+ key: ACCESS_KEY_ID
+ secretAccessKey:
+ name: aws-creds
+ key: ACCESS_SECRET_KEY
+
+The destination path can be any URL pointing to a folder where
+the instance can upload the WAL files, e.g.
+s3://BUCKET_NAME/path/to/folder.
In order to use IRSA you need to set an annotation in the ServiceAccount of
+the Postgres cluster.
We can configure CloudNativePG to inject them using the serviceAccountTemplate
+stanza:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+[...]
+spec:
+ serviceAccountTemplate:
+ metadata:
+ annotations:
+ eks.amazonaws.com/role-arn: arn:[...]
+ [...]
+
+Barman Cloud writes objects to S3, then does not update them until they are +deleted by the Barman Cloud retention policy. A recommended approach for an S3 +lifecycle policy is to expire the current version of objects a few days longer +than the Barman retention policy, enable object versioning, and expire +non-current versions after a number of days. Such a policy protects against +accidental deletion, and also allows for restricting permissions to the +CloudNativePG workload so that it may delete objects from S3 without granting +permissions to permanently delete objects.
+In case you're using S3-compatible object storage, like MinIO or +Linode Object Storage, you can specify an endpoint instead of using the +default S3 one.
+In this example, it will use the bucket of Linode in the region
+us-east1.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ destinationPath: "s3://bucket/"
+ endpointURL: "https://us-east1.linodeobjects.com"
+ s3Credentials:
+ [...]
+
+In case you're using Digital Ocean Spaces, you will have to use the Path-style syntax.
+In this example, it will use the bucket from Digital Ocean Spaces in the region SFO3.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ destinationPath: "s3://[your-bucket-name]/[your-backup-folder]/"
+ endpointURL: "https://sfo3.digitaloceanspaces.com"
+ s3Credentials:
+ [...]
+
+Suppose you configure an Object Storage provider which uses a certificate
+signed with a private CA, for example when using MinIO via HTTPS. In that case,
+you need to set the option endpointCA inside barmanObjectStore referring
+to a secret containing the CA bundle, so that Barman can verify the certificate
+correctly.
+You can find instructions on creating a secret using your cert files in the
+certificates document.
+Once you have created the secret, you can populate the endpointCA as in the
+following example:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ [...]
+ backup:
+ barmanObjectStore:
+ endpointURL: <myEndpointURL>
+ endpointCA:
+ name: my-ca-secret
+ key: ca.crt
+
+Note
+If you want ConfigMaps and Secrets to be automatically reloaded by instances, you can
+add a label with key cnpg.io/reload to the Secrets/ConfigMaps. Otherwise, you will have to reload
+the instances using the kubectl cnpg reload subcommand.
Azure Blob Storage is the +object storage service provided by Microsoft.
+In order to access your storage account for backup and recovery of +CloudNativePG managed databases, you will need one of the following +combinations of credentials:
+Using Azure AD Workload Identity, you can avoid saving the credentials into a Kubernetes Secret,
+and have a Cluster configuration adding the inheritFromAzureAD as follows:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ destinationPath: "<destination path here>"
+ azureCredentials:
+ inheritFromAzureAD: true
+
+On the other side, using both Storage account access key or Storage account SAS Token, +the credentials need to be stored inside a Kubernetes Secret, adding data entries only when +needed. The following command performs that:
+kubectl create secret generic azure-creds \
+ --from-literal=AZURE_STORAGE_ACCOUNT=<storage account name> \
+ --from-literal=AZURE_STORAGE_KEY=<storage account key> \
+ --from-literal=AZURE_STORAGE_SAS_TOKEN=<SAS token> \
+ --from-literal=AZURE_STORAGE_CONNECTION_STRING=<connection string>
+
+The credentials will be encrypted at rest, if this feature is enabled in the used +Kubernetes cluster.
+Given the previous secret, the provided credentials can be injected inside the cluster +configuration:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ destinationPath: "<destination path here>"
+ azureCredentials:
+ connectionString:
+ name: azure-creds
+ key: AZURE_CONNECTION_STRING
+ storageAccount:
+ name: azure-creds
+ key: AZURE_STORAGE_ACCOUNT
+ storageKey:
+ name: azure-creds
+ key: AZURE_STORAGE_KEY
+ storageSasToken:
+ name: azure-creds
+ key: AZURE_STORAGE_SAS_TOKEN
+
+When using the Azure Blob Storage, the destinationPath fulfills the following
+structure:
<http|https>://<account-name>.<service-name>.core.windows.net/<resource-path>
+
+where <resource-path> is <container>/<blob>. The account name,
+which is also called storage account name, is included in the used host name.
If you are using a different implementation of the Azure Blob Storage APIs,
+the destinationPath will have the following structure:
<http|https>://<local-machine-address>:<port>/<account-name>/<resource-path>
+
+In that case, <account-name> is the first component of the path.
This is required if you are testing the Azure support via the Azure Storage +Emulator or Azurite.
+Currently, the CloudNativePG operator supports two authentication methods for +Google Cloud Storage:
+GOOGLE_APPLICATION_CREDENTIALSWhen running inside Google Kubernetes Engine you can configure your backups to +simply rely on Workload Identity, +without having to set any credentials. In particular, you need to:
+.spec.backup.barmanObjectStore.googleCredentials.gkeEnvironment to trueiam.gke.io/gcp-service-account annotation in the serviceAccountTemplate stanzaPlease use the following example as a reference:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ [...]
+ backup:
+ barmanObjectStore:
+ destinationPath: "gs://<destination path here>"
+ googleCredentials:
+ gkeEnvironment: true
+
+ serviceAccountTemplate:
+ metadata:
+ annotations:
+ iam.gke.io/gcp-service-account: [...].iam.gserviceaccount.com
+ [...]
+
+Following the instruction from Google +you will get a JSON file that contains all the required information to authenticate.
+The content of the JSON file must be provided using a Secret that can be created
+with the following command:
kubectl create secret generic backup-creds --from-file=gcsCredentials=gcs_credentials_file.json
+
+This will create the Secret with the name backup-creds to be used in the yaml file like this:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ backup:
+ barmanObjectStore:
+ destinationPath: "gs://<destination path here>"
+ googleCredentials:
+ applicationCredentials:
+ name: backup-creds
+ key: gcsCredentials
+
+Now the operator will use the credentials to authenticate against Google Cloud Storage.
+Important
+This way of authentication will create a JSON file inside the container with all the needed +information to access your Google Cloud Storage bucket, meaning that if someone gets access to the pod +will also have write permissions to the bucket.
+Applications are supposed to work with the services created by CloudNativePG +in the same Kubernetes cluster.
+For more information on services and how to manage them, please refer to the +"Service management" section.
+Hint
+It is highly recommended using those services in your applications, +and avoiding connecting directly to a specific PostgreSQL instance, as the latter +can change during the cluster lifetime.
+You can use these services in your applications through:
+For the credentials to connect to PostgreSQL, you can +use the secrets generated by the operator.
+Connection Pooling
+Please refer to the "Connection Pooling" section for +information about how to take advantage of PgBouncer as a connection pooler, +and create an access layer between your applications and the PostgreSQL clusters.
+You can use the Kubernetes DNS service to point to a given server.
+The Kubernetes DNS service is required by the operator.
+You can do that by using the name of the service if the application is
+deployed in the same namespace as the PostgreSQL cluster.
+In case the PostgreSQL cluster resides in a different namespace, you can use the
+full qualifier: service-name.namespace-name.
DNS is the preferred and recommended discovery method.
+If you deploy your application in the same namespace that contains the +PostgreSQL cluster, you can also use environment variables to connect to the database.
+For example, suppose that your PostgreSQL cluster is called pg-database,
+you can use the following environment variables in your applications:
PG_DATABASE_R_SERVICE_HOST: the IP address of the service
+ pointing to all the PostgreSQL instances for read-only workloads
PG_DATABASE_RO_SERVICE_HOST: the IP address of the
+ service pointing to all hot-standby replicas of the cluster
PG_DATABASE_RW_SERVICE_HOST: the IP address of the
+ service pointing to the primary instance of the cluster
The PostgreSQL operator will generate up to two basic-auth type secrets for
+every PostgreSQL cluster it deploys:
[cluster name]-app (unless you have provided an existing secret through .spec.bootstrap.initdb.secret.name)[cluster name]-superuser (if .spec.enableSuperuserAccess is set to true
+ and you have not specified a different secret using .spec.superuserSecret)Each secret contain the following:
+.pgpass fileThe FQDN to be used in the URIs is calculated using the Kubernetes cluster
+domain specified in the KUBERNETES_CLUSTER_DOMAIN configuration parameter.
+See the operator configuration documentation for more information
+about that.
The -app credentials are the ones that should be used by applications
+connecting to the PostgreSQL cluster, and correspond to the user owning the
+database.
The -superuser ones are supposed to be used only for administrative purposes,
+and correspond to the postgres user.
Important
+Superuser access over the network is disabled by default.
+Hint
+For a deeper understanding, we recommend reading our article on the CNCF +blog post titled "Recommended Architectures for PostgreSQL in Kubernetes", +which provides valuable insights into best practices and design +considerations for PostgreSQL deployments in Kubernetes.
+This documentation page provides an overview of the key architectural +considerations for implementing a robust business continuity strategy when +deploying PostgreSQL in Kubernetes. These considerations include:
+postgres worker nodes: Isolating PostgreSQL workloads by
+ dedicating specific worker nodes to postgres tasks, ensuring optimal
+ performance and minimizing interference from other workloads.PostgreSQL is a database management system and, as such, it needs to be treated +as a stateful workload in Kubernetes. While stateless applications +mainly use traffic redirection to achieve High Availability (HA) and +Disaster Recovery (DR), in the case of a database, state must be replicated in +multiple locations, preferably in a continuous and instantaneous way, by +adopting either of the following two strategies:
+CloudNativePG relies on application-level replication, for a simple reason: the +PostgreSQL database management system comes with robust and reliable +built-in physical replication capabilities based on Write Ahead Log (WAL) +shipping, which have been used in production by millions of users all over +the world for over a decade.
+PostgreSQL supports both asynchronous and synchronous streaming replication +over the network, as well as asynchronous file-based log shipping (normally +used as a fallback option, for example, to store WAL files in an object store). +Replicas are usually called standby servers and can also be used for +read-only workloads, thanks to the Hot Standby feature.
+Important
+We recommend against storage-level replication with PostgreSQL, although +CloudNativePG allows you to adopt that strategy. For more information, please refer +to the talk given by Chris Milsted and Gabriele Bartolini at KubeCon NA 2022 entitled +"Data On Kubernetes, Deploying And Running PostgreSQL And Patterns For Databases In a Kubernetes Cluster" +where this topic was covered in detail.
+Kubernetes natively provides the possibility to span separate physical +locations - also known as data centers, failure zones, or more frequently +availability zones - connected to each other via redundant, low-latency, +private network connectivity.
+Being a distributed system, the recommended minimum number of availability +zones for a Kubernetes cluster is three (3), in order to make the control +plane resilient to the failure of a single zone. +For details, please refer to +"Running in multiple zones". +This means that each data center is active at any time and can run workloads +simultaneously.
+Note
+Most of the public Cloud Providers' managed Kubernetes services already +provide 3 or more availability zones in each region.
+The multi-availability zone Kubernetes architecture with three (3) or more +zones is the one that we recommend for PostgreSQL usage. +This scenario is typical of Kubernetes services managed by Cloud Providers.
+
Such an architecture enables the CloudNativePG operator to control the full
+lifecycle of a Cluster resource across the zones within a single Kubernetes
+cluster, by treating all the availability zones as active: this includes, among
+other operations,
+scheduling the workloads in a declarative manner (based on
+affinity rules, tolerations and node selectors), automated failover,
+self-healing, and updates. All will work seamlessly across the zones in a single
+Kubernetes cluster.
Please refer to the "PostgreSQL architecture" +section below for details on how you can design your PostgreSQL clusters within +the same Kubernetes cluster through shared-nothing deployments at the storage, +worker node, and availability zone levels.
+Additionally, you can leverage Kubernetes clusters +to deploy distributed PostgreSQL topologies hosting "passive" +PostgreSQL replica clusters in different regions and +managing them via declarative configuration. This setup is ideal for disaster +recovery (DR), read-only operations, or cross-region availability.
+Important
+Each operator deployment can only manage operations within its local +Kubernetes cluster. For operations across Kubernetes clusters, such as +controlled switchover or unexpected failover, coordination must be handled +manually (through GitOps, for example) or by using a higher-level cluster +management tool.
+
If your Kubernetes cluster has only one availability zone, CloudNativePG still +provides you with a lot of features to improve HA and DR outcomes for your +PostgreSQL databases, pushing the single point of failure (SPoF) to the level +of the zone as much as possible - i.e. the zone must have an outage before your +CloudNativePG clusters suffer a failure.
+This scenario is typical of self-managed on-premise Kubernetes clusters, where +only one data center is available.
+Single availability zone Kubernetes clusters are the only viable option when +only two data centers are available within reach of a low-latency +connection (typically in the same metropolitan area). Having only two zones +prevents the creation of a multi-availability zone Kubernetes cluster, which +requires a minimum of three zones. As a result, users must create two separate +Kubernetes clusters in an active/passive configuration, with the second cluster +primarily used for Disaster Recovery (see +the replica cluster feature).
+
Hint
+If you are at an early stage of your Kubernetes journey, please share this +document with your infrastructure team. The two data centers setup might +be simply the result of a "lift-and-shift" transition to Kubernetes +from a traditional bare-metal or VM based infrastructure, and the benefits +that Kubernetes offers in a 3+ zone scenario might not have been known, +or addressed at the time the infrastructure architecture was designed. +Ultimately, a third physical location connected to the other two might +represent a valid option to consider for organization, as it reduces the +overall costs of the infrastructure by moving the day-to-day complexity +from the application level down to the physical infrastructure level.
+Please refer to the "PostgreSQL architecture" +section below for details on how you can design your PostgreSQL clusters within +your single availability zone Kubernetes cluster through shared-nothing +deployments at the storage and worker node levels only. For HA, in such a +scenario it becomes even more important that the PostgreSQL instances be +located on different worker nodes and do not share the same storage.
+For DR, you can push the SPoF above the single zone, by using additional +Kubernetes clusters to define a +distributed topology hosting "passive" PostgreSQL replica clusters. +As with other Kubernetes workloads in this scenario, promotion of a Kubernetes +cluster as primary must be done manually.
+Through the replica cluster feature, you can define a +distributed PostgreSQL topology and coordinate a controlled switchover between +data centers by first demoting the primary cluster and then promoting the +replica cluster, without the need to re-clone the former primary. While failover +is now fully declarative, automated failover across Kubernetes clusters is not +within CloudNativePG's scope, as the operator can only function within a single +Kubernetes cluster.
+Important
+CloudNativePG provides all the necessary primitives and probes to +coordinate PostgreSQL active/passive topologies across different Kubernetes +clusters through a higher-level operator or management tool.
+Whether you're operating in a multi-availability zone environment or, more
+critically, within a single availability zone, we strongly recommend isolating
+PostgreSQL workloads by dedicating specific worker nodes exclusively to
+postgres in production. A Kubernetes worker node dedicated to running
+PostgreSQL workloads is referred to as a Postgres node or postgres node.
+This approach ensures optimal performance and resource allocation for your
+database operations.
Hint
+As a general rule of thumb, deploy Postgres nodes in multiples of +three—ideally with one node per availability zone. Three nodes is +an optimal number because it ensures that a PostgreSQL cluster with three +instances (one primary and two standby replicas) is distributed across +different nodes, enhancing fault tolerance and availability.
+In Kubernetes, this can be achieved using node labels and taints in a
+declarative manner, aligning with Infrastructure as Code (IaC) practices:
+labels ensure that a node is capable of running postgres workloads, while
+taints help prevent any non-postgres workloads from being scheduled on that
+node.
Important
+This methodology is the most straightforward way to ensure that PostgreSQL
+workloads are isolated from other workloads in terms of both computing
+resources and, when using locally attached disks, storage. While different
+PostgreSQL clusters may share the same node, you can take this a step further
+by using labels and taints to ensure that a node is dedicated to a single
+instance of a specific Cluster.
CloudNativePG recommends using the node-role.kubernetes.io/postgres label.
+Since this is a reserved label (*.kubernetes.io), it can only be applied
+after a worker node is created.
To assign the postgres label to a node, use the following command:
kubectl label node <NODE-NAME> node-role.kubernetes.io/postgres=
+
+To ensure that a Cluster resource is scheduled on a postgres node, you must
+correctly configure the .spec.affinity.nodeSelector stanza in your manifests.
+Here’s an example:
spec:
+ # <snip>
+ affinity:
+ # <snip>
+ nodeSelector:
+ node-role.kubernetes.io/postgres: ""
+
+CloudNativePG recommends using the node-role.kubernetes.io/postgres taint.
To assign the postgres taint to a node, use the following command:
kubectl taint node <NODE-NAME> node-role.kubernetes.io/postgres=:NoSchedule
+
+To ensure that a Cluster resource is scheduled on a node with a postgres taint, you must correctly configure the .spec.affinity.tolerations stanza in your manifests.
+Here’s an example:
spec:
+ # <snip>
+ affinity:
+ # <snip>
+ tolerations:
+ - key: node-role.kubernetes.io/postgres
+ operator: Exists
+ effect: NoSchedule
+
+CloudNativePG supports clusters based on asynchronous and synchronous +streaming replication to manage multiple hot standby replicas within the same +Kubernetes cluster, with the following specifications:
+Available services for applications:
+-rw: applications connect only to the primary instance of the cluster-ro: applications connect only to hot standby replicas for
+ read-only-workloads (optional)-r: applications connect to any of the instances for read-only
+ workloads (optional)Shared-nothing architecture recommended for better resilience of the PostgreSQL cluster:
+Important
+You can configure the above services through the managed.services section
+in the Cluster configuration. This can be done by reducing the number of
+services and selecting the type (default is ClusterIP). For more details,
+please refer to the "Service Management" section
+below.
The below diagram provides a simplistic view of the recommended shared-nothing +architecture for a PostgreSQL cluster spanning across 3 different availability +zones, running on separate nodes, each with dedicated local storage for +PostgreSQL data.
+
CloudNativePG automatically takes care of updating the above services if
+the topology of the cluster changes. For example, in case of failover, it
+automatically updates the -rw service to point to the promoted primary,
+making sure that traffic from the applications is seamlessly redirected.
Replication
+Please refer to the "Replication" section for more +information about how CloudNativePG relies on PostgreSQL replication, +including synchronous settings.
+Connecting from an application
+Please refer to the "Connecting from an application" section for +information about how to connect to CloudNativePG from a stateless +application within the same Kubernetes cluster.
+Connection Pooling
+Please refer to the "Connection Pooling" section for +information about how to take advantage of PgBouncer as a connection pooler, +and create an access layer between your applications and the PostgreSQL clusters.
+Applications can decide to connect to the PostgreSQL instance elected as +current primary by the Kubernetes operator, as depicted in the following +diagram:
+
Applications can use the -rw suffix service.
In case of temporary or permanent unavailability of the primary, for High
+Availability purposes CloudNativePG will trigger a failover, pointing the -rw
+service to another instance of the cluster.
Important
+Applications must be aware of the limitations that +Hot Standby +presents and familiar with the way PostgreSQL operates when dealing with +these workloads.
+Applications can access hot standby replicas through the -ro service made available
+by the operator. This service enables the application to offload read-only queries from the
+primary node.
The following diagram shows the architecture:
+
Applications can also access any PostgreSQL instance through the
+-r service.
Info
+CloudNativePG supports deploying PostgreSQL across multiple Kubernetes +clusters through a feature that allows you to define a distributed PostgreSQL +topology using replica clusters, as described in this section.
+In a distributed PostgreSQL cluster there can only be a single PostgreSQL +instance acting as a primary at all times. This means that applications can +only write inside a single Kubernetes cluster, at any time.
+However, for business continuity objectives it is fundamental to:
+In order to address the above concerns, CloudNativePG introduces the concept of +a PostgreSQL Topology that is distributed across different Kubernetes clusters +and is made up of a primary PostgreSQL cluster and one or more PostgreSQL +replica clusters. +This feature is called distributed PostgreSQL topology with replica clusters, +and it enables multi-cluster deployments in private, public, hybrid, and +multi-cloud contexts.
+A replica cluster is a separate Cluster resource that is in continuous
+recovery, replicating from another source, either via WAL shipping from a WAL
+archive or via streaming replication from a primary or a standby (cascading).
The diagram below depicts a PostgreSQL cluster spanning over two different +Kubernetes clusters, where the primary cluster is in the first Kubernetes +cluster and the replica cluster is in the second. The second Kubernetes cluster +acts as the company's disaster recovery cluster, ready to be activated in case +of disaster and unavailability of the first one.
+
A replica cluster can have the same architecture as the primary cluster. +Instead of a primary instance, a replica cluster has a designated primary +instance, which is a standby server with an arbitrary number of cascading +standby servers in streaming replication (symmetric architecture).
+The designated primary can be promoted at any time, transforming the replica +cluster into a primary cluster capable of accepting write connections. +This is typically triggered by:
+Warning
+CloudNativePG cannot perform any cross-cluster automated failover, as it +does not have authority beyond a single Kubernetes cluster. Such operations +must be performed manually or delegated to a multi-cluster/federated +cluster-aware authority.
+Important
+CloudNativePG allows you to control the distributed topology via +declarative configuration, enabling you to automate these procedures as part of +your Infrastructure as Code (IaC) process, including GitOps.
+In the example above, the designated primary receives WAL updates via streaming
+replication (primary_conninfo). As a fallback, it can retrieve WAL segments
+from an object store using file-based WAL shipping—for instance, with the
+Barman Cloud plugin through restore_command and barman-cloud-wal-restore.
CloudNativePG allows you to define topologies with multiple replica clusters. +You can also define replica clusters with a lower number of replicas, and then +increase this number when the cluster is promoted to primary.
+Replica clusters
+Please refer to the "Replica Clusters" section for +more detailed information on how physical replica clusters operate and how to +define a distributed topology with read-only clusters across different +Kubernetes clusters. This approach can significantly enhance your global +disaster recovery and high availability (HA) strategy.
+Info
+This section covers physical backups in PostgreSQL.
+While PostgreSQL also supports logical backups using the pg_dump utility,
+these are not suitable for business continuity and are not managed by
+CloudNativePG. If you still wish to use pg_dump, refer to the
+Troubleshooting / Emergency backup section
+for guidance.
Important
+Starting with version 1.26, native backup and recovery capabilities are +being progressively phased out of the core operator and moved to official +CNPG-I plugins. This transition aligns with CloudNativePG's shift towards a +backup-agnostic architecture, enabled by its extensible +interface—CNPG-I—which standardizes the management of WAL archiving, +physical base backups, and corresponding recovery processes.
+CloudNativePG currently supports physical backups of PostgreSQL clusters in +two main ways:
+Via CNPG-I plugins: the + CloudNativePG Community officially supports the Barman Cloud Plugin + for integration with object storage services.
+Natively, with support for:
+Before selecting a backup strategy with CloudNativePG, it's important to +familiarize yourself with the foundational concepts covered in the "Main Concepts" +section. These include WAL archiving, hot and cold backups, performing backups +from a standby, and more.
+PostgreSQL natively provides first class backup and recovery capabilities based +on file system level (physical) copy. These have been successfully used for +more than 15 years in mission critical production databases, helping +organizations all over the world achieve their disaster recovery goals with +Postgres.
+In CloudNativePG, the backup infrastructure for each PostgreSQL cluster is made +up of the following resources:
+PGDATA and any tablespace)CNPG-I provides a generic and extensible interface for managing WAL archiving +(both archive and restore operations), as well as the base backup and +corresponding restore processes.
+The WAL archive in PostgreSQL is at the heart of continuous backup, and it +is fundamental for the following reasons:
+Warning
+WAL archive alone is useless. Without a physical base backup, you cannot +restore a PostgreSQL cluster.
+In general, the presence of a WAL archive enhances the resilience of a +PostgreSQL cluster, allowing each instance to fetch any required WAL file from +the archive if needed (normally the WAL archive has higher retention periods +than any Postgres instance that normally recycles those files).
+This use case can also be extended to replica clusters, +as they can simply rely on the WAL archive to synchronize across long +distances, extending disaster recovery goals across different regions.
+When you configure a WAL archive, CloudNativePG provides +out-of-the-box an RPO <= 5 minutes for disaster +recovery, even across regions.
+Important
+Our recommendation is to always setup the WAL archive in production. +There are known use cases — normally involving staging and development +environments — where none of the above benefits are needed and the WAL +archive is not necessary. RPO in this case can be any value, such as +24 hours (daily backups) or infinite (no backup at all).
+Hot backups have already been defined in the previous section. They require the +presence of a WAL archive, and they are the norm in any modern database +management system.
+Cold backups, also known as offline backups, are instead physical base backups +taken when the PostgreSQL instance (standby or primary) is shut down. They are +consistent per definition, and they represent a snapshot of the database at the +time it was shut down.
+As a result, PostgreSQL instances can be restarted from a cold backup without +the need of a WAL archive, even though they can take advantage of it, if +available (with all the benefits on the recovery side highlighted in the +previous section).
+In those situations with a higher RPO (for example, 1 hour or 24 hours), and +shorter retention periods, cold backups represent a viable option to be considered +for your disaster recovery plans.
+CloudNativePG currently supports two main approaches for physical backups:
+Important
+CNPG-I is designed to enable third parties to build and integrate their own +backup plugins. Over time, we expect the ecosystem of supported backup +solutions to grow.
+Backups to an object store (e.g. AWS S3, Azure Blob, GCS):
+Native volume snapshots:
+The best approach depends on your environment and operational requirements. +Consider the following factors:
+| Feature | +Object Store | +Volume Snapshots | +
|---|---|---|
| WAL archiving | +Required | +Recommended1 | +
| Cold backup | +❌ | +✅ | +
| Hot backup | +✅ | +✅ | +
| Incremental copy | +❌ | +✅2 | +
| Differential copy | +❌ | +✅2 | +
| Backup from a standby | +✅ | +✅ | +
| Snapshot recovery | +❌3 | +✅ | +
| Retention policies | +✅ | +❌ | +
| Point-in-Time Recovery (PITR) | +✅ | +Requires WAL archive | +
| Underlying technology | +Barman Cloud | +Kubernetes API | +
++Notes:
++
+- WAL archiving must currently use an object store through a plugin (or the + deprecated native one).
+- Availability of incremental and differential copies depends on the + capabilities of the storage class used for PostgreSQL volumes.
+- Snapshot recovery can be emulated by using the +
+bootstrap.recovery.recoveryTarget.targetImmediateoption.
Scheduled backups are the recommended way to implement a reliable backup
+strategy in CloudNativePG. They are defined using the ScheduledBackup custom
+resource.
Info
+For a complete list of configuration options, refer to the
+ScheduledBackupSpec
+in the API reference.
The schedule field defines when the backup should occur, using a
+six-field cron expression that includes seconds. This format follows the
+Go cron package specification.
Warning
+This format differs from the traditional Unix/Linux crontab—it includes a
+seconds field as the first entry.
Example of a daily scheduled backup:
+apiVersion: postgresql.cnpg.io/v1
+kind: ScheduledBackup
+metadata:
+ name: backup-example
+spec:
+ schedule: "0 0 0 * * *" # At midnight every day
+ backupOwnerReference: self
+ cluster:
+ name: pg-backup
+ # method: plugin, volumeSnapshot, or barmanObjectStore (default)
+
+The schedule "0 0 0 * * *" triggers a backup every day at midnight
+(00:00:00). In Kubernetes CronJobs, the equivalent expression would be 0 0 * * *,
+since seconds are not supported.
Hint
+The frequency of your backups directly impacts your Recovery Time Objective +(RTO).
+To optimize your disaster recovery strategy based on continuous backup:
+In most cases, a weekly base backup is sufficient. It is rare to schedule +full backups more frequently than once per day.
+To trigger a backup immediately when the ScheduledBackup is created:
spec:
+ immediate: true
+
+To temporarily stop scheduled backups from running:
+spec:
+ suspend: true
+
+.spec.backupOwnerReference)Controls which Kubernetes object is set as the owner of the backup resource:
+none: No owner reference (legacy behavior)self: The ScheduledBackup object becomes the ownercluster: The PostgreSQL cluster becomes the ownerOn-demand backups allow you to manually trigger a backup operation at any time
+by creating a Backup resource.
Info
+For a full list of available options, see the
+BackupSpec in the
+API reference.
To start an on-demand backup, apply a Backup request custom resource like the
+following:
apiVersion: postgresql.cnpg.io/v1
+kind: Backup
+metadata:
+ name: backup-example
+spec:
+ method: barmanObjectStore
+ cluster:
+ name: pg-backup
+
+In this example, the operator will orchestrate the backup process using the
+barman-cloud-backup tool and store the backup in the configured object store.
You can check the status of the backup using:
+kubectl describe backup backup-example
+
+While the backup is in progress, you'll see output similar to:
+Name: backup-example
+Namespace: default
+...
+Spec:
+ Cluster:
+ Name: pg-backup
+Status:
+ Phase: running
+ Started At: 2020-10-26T13:57:40Z
+Events: <none>
+
+Once the backup has successfully completed, the phase will be set to
+completed, and the output will include additional metadata:
Name: backup-example
+Namespace: default
+...
+Status:
+ Backup Id: 20201026T135740
+ Destination Path: s3://backups/
+ Endpoint URL: http://minio:9000
+ Phase: completed
+ S3 Credentials:
+ Access Key Id:
+ Name: minio
+ Key: ACCESS_KEY_ID
+ Secret Access Key:
+ Name: minio
+ Key: ACCESS_SECRET_KEY
+ Server Name: pg-backup
+ Started At: 2020-10-26T13:57:40Z
+ Stopped At: 2020-10-26T13:57:44Z
+
+Important
+On-demand backups do not include Kubernetes secrets for the PostgreSQL +superuser or application user. You should ensure these secrets are included in +your broader Kubernetes cluster backup strategy.
+CloudNativePG currently supports the following backup methods for scheduled +and on-demand backups:
+plugin – Uses a CNPG-I plugin (requires .spec.pluginConfiguration)volumeSnapshot – Uses native Kubernetes volume snapshotsbarmanObjectStore – Uses Barman Cloud for object storage
+ (deprecated starting with v1.26 in favor of the
+ Barman Cloud Plugin,
+ but still the default for backward compatibility)Specify the method using the .spec.method field (defaults to
+barmanObjectStore).
If your cluster is configured to support volume snapshots, you can enable +scheduled snapshot backups like this:
+spec:
+ method: volumeSnapshot
+
+To use the Barman Cloud Plugin as the backup method, set method: plugin and
+configure the plugin accordingly. You can find an example in the
+"Performing a Base Backup" section of the plugin documentation
Taking a base backup involves reading the entire on-disk data set of a +PostgreSQL instance, which can introduce I/O contention and impact the +performance of the active workload.
+To reduce this impact, CloudNativePG supports taking backups from a standby +instance, leveraging PostgreSQL’s built-in capability to perform backups from +read-only replicas.
+By default, backups are performed on the most up-to-date replica in the +cluster. If no replicas are available, the backup will fall back to the +primary instance.
+Note
+The examples in this section are focused on backup target selection and do not
+take the backup method (spec.method) into account, as it is not relevant to
+the scope being discussed.
When prefer-standby is the target (the default behavior), CloudNativePG will
+attempt to:
This strategy minimizes interference with the primary’s workload.
+Warning
+Although the standby might not always be up to date with the primary,
+in the time continuum from the first available backup to the last
+archived WAL this is normally irrelevant. The base backup indeed
+represents the starting point from which to begin a recovery operation,
+including PITR. Similarly to what happens with
+pg_basebackup,
+when backing up from an online standby we do not force a switch of the WAL on the
+primary. This might produce unexpected results in the short term (before
+archive_timeout kicks in) in deployments with low write activity.
To always run backups on the primary instance, explicitly set the backup target
+to primary in the cluster configuration:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ [...]
+spec:
+ backup:
+ target: "primary"
+
+Warning
+Be cautious when using primary as the target for cold backups using
+volume snapshots, as this will require shutting down the primary instance
+temporarily—interrupting all write operations. The same caution applies to
+single-instance clusters, even if you haven't explicitly set the target.
You can override the cluster-level target on a per-backup basis, using either
+Backup or ScheduledBackup resources. Here's an example of an on-demand
+backup:
apiVersion: postgresql.cnpg.io/v1
+kind: Backup
+metadata:
+ [...]
+spec:
+ cluster:
+ name: [...]
+ target: "primary"
+
+In this example, even if the cluster’s default target is prefer-standby, the
+backup will be taken from the primary instance.
CloudNativePG is evolving toward a backup-agnostic architecture, where +backup responsibilities are delegated to external CNPG-I plugins. These +plugins are expected to offer advanced and customizable data protection +features, including sophisticated retention management, that go beyond the +built-in capabilities and scope of CloudNativePG.
+As part of this transition, the spec.backup.retentionPolicy field in the
+Cluster resource is deprecated and will be removed in a future release.
For more details on available retention features, refer to your chosen plugin’s documentation. +For example: "Retention Policies" with Barman Cloud Plugin.
+Important
+Users are encouraged to rely on the retention mechanisms provided by the +backup plugin they are using. This ensures better flexibility and consistency +with the backup method in use.
+Before we get started, it is essential to go over some terminology that is +specific to Kubernetes and PostgreSQL.
+node-role.kubernetes.io label and
+ taint, as proposed by CloudNativePG.
+ It is also referred to as a postgres node.kubectlkubectl is the command-line tool used to manage a Kubernetes cluster.CloudNativePG requires a Kubernetes version supported by the community. Please refer to the +"Supported releases" page for details.
+Cluster that is in continuous recovery mode from a selected
+ PostgreSQL cluster, normally residing outside the Kubernetes cluster. It is a
+ feature that enables multi-cluster deployments in private, public, hybrid, and
+ multi-cloud contexts.LOGIN and SUPERUSER
+ privileges. For security reasons, CloudNativePG performs administrative tasks
+ by connecting to the postgres database as the postgres user via peer
+ authentication over the local Unix Domain Socket.storage) and the volume for WALs (walStorage).Now that you have familiarized with the terminology, you can decide to +test CloudNativePG on your laptop using a local cluster before +deploying the operator in your selected cloud environment.
+ +The CNPG kubectl plugin provides an easy way for benchmarking a PostgreSQL deployment in Kubernetes using CloudNativePG.
+Benchmarking is focused on two aspects:
+ +Important
+pgbench and fio must be run in a staging or pre-production environment.
+Do not use these plugins in a production environment, as it might have
+catastrophic consequences on your databases and the other
+workloads/applications that run in the same shared environment.
The kubectl CNPG plugin command pgbench executes a user-defined pgbench job
+against an existing Postgres Cluster.
Through the --dry-run flag you can generate the manifest of the job for later
+modification/execution.
A common command structure with pgbench is the following:
kubectl cnpg pgbench \
+ -n <namespace> <cluster-name> \
+ --job-name <pgbench-job> \
+ --db-name <db-name> \
+ -- <pgbench options>
+
+Important
+Please refer to the pgbench documentation
+for information about the specific options to be used in your jobs.
This example creates a job called pgbench-init that initializes for pgbench
+OLTP-like purposes the app database in a Cluster named cluster-example,
+using a scale factor of 1000:
kubectl cnpg pgbench \
+ --job-name pgbench-init \
+ cluster-example \
+ -- --initialize --scale 1000
+
+Note
+This will generate a database with 100000000 records, taking approximately 13GB +of space on disk.
+You can see the progress of the job with:
+kubectl logs jobs/pgbench-run
+
+The following example creates a job called pgbench-run executing pgbench
+against the previously initialized database for 30 seconds, using a single
+connection:
kubectl cnpg pgbench \
+ --job-name pgbench-run \
+ cluster-example \
+ -- --time 30 --client 1 --jobs 1
+
+The next example runs pgbench against an existing database by using the
+--db-name flag and the pgbench namespace:
kubectl cnpg pgbench \
+ --db-name pgbench \
+ --job-name pgbench-job \
+ cluster-example \
+ -- --time 30 --client 1 --jobs 1
+
+By default, jobs do not expire. You can enable automatic deletion with the
+--ttl flag. The job will be deleted after the specified duration (in seconds).
kubectl cnpg pgbench \
+ --job-name pgbench-run \
+ --ttl 600 \
+ cluster-example \
+ -- --time 30 --client 1 --jobs 1
+
+If you want to run a pgbench job on a specific worker node, you can use
+the --node-selector option. Suppose you want to run the previous
+initialization job on a node having the workload=pgbench label, you can run:
kubectl cnpg pgbench \
+ --db-name pgbench \
+ --job-name pgbench-init \
+ --node-selector workload=pgbench \
+ cluster-example \
+ -- --initialize --scale 1000
+
+The job status can be fetched by running:
+kubectl get job/pgbench-job -n <namespace>
+
+NAME COMPLETIONS DURATION AGE
+job-name 1/1 15s 41s
+
+Once the job is completed the results can be gathered by executing:
+kubectl logs job/pgbench-job -n <namespace>
+
+The kubectl CNPG plugin command fio executes a fio job with default values
+and read operations.
+Through the --dry-run flag you can generate the manifest of the job for later
+modification/execution.
Note
+The kubectl plugin command fio will create a deployment with predefined
+fio job values using a ConfigMap. If you want to provide custom job values, we
+recommend generating a manifest using the --dry-run flag and providing your
+custom job values in the generated ConfigMap.
Example of default usage:
+kubectl cnpg fio <fio-name>
+
+Example with custom values:
+kubectl cnpg fio <fio-name> \
+ -n <namespace> \
+ --storageClass <name> \
+ --pvcSize <size>
+
+Example of how to run the fio command against a StorageClass named
+standard and pvcSize: 2Gi in the fio namespace:
kubectl cnpg fio fio-job \
+ -n fio \
+ --storageClass standard \
+ --pvcSize 2Gi
+
+The deployment status can be fetched by running:
+kubectl get deployment/fio-job -n fio
+
+NAME READY UP-TO-DATE AVAILABLE AGE
+fio-job 1/1 1 1 14s
+
+
+After running kubectl plugin command fio.
It will:
+fio-tools image for that.The Pod created by the deployment will be ready when it starts serving the +results. You can forward the port of the pod created by the deployment
+kubectl port-forward -n <namespace> deployment/<fio-name> 8000
+
+and then use a browser and connect to http://localhost:8000/ to get the data.
The default 8k block size has been chosen to emulate a PostgreSQL workload. +Disks that cap the amount of available IOPS can show very different throughput +values when changing this parameter.
+Below is an example diagram of sequential writes on a local disk +mounted on a dedicated Kubernetes node +(1 hour benchmark):
+
After all testing is done, fio deployment and resources can be deleted by:
+kubectl cnpg fio <fio-job-name> --dry-run | kubectl delete -f -
+
+make sure use the same name which was used to create the fio deployment and add namespace if applicable.
+ +This section describes the options available to create a new +PostgreSQL cluster and the design rationale behind them. +There are primarily two ways to bootstrap a new cluster:
+initdb)pg_basebackup)
+ or indirectly through a physical base backup (recovery)The initdb bootstrap also provides the option to import one or more
+databases from an existing PostgreSQL cluster, even if it's outside
+Kubernetes or running a different major version of PostgreSQL.
+For more detailed information about this feature, please refer to the
+"Importing Postgres databases" section.
Important
+Bootstrapping from an existing cluster enables the creation of a +replica cluster—an independent PostgreSQL cluster that remains in +continuous recovery, stays synchronized with the source cluster, and +accepts read-only connections. +For more details, refer to the Replica Cluster section.
+Warning
+CloudNativePG requires both the postgres user and database to
+always exist. Using the local Unix Domain Socket, it needs to connect
+as the postgres user to the postgres database via peer authentication in
+order to perform administrative tasks on the cluster.
+DO NOT DELETE the postgres user or the postgres database!!!
Info
+CloudNativePG is gradually introducing support for
+Kubernetes' native VolumeSnapshot API
+for both incremental and differential copy in backup and recovery
+operations - if supported by the underlying storage classes.
+Please see "Recovery from Volume Snapshot objects"
+for details.
bootstrap sectionThe bootstrap method can be defined in the bootstrap section of the cluster
+specification. CloudNativePG currently supports the following bootstrap methods:
initdb: initialize a new PostgreSQL cluster (default)recovery: create a PostgreSQL cluster by restoring from a base backup of an
+ existing cluster and, if needed, replaying all the available WAL files or up to
+ a given point in timepg_basebackup: create a PostgreSQL cluster by cloning an existing one of
+ the same major version using pg_basebackup through the streaming
+ replication protocol. This method is particularly useful for migrating
+ databases to CloudNativePG, although meeting all requirements can be
+ challenging. Be sure to review the warnings in the
+ pg_basebackup subsection
+ carefully.Only one bootstrap method can be specified in the manifest. +Attempting to define multiple bootstrap methods will result in validation errors.
+In contrast to the initdb method, both recovery and pg_basebackup
+create a new cluster based on another one (either offline or online) and can be
+used to spin up replica clusters. They both rely on the definition of external
+clusters.
+Refer to the replica cluster section for more information.
Given the amount of possible backup methods and combinations of backup
+storage that the CloudNativePG operator provides for recovery, please refer to
+the dedicated "Recovery" section for guidance on each method.
API reference
+Please refer to the "API reference for the bootstrap section
+for more information.
externalClusters sectionThe externalClusters section of the cluster manifest can be used to configure
+access to one or more PostgreSQL clusters as sources.
+The primary use cases include:
initdb bootstrap method.Info
+Ongoing development will extend the functionality of externalClusters to
+accommodate additional use cases, such as logical replication and foreign
+servers in future releases.
As far as bootstrapping is concerned, externalClusters can be used
+to define the source PostgreSQL cluster for either the pg_basebackup
+method or the recovery one. An external cluster needs to have:
source optionat least one of the following:
+Note
+A recovery object store is normally an AWS S3, Azure Blob Storage, +or Google Cloud Storage source that is managed by Barman Cloud.
+When only the streaming connection is defined, the source can be used for the
+pg_basebackup method. When only the recovery object store is defined, the
+source can be used for the recovery method. When both are defined, any of
+the two bootstrap methods can be chosen. The following table summarizes your
+options:
| Content of externalClusters | +pg_basebackup | +recovery | +
|---|---|---|
| Only streaming | +✓ | ++ |
| Only object store | ++ | ✓ | +
| Streaming and object store | +✓ | +✓ | +
Furthermore, in case of pg_basebackup or full recovery point in time, the
+cluster is eligible for replica cluster mode. This means that the cluster is
+continuously fed from the source, either via streaming, via WAL shipping
+through the PostgreSQL's restore_command, or any of the two.
API reference
+Please refer to the "API reference for the externalClusters section
+for more information.
Whenever a password is supplied within an externalClusters entry,
+CloudNativePG autonomously manages a PostgreSQL password file
+for it, residing at /controller/external/NAME/pgpass in each instance.
This approach enables CloudNativePG to securely establish connections with an
+external server without exposing any passwords in the connection string.
+Instead, the connection safely references the aforementioned file through the
+passfile connection parameter.
initdb)The initdb bootstrap method is used to create a new PostgreSQL cluster from
+scratch. It is the default one unless specified differently.
The following example contains the full structure of the initdb
+configuration:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-initdb
+spec:
+ instances: 3
+
+ bootstrap:
+ initdb:
+ database: app
+ owner: app
+ secret:
+ name: app-secret
+
+ storage:
+ size: 1Gi
+
+The above example of bootstrap will:
+PGDATA folder using PostgreSQL's native initdb commandappapp) using the one in the app-secret
+ secret (make sure that username matches the same name of the owner)app owned by the app user.Thanks to the convention over configuration paradigm, you can let the
+operator choose a default database name (app) and a default application
+user name (same as the database name), as well as randomly generate a
+secure password for both the superuser and the application user in
+PostgreSQL.
Alternatively, you can generate your password, store it as a secret, +and use it in the PostgreSQL cluster - as described in the above example.
+The supplied secret must comply with the specifications of the
+kubernetes.io/basic-auth type.
+As a result, the username in the secret must match the one of the owner
+(for the application secret) and postgres for the superuser one.
The following is an example of a basic-auth secret:
apiVersion: v1
+data:
+ username: YXBw
+ password: cGFzc3dvcmQ=
+kind: Secret
+metadata:
+ name: app-secret
+type: kubernetes.io/basic-auth
+
+The application database is the one that should be used to store application +data. Applications should connect to the cluster with the user that owns +the application database.
+Important
+If you need to create additional users, please refer to +"Declarative database role management".
+In case you don't supply any database name, the operator will proceed
+by convention and create the app database, and adds it to the cluster
+definition using a defaulting webhook.
+The user that owns the database defaults to the database name instead.
The application user is not used internally by the operator, which instead +relies on the superuser to reconcile the cluster with the desired status.
+initdbThe PostgreSQL data directory is initialized using the
+initdb PostgreSQL command.
CloudNativePG enables you to customize the behavior of initdb to modify
+settings such as default locale configurations and data checksums.
Warning
+CloudNativePG acts only as a direct proxy to initdb for locale-related
+options, due to the ongoing and significant enhancements in PostgreSQL's locale
+support. It is your responsibility to ensure that the correct options are
+provided, following the PostgreSQL documentation, and to verify that the
+bootstrap process completes successfully.
To include custom options in the initdb command, you can use the following
+parameters:
builtinLocale is set to a value, CloudNativePG passes it to the
+--builtin-locale option in initdb. This option controls the builtin locale, as
+defined in "Locale Support"
+from the PostgreSQL documentation (default: empty). Note that this option requires
+localeProvider to be set to builtin. Available from PostgreSQL 17.dataChecksums is set to true, CloudNativePG invokes the -k option in
+initdb to enable checksums on data pages and help detect corruption by the
+I/O system - that would otherwise be silent (default: false).encoding set to a value, CloudNativePG passes it to the --encoding
+option in initdb, which selects the encoding of the template database
+(default: UTF8).icuLocale is set to a value, CloudNativePG passes it to the
+--icu-locale option in initdb. This option controls the ICU locale, as
+defined in "Locale Support"
+from the PostgreSQL documentation (default: empty).
+Note that this option requires localeProvider to be set to icu.
+Available from PostgreSQL 15.icuRules is set to a value, CloudNativePG passes it to the
+--icu-rules option in initdb. This option controls the ICU locale, as
+defined in "Locale
+Support" from the
+PostgreSQL documentation (default: empty). Note that this option requires
+localeProvider to be set to icu. Available from PostgreSQL 16.locale is set to a value, CloudNativePG passes it to the --locale
+option in initdb. This option controls the locale, as defined in
+"Locale Support" from
+the PostgreSQL documentation. By default, the locale parameter is empty. In
+this case, environment variables such as LANG are used to determine the
+locale. Be aware that these variables can vary between container images,
+potentially leading to inconsistent behavior.localeCollate is set to a value, CloudNativePG passes it to the --lc-collate
+option in initdb. This option controls the collation order (LC_COLLATE
+subcategory), as defined in "Locale Support"
+from the PostgreSQL documentation (default: C).localeCType is set to a value, CloudNativePG passes it to the --lc-ctype option in
+initdb. This option controls the collation order (LC_CTYPE subcategory), as
+defined in "Locale Support"
+from the PostgreSQL documentation (default: C).localeProvider is set to a value, CloudNativePG passes it to the --locale-provider
+option in initdb. This option controls the locale provider, as defined in
+"Locale Support" from the
+PostgreSQL documentation (default: empty, which means libc for PostgreSQL).
+Available from PostgreSQL 15.walSegmentSize is set to a value, CloudNativePG passes it to the --wal-segsize
+option in initdb (default: not set - defined by PostgreSQL as 16 megabytes).Note
+The only two locale options that CloudNativePG implements during
+the initdb bootstrap refer to the LC_COLLATE and LC_TYPE subcategories.
+The remaining locale subcategories can be configured directly in the PostgreSQL
+configuration, using the lc_messages, lc_monetary, lc_numeric, and
+lc_time parameters.
The following example enables data checksums and sets the default encoding to
+LATIN1:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-initdb
+spec:
+ instances: 3
+
+ bootstrap:
+ initdb:
+ database: app
+ owner: app
+ dataChecksums: true
+ encoding: 'LATIN1'
+ storage:
+ size: 1Gi
+
+Warning
+CloudNativePG supports another way to customize the behavior of the
+initdb invocation, using the options subsection. However, given that there
+are options that can break the behavior of the operator (such as --auth or
+-d), this technique is deprecated and will be removed from future versions of
+the API.
You can specify a custom list of queries that will be executed once,
+immediately after the cluster is created and configured. These queries will be
+executed as the superuser (postgres) against three different databases, in
+this specific order:
postgres database (postInit section)template1 database (postInitTemplate section)postInitApplication section)For each of these sections, CloudNativePG provides two ways to specify custom +queries, executed in the following order:
+postInitSQL,
+ postInitTemplateSQL, and postInitApplicationSQL stanzas)postInitSQLRefs, postInitTemplateSQLRefs, and
+ postInitApplicationSQLRefs stanzas). Secrets are processed before ConfigMaps.Objects in each list will be processed sequentially.
+Warning
+Use the postInit, postInitTemplate, and postInitApplication options
+with extreme care, as queries are run as a superuser and can disrupt the entire
+cluster. An error in any of those queries will interrupt the bootstrap phase,
+leaving the cluster incomplete and requiring manual intervention.
Important
+Ensure the existence of entries inside the ConfigMaps or Secrets specified
+in postInitSQLRefs, postInitTemplateSQLRefs, and
+postInitApplicationSQLRefs, otherwise the bootstrap will fail. Errors in any
+of those SQL files will prevent the bootstrap phase from completing
+successfully.
The following example runs a single SQL query as part of the postInitSQL
+stanza:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-initdb
+spec:
+ instances: 3
+
+ bootstrap:
+ initdb:
+ database: app
+ owner: app
+ dataChecksums: true
+ localeCollate: 'en_US'
+ localeCType: 'en_US'
+ postInitSQL:
+ - CREATE DATABASE angus
+ storage:
+ size: 1Gi
+
+The example below relies on postInitApplicationSQLRefs to specify a secret
+and a ConfigMap containing the queries to run after the initialization on the
+application database:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-initdb
+spec:
+ instances: 3
+
+ bootstrap:
+ initdb:
+ database: app
+ owner: app
+ postInitApplicationSQLRefs:
+ secretRefs:
+ - name: my-secret
+ key: secret.sql
+ configMapRefs:
+ - name: my-configmap
+ key: configmap.sql
+ storage:
+ size: 1Gi
+
+Note
+Within SQL scripts, each SQL statement is executed in a single exec on the
+server according to the PostgreSQL semantics.
+Comments can be included, but internal commands like psql cannot.
CloudNativePG enables bootstrapping a cluster starting from
+another one of the same major version.
+This operation can be carried out either connecting directly to the source cluster via
+streaming replication (pg_basebackup), or indirectly via an existing
+physical base backup (recovery).
The source cluster must be defined in the externalClusters section, identified
+by name (our recommendation is to use the same name of the origin cluster).
Important
+By default the recovery method strictly uses the name of the
+cluster in the externalClusters section to locate the main folder
+of the backup data within the object store, which is normally reserved
+for the name of the server. Backup plugins provide ways to specify a
+different one. For example, the Barman Cloud Plugin provides the serverName parameter
+(by default assigned to the value of name in the external cluster definition).
recovery)Given the variety of backup methods and combinations of backup storage
+options provided by the CloudNativePG operator for recovery, please refer
+to the dedicated "Recovery" section for detailed guidance on
+each method.
pg_basebackup)The pg_basebackup bootstrap mode allows you to create a new cluster
+(target) as an exact physical copy of an existing and binary-compatible
+PostgreSQL instance (source) managed by CloudNativePG, using a valid
+streaming replication connection. The source instance can either be a primary
+or a standby PostgreSQL server. It’s crucial to thoroughly review the
+requirements section below, as the pros and cons of PostgreSQL physical
+replication fully apply.
The primary use cases for this method include:
+Important
+Avoid using this method, based on physical replication, to migrate an +existing PostgreSQL cluster outside of Kubernetes into CloudNativePG, unless you +are completely certain that all requirements are met and +the operation has been +thoroughly tested. The CloudNativePG community does not endorse this approach +for such use cases, and recommends using logical import instead. It is +exceedingly rare that all requirements for physical replication are met in a +way that seamlessly works with CloudNativePG.
+Warning
+In its current implementation, this method clones the source PostgreSQL +instance, thereby creating a snapshot. Once the cloning process has finished, +the new cluster is immediately started. +Refer to "Current limitations" for more details.
+Similar to the recovery bootstrap method, once the cloning operation is
+complete, the operator takes full ownership of the target cluster, starting
+from the first instance. This includes overriding certain configuration
+parameters as required by CloudNativePG, resetting the superuser password,
+creating the streaming_replica user, managing replicas, and more. The
+resulting cluster operates independently from the source instance.
Important
+Configuring the network connection between the target and source instances +lies outside the scope of CloudNativePG documentation, as it depends heavily on +the specific context and environment.
+The streaming replication client on the target instance, managed transparently
+by pg_basebackup, can authenticate on the source instance using one of the
+following methods:
Both authentication methods are detailed below.
+The following requirements apply to the pg_basebackup bootstrap method:
max_wal_senders to grant
+ access from the target for this one-off operation by providing at least
+ one walsender for the backup plus one for WAL streamingREPLICATION LOGIN privileges and must accept
+ connections from the target instance for this role in pg_hba.conf, preferably
+ via TLS (see "About the replication user" below)REPLICATION LOGIN privilegesSeealso
+For further information, please refer to the
+"Planning" section for Warm Standby,
+the
+pg_basebackup page
+and the
+"High Availability, Load Balancing, and Replication" chapter
+in the PostgreSQL documentation.
As explained in the requirements section, you need to have a user
+with either the SUPERUSER or, preferably, just the REPLICATION
+privilege in the source instance.
If the source database is created with CloudNativePG, you
+can reuse the streaming_replica user and take advantage of client
+TLS certificates authentication (which, by default, is the only allowed
+connection method for streaming_replica).
For all other cases, including outside Kubernetes, please verify that
+you already have a user with the REPLICATION privilege, or create
+a new one by following the instructions below.
As postgres user on the source system, please run:
createuser -P --replication streaming_replica
+
+Enter the password at the prompt and save it for later, as you +will need to add it to a secret in the target instance.
+Note
+Although the name is not important, we will use streaming_replica
+for the sake of simplicity. Feel free to change it as you like,
+provided you adapt the instructions in the following sections.
The first authentication method supported by CloudNativePG
+with the pg_basebackup bootstrap is based on username and password matching.
Make sure you have the following information before you start the procedure:
+streaming_replica for simplicity)You might need to add a line similar to the following to the pg_hba.conf
+file on the source PostgreSQL instance:
# A more restrictive rule for TLS and IP of origin is recommended
+host replication streaming_replica all md5
+
+The following manifest creates a new PostgreSQL 18.0 cluster,
+called target-db, using the pg_basebackup bootstrap method
+to clone an external PostgreSQL cluster defined as source-db
+(in the externalClusters array). As you can see, the source-db
+definition points to the source-db.foo.com host and connects as
+the streaming_replica user, whose password is stored in the
+password key of the source-db-replica-user secret.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: target-db
+spec:
+ instances: 3
+ imageName: ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie
+
+ bootstrap:
+ pg_basebackup:
+ source: source-db
+
+ storage:
+ size: 1Gi
+
+ externalClusters:
+ - name: source-db
+ connectionParameters:
+ host: source-db.foo.com
+ user: streaming_replica
+ password:
+ name: source-db-replica-user
+ key: password
+
+All the requirements must be met for the clone operation to work, including +the same PostgreSQL version (in our case 18.0).
+The second authentication method supported by CloudNativePG
+with the pg_basebackup bootstrap is based on TLS client certificates.
+This is the recommended approach from a security standpoint.
The following example clones an existing PostgreSQL cluster (cluster-example)
+in the same Kubernetes cluster.
Note
+This example can be easily adapted to cover an instance that resides +outside the Kubernetes cluster.
+The manifest defines a new PostgreSQL 18.0 cluster called cluster-clone-tls,
+which is bootstrapped using the pg_basebackup method from the cluster-example
+external cluster. The host is identified by the read/write service
+in the same cluster, while the streaming_replica user is authenticated
+thanks to the provided keys, certificate, and certification authority
+information (respectively in the cluster-example-replication and
+cluster-example-ca secrets).
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-clone-tls
+spec:
+ instances: 3
+ imageName: ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie
+
+ bootstrap:
+ pg_basebackup:
+ source: cluster-example
+
+ storage:
+ size: 1Gi
+
+ externalClusters:
+ - name: cluster-example
+ connectionParameters:
+ host: cluster-example-rw.default.svc
+ user: streaming_replica
+ sslmode: verify-full
+ sslKey:
+ name: cluster-example-replication
+ key: tls.key
+ sslCert:
+ name: cluster-example-replication
+ key: tls.crt
+ sslRootCert:
+ name: cluster-example-ca
+ key: ca.crt
+
+We also support to configure the application database for cluster which bootstrap
+from a live cluster, just like the case of initdb and recovery bootstrap method.
+If the new cluster is created as a replica cluster (with replica mode enabled), application
+database configuration will be skipped.
Important
+While the Cluster is in recovery mode, no changes to the database,
+including the catalog, are permitted. This restriction includes any role
+overrides, which are deferred until the Cluster transitions to primary.
+During the recovery phase, roles remain as defined in the source cluster.
The example below configures the app database with the owner app and
+the password stored in the provided secret app-secret, following the
+bootstrap from a live cluster.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ bootstrap:
+ pg_basebackup:
+ database: app
+ owner: app
+ secret:
+ name: app-secret
+ source: cluster-example
+
+With the above configuration, the following will happen only after recovery is +completed:
+app database does not exist, it will be created.app user does not exist, it will be created.app user is not the owner of the app database, ownership will be
+ granted to the app user.username value matches the owner value in the secret, the
+ password for the application user (the app user in this case) will be
+ updated to the password value in the secret.The pg_basebackup method takes a snapshot of the source instance in the form of
+a PostgreSQL base backup. All transactions written from the start of
+the backup to the correct termination of the backup will be streamed to the target
+instance using a second connection (see the --wal-method=stream option for
+pg_basebackup).
Once the backup is completed, the new instance will be started on a new timeline +and diverge from the source. +For this reason, it is advised to stop all write operations to the source database +before migrating to the target database.
+Note that this limitation applies only if the target cluster is not defined as +a replica cluster.
+Important
+Before you attempt a migration, you must test both the procedure +and the applications. In particular, it is fundamental that you run the migration +procedure as many times as needed to systematically measure the downtime of your +applications in production.
+CloudNativePG was designed to natively support TLS certificates. +To set up a cluster, the operator requires:
+Note
+You can find all the secrets used by the cluster and their expiration dates +in the cluster's status.
+CloudNativePG is very flexible when it comes to TLS certificates. It +primarily operates in two modes:
+You can also choose a hybrid approach, where only part of the certificates is +generated outside CNPG.
+Note
+The operator and instances verify server certificates against the CA only,
+disregarding the DNS name. This approach is due to the typical absence of DNS
+names in user-provided certificates for the <cluster>-rw service used for
+communication within the cluster.
By default, the operator automatically generates a single Certificate Authority +(CA) to issue both client and server certificates. These certificates are +managed continuously by the operator, with automatic renewal 7 days before +expiration (within a 90-day validity period).
+Info
+You can adjust this default behavior by configuring the
+CERTIFICATE_DURATION and EXPIRING_CHECK_THRESHOLD environment variables.
+For detailed guidance, refer to the Operator Configuration.
Important
+Certificate renewal does not cause any downtime for the PostgreSQL server, +as a simple reload operation is sufficient. However, any user-managed +certificates not controlled by CloudNativePG must be re-issued following the +renewal process.
+When generating certificates, the operator assumes that the Kubernetes
+cluster's DNS zone is set to cluster.local by default. This behavior can be
+customized by setting the KUBERNETES_CLUSTER_DOMAIN environment variable. A
+convenient alternative is to use the operator's configuration capability.
The operator generates a self-signed CA and stores it in a generic secret +containing the following keys:
+ca.crt – CA certificate used to validate the server certificate, used as
+ sslrootcert in clients' connection strings.ca.key – The key used to sign the server SSL certificate automatically.The operator uses the generated self-signed CA to sign a server TLS
+certificate. It's stored in a secret of type kubernetes.io/tls and configured
+to be used as ssl_cert_file and ssl_key_file by the instances. This
+approach enables clients to verify their identity and connect securely.
In addition to the default ones, you can specify DNS server alternative names +as part of the generated server TLS secret.
+By default, the same self-signed CA as the server CA is used. The public part
+is passed as ssl_ca_file to all the instances so it can verify client
+certificates it signed. The private key is stored in the same secret and used
+to sign client certificates generated by the kubectl cnpg plugin.
streaming_replica certificateThe operator uses the generated self-signed CA to sign a client certificate for
+the user streaming_replica, storing it in a secret of type
+kubernetes.io/tls. To allow secure connection to the primary instance, this
+certificate is passed as sslcert and sslkey in the replicas' connection
+strings.
If required, you can also provide the two server certificates, generating them +using a separate component such as cert-manager. +To use a custom server TLS certificate for a cluster, you must specify +the following parameters:
+serverTLSSecret – The name of a secret of type kubernetes.io/tls
+ containing the server TLS certificate. It must contain both the standard
+ tls.crt and tls.key keys.serverCASecret – The name of a secret containing the ca.crt key.Note
+The operator still creates and manages the two secrets related to client +certificates.
+Note
+The operator and instances verify server certificates against the CA only,
+disregarding the DNS name. This approach is due to the typical absence of DNS
+names in user-provided certificates for the <cluster>-rw service used for
+communication within the cluster.
Note
+If you want ConfigMaps and secrets to be reloaded by instances, you can add
+a label with the key cnpg.io/reload to it. Otherwise you must reload the
+instances using the kubectl cnpg reload subcommand.
Given the following files:
+server-ca.crt – The certificate of the CA that signed the server TLS certificate.server.crt– The certificate of the server TLS certificate.server.key – The private key of the server TLS certificate.Create a secret containing the CA certificate:
+kubectl create secret generic my-postgresql-server-ca \
+ --from-file=ca.crt=./server-ca.crt
+
+Create a secret with the TLS certificate:
+kubectl create secret tls my-postgresql-server \
+ --cert=./server.crt --key=./server.key
+
+Create a PostgreSQL cluster referencing those secrets:
+kubectl apply -f - <<EOF
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ certificates:
+ serverCASecret: my-postgresql-server-ca
+ serverTLSSecret: my-postgresql-server
+ storage:
+ storageClass: standard
+ size: 1Gi
+EOF
+
+The new cluster uses the provided server certificates for TLS connections.
+This simple example shows how to use cert-manager +to set up a self-signed CA and generate the needed TLS server certificate:
+---
+apiVersion: cert-manager.io/v1
+kind: Issuer
+metadata:
+ name: selfsigned-issuer
+spec:
+ selfSigned: {}
+---
+apiVersion: v1
+kind: Secret
+metadata:
+ name: my-postgres-server-cert
+ labels:
+ cnpg.io/reload: ""
+---
+apiVersion: cert-manager.io/v1
+kind: Certificate
+metadata:
+ name: my-postgres-server-cert
+spec:
+ secretName: my-postgres-server-cert
+ usages:
+ - server auth
+ dnsNames:
+ - cluster-example-lb.internal.mydomain.net
+ - cluster-example-rw
+ - cluster-example-rw.default
+ - cluster-example-rw.default.svc
+ - cluster-example-r
+ - cluster-example-r.default
+ - cluster-example-r.default.svc
+ - cluster-example-ro
+ - cluster-example-ro.default
+ - cluster-example-ro.default.svc
+ issuerRef:
+ name: selfsigned-issuer
+ kind: Issuer
+ group: cert-manager.io
+
+Cert-manager creates a secret named my-postgres-server-cert. It contains all
+the needed files and can be referenced from a cluster as follows:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ certificates:
+ serverTLSSecret: my-postgres-server-cert
+ serverCASecret: my-postgres-server-cert
+ storage:
+ size: 1Gi
+
+You can find a complete example using cert-manager to manage both server and +client CA and certificates in the +cluster-example-cert-manager.yaml +deployment manifest.
+If required, you can also provide the two client certificates, generating them +using a separate component such as cert-manager or +HashiCorp vault. To +use a custom CA to verify client certificates for a cluster, you must specify +the following parameters:
+replicationTLSSecret – The name of a secret of type kubernetes.io/tls
+ containing the client certificate for user streaming_replica. It must contain
+ both the standard tls.crt and tls.key keys.clientCASecret– The name of a secret containing the ca.crt key of the CA
+ to use to verify client certificate.Note
+The operator still creates and manages the two secrets related to server +certificates.
+Note
+As the cluster isn't in control of the client CA secret key, you can no
+longer generate client certificates using kubectl cnpg certificate.
Note
+If you want ConfigMaps and secrets to be automatically reloaded by
+instances, you can add a label with the key cnpg.io/reload to it. Otherwise,
+you must reload the instances using the kubectl cnpg reload subcommand.
streaming_replica client certificateIn some environments, it may not be possible to generate a certificate with the
+common name streaming_replica due to company policies or other security
+concerns, such as a CA shared across multiple clusters. In such cases, the user
+mapping feature can be used to allow authentication as the streaming_replica
+user with certificates containing different common names.
To configure this setup, add a pg_ident.conf entry for the predefined map
+named cnpg_streaming_replica.
For example, to enable streaming_replica authentication using a certificate
+with the common name streaming-replica.cnpg.svc.cluster.local, add the
+following to your cluster definition:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ postgresql:
+ pg_ident:
+ - cnpg_streaming_replica streaming-replica.cnpg.svc.cluster.local streaming_replica
+
+For further details on how pg_ident.conf is managed by the operator, see the
+"PostgreSQL Configuration" page in
+the documentation.
This simple example shows how to use cert-manager +to set up a self-signed CA and generate the needed TLS server certificate:
+---
+apiVersion: cert-manager.io/v1
+kind: Issuer
+metadata:
+ name: selfsigned-issuer
+spec:
+ selfSigned: {}
+---
+apiVersion: v1
+kind: Secret
+metadata:
+ name: my-postgres-client-cert
+ labels:
+ cnpg.io/reload: ""
+---
+apiVersion: cert-manager.io/v1
+kind: Certificate
+metadata:
+ name: my-postgres-client-cert
+spec:
+ secretName: my-postgres-client-cert
+ usages:
+ - client auth
+ commonName: streaming_replica
+ issuerRef:
+ name: selfsigned-issuer
+ kind: Issuer
+ group: cert-manager.io
+
+Cert-manager creates a secret named my-postgres-client-cert that contains all
+the needed files. You can reference it from a cluster as follows:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ certificates:
+ clientCASecret: my-postgres-client-cert
+ replicationTLSSecret: my-postgres-client-cert
+ storage:
+ size: 1Gi
+
+You can find a complete example using cert-manager to manage both server and +client CA and certificates in the +cluster-example-cert-manager.yaml +deployment manifest.
+ +Package v1 contains API Schema definitions for the postgresql v1 API group
+ +A Backup resource is a request for a PostgreSQL backup by the user.
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | Backup |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+BackupSpec + |
+
+ Specification of the desired behavior of the backup. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
status+BackupStatus + |
+
+ Most recently observed status of the backup. This data may not be up to +date. Populated by the system. Read-only. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
Cluster defines the API schema for a highly available PostgreSQL database cluster +managed by CloudNativePG.
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | Cluster |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+ClusterSpec + |
+
+ Specification of the desired behavior of the cluster. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
status+ClusterStatus + |
+
+ Most recently observed status of the cluster. This data may not be up +to date. Populated by the system. Read-only. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
ClusterImageCatalog is the Schema for the clusterimagecatalogs API
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | ClusterImageCatalog |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+ImageCatalogSpec + |
+
+ Specification of the desired behavior of the ClusterImageCatalog. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
Database is the Schema for the databases API
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | Database |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+DatabaseSpec + |
+
+ Specification of the desired Database. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
status+DatabaseStatus + |
+
+ Most recently observed status of the Database. This data may not be up to +date. Populated by the system. Read-only. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
Appears in:
+FailoverQuorum contains the information about the current failover +quorum status of a PG cluster. It is updated by the instance manager +of the primary node and reset to zero by the operator to trigger +an update.
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | FailoverQuorum |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
status+FailoverQuorumStatus + |
+
+ Most recently observed status of the failover quorum. + |
+
ImageCatalog is the Schema for the imagecatalogs API
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | ImageCatalog |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+ImageCatalogSpec + |
+
+ Specification of the desired behavior of the ImageCatalog. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
Pooler is the Schema for the poolers API
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | Pooler |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+PoolerSpec + |
+
+ Specification of the desired behavior of the Pooler. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
status+PoolerStatus + |
+
+ Most recently observed status of the Pooler. This data may not be up to +date. Populated by the system. Read-only. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
Publication is the Schema for the publications API
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | Publication |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+PublicationSpec + |
++ No description provided. | +
status [Required]+PublicationStatus + |
++ No description provided. | +
ScheduledBackup is the Schema for the scheduledbackups API
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | ScheduledBackup |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+ScheduledBackupSpec + |
+
+ Specification of the desired behavior of the ScheduledBackup. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
status+ScheduledBackupStatus + |
+
+ Most recently observed status of the ScheduledBackup. This data may not be up +to date. Populated by the system. Read-only. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
Subscription is the Schema for the subscriptions API
+ +| Field | Description |
|---|---|
apiVersion [Required]string | postgresql.cnpg.io/v1 |
kind [Required]string | Subscription |
metadata [Required]+meta/v1.ObjectMeta + |
+
+ No description provided.Refer to the Kubernetes API documentation for the fields of the metadata field. |
+
spec [Required]+SubscriptionSpec + |
++ No description provided. | +
status [Required]+SubscriptionStatus + |
++ No description provided. | +
Appears in:
+AffinityConfiguration contains the info we need to create the +affinity rules for Pods
+ +| Field | Description |
|---|---|
enablePodAntiAffinity+bool + |
+
+ Activates anti-affinity for the pods. The operator will define pods +anti-affinity unless this field is explicitly set to false + |
+
topologyKey+string + |
+
+ TopologyKey to use for anti-affinity configuration. See k8s documentation +for more info on that + |
+
nodeSelector+map[string]string + |
+
+ NodeSelector is map of key-value pairs used to define the nodes on which +the pods can run. +More info: https://kubernetes.io/docs/concepts/configuration/assign-pod-node/ + |
+
nodeAffinity+core/v1.NodeAffinity + |
+
+ NodeAffinity describes node affinity scheduling rules for the pod. +More info: https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#node-affinity + |
+
tolerations+[]core/v1.Toleration + |
+
+ Tolerations is a list of Tolerations that should be set for all the pods, in order to allow them to run +on tainted nodes. +More info: https://kubernetes.io/docs/concepts/scheduling-eviction/taint-and-toleration/ + |
+
podAntiAffinityType+string + |
+
+ PodAntiAffinityType allows the user to decide whether pod anti-affinity between cluster instance has to be +considered a strong requirement during scheduling or not. Allowed values are: "preferred" (default if empty) or +"required". Setting it to "required", could lead to instances remaining pending until new kubernetes nodes are +added if all the existing nodes don't match the required pod anti-affinity rule. +More info: +https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#inter-pod-affinity-and-anti-affinity + |
+
additionalPodAntiAffinity+core/v1.PodAntiAffinity + |
+
+ AdditionalPodAntiAffinity allows to specify pod anti-affinity terms to be added to the ones generated +by the operator if EnablePodAntiAffinity is set to true (default) or to be used exclusively if set to false. + |
+
additionalPodAffinity+core/v1.PodAffinity + |
+
+ AdditionalPodAffinity allows to specify pod affinity terms to be passed to all the cluster's pods. + |
+
Appears in:
+ +AvailableArchitecture represents the state of a cluster's architecture
+ +| Field | Description |
|---|---|
goArch [Required]+string + |
+
+ GoArch is the name of the executable architecture + |
+
hash [Required]+string + |
+
+ Hash is the hash of the executable + |
+
Appears in:
+BackupConfiguration defines how the backup of the cluster are taken. +The supported backup methods are BarmanObjectStore and VolumeSnapshot. +For details and examples refer to the Backup and Recovery section of the +documentation
+ +| Field | Description |
|---|---|
volumeSnapshot+VolumeSnapshotConfiguration + |
+
+ VolumeSnapshot provides the configuration for the execution of volume snapshot backups. + |
+
barmanObjectStore+github.com/cloudnative-pg/barman-cloud/pkg/api.BarmanObjectStoreConfiguration + |
+
+ The configuration for the barman-cloud tool suite + |
+
retentionPolicy+string + |
+
+ RetentionPolicy is the retention policy to be used for backups
+and WALs (i.e. '60d'). The retention policy is expressed in the form
+of |
+
target+BackupTarget + |
+
+ The policy to decide which instance should perform backups. Available
+options are empty string, which will default to |
+
(Alias of string)
Appears in:
+BackupMethod defines the way of executing the physical base backups of +the selected PostgreSQL instance
+ +(Alias of string)
Appears in:
+BackupPhase is the phase of the backup
+ +Appears in:
+BackupPluginConfiguration contains the backup configuration used by +the backup plugin
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name is the name of the plugin managing this backup + |
+
parameters+map[string]string + |
+
+ Parameters are the configuration parameters passed to the backup +plugin for this backup + |
+
Appears in:
+ +BackupSnapshotElementStatus is a volume snapshot that is part of a volume snapshot method backup
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name is the snapshot resource name + |
+
type [Required]+string + |
+
+ Type is tho role of the snapshot in the cluster, such as PG_DATA, PG_WAL and PG_TABLESPACE + |
+
tablespaceName+string + |
+
+ TablespaceName is the name of the snapshotted tablespace. Only set +when type is PG_TABLESPACE + |
+
Appears in:
+BackupSnapshotStatus the fields exclusive to the volumeSnapshot method backup
+ +| Field | Description |
|---|---|
elements+[]BackupSnapshotElementStatus + |
+
+ The elements list, populated with the gathered volume snapshots + |
+
Appears in:
+ +BackupSource contains the backup we need to restore from, plus some +information that could be needed to correctly restore it.
+ +| Field | Description |
|---|---|
LocalObjectReference+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+(Members of LocalObjectReference are embedded into this type.)
+ No description provided. |
+
endpointCA+github.com/cloudnative-pg/machinery/pkg/api.SecretKeySelector + |
+
+ EndpointCA store the CA bundle of the barman endpoint. +Useful when using self-signed certificates to avoid +errors with certificate issuer and barman-cloud-wal-archive. + |
+
Appears in:
+BackupSpec defines the desired state of Backup
+ +| Field | Description |
|---|---|
cluster [Required]+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ The cluster to backup + |
+
target+BackupTarget + |
+
+ The policy to decide which instance should perform this backup. If empty,
+it defaults to |
+
method+BackupMethod + |
+
+ The backup method to be used, possible options are |
+
pluginConfiguration+BackupPluginConfiguration + |
+
+ Configuration parameters passed to the plugin managing this backup + |
+
online+bool + |
+
+ Whether the default type of backup with volume snapshots is
+online/hot ( |
+
onlineConfiguration+OnlineConfiguration + |
+
+ Configuration parameters to control the online/hot backup with volume snapshots +Overrides the default settings specified in the cluster '.backup.volumeSnapshot.onlineConfiguration' stanza + |
+
Appears in:
+BackupStatus defines the observed state of Backup
+ +| Field | Description |
|---|---|
BarmanCredentials+github.com/cloudnative-pg/barman-cloud/pkg/api.BarmanCredentials + |
+(Members of BarmanCredentials are embedded into this type.)
+ The potential credentials for each cloud provider + |
+
majorVersion [Required]+int + |
+
+ The PostgreSQL major version that was running when the +backup was taken. + |
+
endpointCA+github.com/cloudnative-pg/machinery/pkg/api.SecretKeySelector + |
+
+ EndpointCA store the CA bundle of the barman endpoint. +Useful when using self-signed certificates to avoid +errors with certificate issuer and barman-cloud-wal-archive. + |
+
endpointURL+string + |
+
+ Endpoint to be used to upload data to the cloud, +overriding the automatic endpoint discovery + |
+
destinationPath+string + |
+
+ The path where to store the backup (i.e. s3://bucket/path/to/folder) +this path, with different destination folders, will be used for WALs +and for data. This may not be populated in case of errors. + |
+
serverName+string + |
+
+ The server name on S3, the cluster name is used if this +parameter is omitted + |
+
encryption+string + |
+
+ Encryption method required to S3 API + |
+
backupId+string + |
+
+ The ID of the Barman backup + |
+
backupName+string + |
+
+ The Name of the Barman backup + |
+
phase+BackupPhase + |
+
+ The last backup status + |
+
startedAt+meta/v1.Time + |
+
+ When the backup was started + |
+
stoppedAt+meta/v1.Time + |
+
+ When the backup was terminated + |
+
beginWal+string + |
+
+ The starting WAL + |
+
endWal+string + |
+
+ The ending WAL + |
+
beginLSN+string + |
+
+ The starting xlog + |
+
endLSN+string + |
+
+ The ending xlog + |
+
error+string + |
+
+ The detected error + |
+
commandOutput+string + |
+
+ Unused. Retained for compatibility with old versions. + |
+
commandError+string + |
+
+ The backup command output in case of error + |
+
backupLabelFile+[]byte + |
+
+ Backup label file content as returned by Postgres in case of online (hot) backups + |
+
tablespaceMapFile+[]byte + |
+
+ Tablespace map file content as returned by Postgres in case of online (hot) backups + |
+
instanceID+InstanceID + |
+
+ Information to identify the instance where the backup has been taken from + |
+
snapshotBackupStatus+BackupSnapshotStatus + |
+
+ Status of the volumeSnapshot backup + |
+
method+BackupMethod + |
+
+ The backup method being used + |
+
online+bool + |
+
+ Whether the backup was online/hot ( |
+
pluginMetadata+map[string]string + |
+
+ A map containing the plugin metadata + |
+
(Alias of string)
Appears in:
+BackupTarget describes the preferred targets for a backup
+ +Appears in:
+BootstrapConfiguration contains information about how to create the PostgreSQL
+cluster. Only a single bootstrap method can be defined among the supported
+ones. initdb will be used as the bootstrap method if left
+unspecified. Refer to the Bootstrap page of the documentation for more
+information.
| Field | Description |
|---|---|
initdb+BootstrapInitDB + |
+
+ Bootstrap the cluster via initdb + |
+
recovery+BootstrapRecovery + |
+
+ Bootstrap the cluster from a backup + |
+
pg_basebackup+BootstrapPgBaseBackup + |
+
+ Bootstrap the cluster taking a physical backup of another compatible +PostgreSQL instance + |
+
Appears in:
+ +BootstrapInitDB is the configuration of the bootstrap process when +initdb is used +Refer to the Bootstrap page of the documentation for more information.
+ +| Field | Description |
|---|---|
database+string + |
+
+ Name of the database used by the application. Default: |
+
owner+string + |
+
+ Name of the owner of the database in the instance to be used
+by applications. Defaults to the value of the |
+
secret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ Name of the secret containing the initial credentials for the +owner of the user database. If empty a new secret will be +created from scratch + |
+
options+[]string + |
+
+ The list of options that must be passed to initdb when creating the cluster. +Deprecated: This could lead to inconsistent configurations, +please use the explicit provided parameters instead. +If defined, explicit values will be ignored. + |
+
dataChecksums+bool + |
+
+ Whether the |
+
encoding+string + |
+
+ The value to be passed as option |
+
localeCollate+string + |
+
+ The value to be passed as option |
+
localeCType+string + |
+
+ The value to be passed as option |
+
locale+string + |
+
+ Sets the default collation order and character classification in the new database. + |
+
localeProvider+string + |
+
+ This option sets the locale provider for databases created in the new cluster. +Available from PostgreSQL 16. + |
+
icuLocale+string + |
+
+ Specifies the ICU locale when the ICU provider is used.
+This option requires |
+
icuRules+string + |
+
+ Specifies additional collation rules to customize the behavior of the default collation.
+This option requires |
+
builtinLocale+string + |
+
+ Specifies the locale name when the builtin provider is used.
+This option requires |
+
walSegmentSize+int + |
+
+ The value in megabytes (1 to 1024) to be passed to the |
+
postInitSQL+[]string + |
+
+ List of SQL queries to be executed as a superuser in the |
+
postInitApplicationSQL+[]string + |
+
+ List of SQL queries to be executed as a superuser in the application +database right after the cluster has been created - to be used with extreme care +(by default empty) + |
+
postInitTemplateSQL+[]string + |
+
+ List of SQL queries to be executed as a superuser in the |
+
import+Import + |
+
+ Bootstraps the new cluster by importing data from an existing PostgreSQL
+instance using logical backup ( |
+
postInitApplicationSQLRefs+SQLRefs + |
+
+ List of references to ConfigMaps or Secrets containing SQL files +to be executed as a superuser in the application database right after +the cluster has been created. The references are processed in a specific order: +first, all Secrets are processed, followed by all ConfigMaps. +Within each group, the processing order follows the sequence specified +in their respective arrays. +(by default empty) + |
+
postInitTemplateSQLRefs+SQLRefs + |
+
+ List of references to ConfigMaps or Secrets containing SQL files
+to be executed as a superuser in the |
+
postInitSQLRefs+SQLRefs + |
+
+ List of references to ConfigMaps or Secrets containing SQL files
+to be executed as a superuser in the |
+
Appears in:
+ +BootstrapPgBaseBackup contains the configuration required to take +a physical backup of an existing PostgreSQL cluster
+ +| Field | Description |
|---|---|
source [Required]+string + |
+
+ The name of the server of which we need to take a physical backup + |
+
database+string + |
+
+ Name of the database used by the application. Default: |
+
owner+string + |
+
+ Name of the owner of the database in the instance to be used
+by applications. Defaults to the value of the |
+
secret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ Name of the secret containing the initial credentials for the +owner of the user database. If empty a new secret will be +created from scratch + |
+
Appears in:
+ +BootstrapRecovery contains the configuration required to restore
+from an existing cluster using 3 methodologies: external cluster,
+volume snapshots or backup objects. Full recovery and Point-In-Time
+Recovery are supported.
+The method can be also be used to create clusters in continuous recovery
+(replica clusters), also supporting cascading replication when instances >
| Field | Description |
|---|---|
backup+BackupSource + |
+
+ The backup object containing the physical base backup from which to
+initiate the recovery procedure.
+Mutually exclusive with |
+
source+string + |
+
+ The external cluster whose backup we will restore. This is also
+used as the name of the folder under which the backup is stored,
+so it must be set to the name of the source cluster
+Mutually exclusive with |
+
volumeSnapshots+DataSource + |
+
+ The static PVC data source(s) from which to initiate the
+recovery procedure. Currently supporting |
+
recoveryTarget+RecoveryTarget + |
+
+ By default, the recovery process applies all the available
+WAL files in the archive (full recovery). However, you can also
+end the recovery as soon as a consistent state is reached or
+recover to a point-in-time (PITR) by specifying a |
+
database+string + |
+
+ Name of the database used by the application. Default: |
+
owner+string + |
+
+ Name of the owner of the database in the instance to be used
+by applications. Defaults to the value of the |
+
secret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ Name of the secret containing the initial credentials for the +owner of the user database. If empty a new secret will be +created from scratch + |
+
Appears in:
+ +CatalogImage defines the image and major version
+ +| Field | Description |
|---|---|
image [Required]+string + |
+
+ The image reference + |
+
major [Required]+int + |
+
+ The PostgreSQL major version of the image. Must be unique within the catalog. + |
+
Appears in:
+CertificatesConfiguration contains the needed configurations to handle server certificates.
+ +| Field | Description |
|---|---|
serverCASecret+string + |
+
+ The secret containing the Server CA certificate. If not defined, a new secret will be created +with a self-signed CA and will be used to generate the TLS certificate ServerTLSSecret. + +Contains: + +
|
+
serverTLSSecret+string + |
+
+ The secret of type kubernetes.io/tls containing the server TLS certificate and key that will be set as
+ |
+
replicationTLSSecret+string + |
+
+ The secret of type kubernetes.io/tls containing the client certificate to authenticate as
+the |
+
clientCASecret+string + |
+
+ The secret containing the Client CA certificate. If not defined, a new secret will be created +with a self-signed CA and will be used to generate all the client certificates. + +Contains: + +
|
+
serverAltDNSNames+[]string + |
+
+ The list of the server alternative DNS names to be added to the generated server TLS certificates, when required. + |
+
Appears in:
+ +CertificatesStatus contains configuration certificates and related expiration dates.
+ +| Field | Description |
|---|---|
CertificatesConfiguration+CertificatesConfiguration + |
+(Members of CertificatesConfiguration are embedded into this type.)
+ Needed configurations to handle server certificates, initialized with default values, if needed. + |
+
expirations+map[string]string + |
+
+ Expiration dates for all certificates. + |
+
Appears in:
+ +ClusterMonitoringTLSConfiguration is the type containing the TLS configuration +for the cluster's monitoring
+ +| Field | Description |
|---|---|
enabled+bool + |
+
+ Enable TLS for the monitoring endpoint. +Changing this option will force a rollout of all instances. + |
+
Appears in:
+ClusterSpec defines the desired state of a PostgreSQL cluster managed by +CloudNativePG.
+ +| Field | Description |
|---|---|
description+string + |
+
+ Description of this PostgreSQL cluster + |
+
inheritedMetadata+EmbeddedObjectMetadata + |
+
+ Metadata that will be inherited by all objects related to the Cluster + |
+
imageName+string + |
+
+ Name of the container image, supporting both tags ( |
+
imageCatalogRef+ImageCatalogRef + |
+
+ Defines the major PostgreSQL version we want to use within an ImageCatalog + |
+
imagePullPolicy+core/v1.PullPolicy + |
+
+ Image pull policy.
+One of |
+
schedulerName+string + |
+
+ If specified, the pod will be dispatched by specified Kubernetes +scheduler. If not specified, the pod will be dispatched by the default +scheduler. More info: +https://kubernetes.io/docs/concepts/scheduling-eviction/kube-scheduler/ + |
+
postgresUID+int64 + |
+
+ The UID of the |
+
postgresGID+int64 + |
+
+ The GID of the |
+
instances [Required]+int + |
+
+ Number of instances required in the cluster + |
+
minSyncReplicas+int + |
+
+ Minimum number of instances required in synchronous replication with the +primary. Undefined or 0 allow writes to complete when no standby is +available. + |
+
maxSyncReplicas+int + |
+
+ The target value for the synchronous replication quorum, that can be +decreased if the number of ready standbys is lower than this. +Undefined or 0 disable synchronous replication. + |
+
postgresql+PostgresConfiguration + |
+
+ Configuration of the PostgreSQL server + |
+
replicationSlots+ReplicationSlotsConfiguration + |
+
+ Replication slots management configuration + |
+
bootstrap+BootstrapConfiguration + |
+
+ Instructions to bootstrap this cluster + |
+
replica+ReplicaClusterConfiguration + |
+
+ Replica cluster configuration + |
+
superuserSecret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ The secret containing the superuser password. If not defined a new +secret will be created with a randomly generated password + |
+
enableSuperuserAccess+bool + |
+
+ When this option is enabled, the operator will use the |
+
certificates+CertificatesConfiguration + |
+
+ The configuration for the CA and related certificates + |
+
imagePullSecrets+[]github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ The list of pull secrets to be used to pull the images + |
+
storage+StorageConfiguration + |
+
+ Configuration of the storage of the instances + |
+
serviceAccountTemplate+ServiceAccountTemplate + |
+
+ Configure the generation of the service account + |
+
walStorage+StorageConfiguration + |
+
+ Configuration of the storage for PostgreSQL WAL (Write-Ahead Log) + |
+
ephemeralVolumeSource+core/v1.EphemeralVolumeSource + |
+
+ EphemeralVolumeSource allows the user to configure the source of ephemeral volumes. + |
+
startDelay+int32 + |
+
+ The time in seconds that is allowed for a PostgreSQL instance to +successfully start up (default 3600). +The startup probe failure threshold is derived from this value using the formula: +ceiling(startDelay / 10). + |
+
stopDelay+int32 + |
+
+ The time in seconds that is allowed for a PostgreSQL instance to +gracefully shutdown (default 1800) + |
+
smartShutdownTimeout+int32 + |
+
+ The time in seconds that controls the window of time reserved for the smart shutdown of Postgres to complete.
+Make sure you reserve enough time for the operator to request a fast shutdown of Postgres
+(that is: |
+
switchoverDelay+int32 + |
+
+ The time in seconds that is allowed for a primary PostgreSQL instance +to gracefully shutdown during a switchover. +Default value is 3600 seconds (1 hour). + |
+
failoverDelay+int32 + |
+
+ The amount of time (in seconds) to wait before triggering a failover +after the primary PostgreSQL instance in the cluster was detected +to be unhealthy + |
+
livenessProbeTimeout+int32 + |
+
+ LivenessProbeTimeout is the time (in seconds) that is allowed for a PostgreSQL instance +to successfully respond to the liveness probe (default 30). +The Liveness probe failure threshold is derived from this value using the formula: +ceiling(livenessProbe / 10). + |
+
affinity+AffinityConfiguration + |
+
+ Affinity/Anti-affinity rules for Pods + |
+
topologySpreadConstraints+[]core/v1.TopologySpreadConstraint + |
+
+ TopologySpreadConstraints specifies how to spread matching pods among the given topology. +More info: +https://kubernetes.io/docs/concepts/scheduling-eviction/topology-spread-constraints/ + |
+
resources+core/v1.ResourceRequirements + |
+
+ Resources requirements of every generated Pod. Please refer to +https://kubernetes.io/docs/concepts/configuration/manage-resources-containers/ +for more information. + |
+
ephemeralVolumesSizeLimit+EphemeralVolumesSizeLimitConfiguration + |
+
+ EphemeralVolumesSizeLimit allows the user to set the limits for the ephemeral +volumes + |
+
priorityClassName+string + |
+
+ Name of the priority class which will be used in every generated Pod, if the PriorityClass +specified does not exist, the pod will not be able to schedule. Please refer to +https://kubernetes.io/docs/concepts/scheduling-eviction/pod-priority-preemption/#priorityclass +for more information + |
+
primaryUpdateStrategy+PrimaryUpdateStrategy + |
+
+ Deployment strategy to follow to upgrade the primary server during a rolling
+update procedure, after all replicas have been successfully updated:
+it can be automated ( |
+
primaryUpdateMethod+PrimaryUpdateMethod + |
+
+ Method to follow to upgrade the primary server during a rolling
+update procedure, after all replicas have been successfully updated:
+it can be with a switchover ( |
+
backup+BackupConfiguration + |
+
+ The configuration to be used for backups + |
+
nodeMaintenanceWindow+NodeMaintenanceWindow + |
+
+ Define a maintenance window for the Kubernetes nodes + |
+
monitoring+MonitoringConfiguration + |
+
+ The configuration of the monitoring infrastructure of this cluster + |
+
externalClusters+[]ExternalCluster + |
+
+ The list of external clusters which are used in the configuration + |
+
logLevel+string + |
+
+ The instances' log level, one of the following values: error, warning, info (default), debug, trace + |
+
projectedVolumeTemplate+core/v1.ProjectedVolumeSource + |
+
+ Template to be used to define projected volumes, projected volumes will be mounted
+under |
+
env+[]core/v1.EnvVar + |
+
+ Env follows the Env format to pass environment variables +to the pods created in the cluster + |
+
envFrom+[]core/v1.EnvFromSource + |
+
+ EnvFrom follows the EnvFrom format to pass environment variables +sources to the pods to be used by Env + |
+
managed+ManagedConfiguration + |
+
+ The configuration that is used by the portions of PostgreSQL that are managed by the instance manager + |
+
seccompProfile+core/v1.SeccompProfile + |
+
+ The SeccompProfile applied to every Pod and Container.
+Defaults to: |
+
podSecurityContext+core/v1.PodSecurityContext + |
+
+ Override the PodSecurityContext applied to every Pod of the cluster. +When set, this overrides the operator's default PodSecurityContext for the cluster. +If omitted, the operator defaults are used. +This field doesn't have any effect if SecurityContextConstraints are present. + |
+
securityContext+core/v1.SecurityContext + |
+
+ Override the SecurityContext applied to every Container in the Pod of the cluster. +When set, this overrides the operator's default Container SecurityContext. +If omitted, the operator defaults are used. + |
+
tablespaces+[]TablespaceConfiguration + |
+
+ The tablespaces configuration + |
+
enablePDB+bool + |
+
+ Manage the |
+
plugins+[]PluginConfiguration + |
+
+ The plugins configuration, containing +any plugin to be loaded with the corresponding configuration + |
+
probes+ProbesConfiguration + |
+
+ The configuration of the probes to be injected +in the PostgreSQL Pods. + |
+
Appears in:
+ClusterStatus defines the observed state of a PostgreSQL cluster managed by +CloudNativePG.
+ +| Field | Description |
|---|---|
instances+int + |
+
+ The total number of PVC Groups detected in the cluster. It may differ from the number of existing instance pods. + |
+
readyInstances+int + |
+
+ The total number of ready instances in the cluster. It is equal to the number of ready instance pods. + |
+
instancesStatus+map[PodStatus][]string + |
+
+ InstancesStatus indicates in which status the instances are + |
+
instancesReportedState+map[PodName]InstanceReportedState + |
+
+ The reported state of the instances during the last reconciliation loop + |
+
managedRolesStatus+ManagedRoles + |
+
+ ManagedRolesStatus reports the state of the managed roles in the cluster + |
+
tablespacesStatus+[]TablespaceState + |
+
+ TablespacesStatus reports the state of the declarative tablespaces in the cluster + |
+
timelineID+int + |
+
+ The timeline of the Postgres cluster + |
+
topology+Topology + |
+
+ Instances topology. + |
+
latestGeneratedNode+int + |
+
+ ID of the latest generated node (used to avoid node name clashing) + |
+
currentPrimary+string + |
+
+ Current primary instance + |
+
targetPrimary+string + |
+
+ Target primary instance, this is different from the previous one +during a switchover or a failover + |
+
lastPromotionToken+string + |
+
+ LastPromotionToken is the last verified promotion token that +was used to promote a replica cluster + |
+
pvcCount+int32 + |
+
+ How many PVCs have been created by this cluster + |
+
jobCount+int32 + |
+
+ How many Jobs have been created by this cluster + |
+
danglingPVC+[]string + |
+
+ List of all the PVCs created by this cluster and still available +which are not attached to a Pod + |
+
resizingPVC+[]string + |
+
+ List of all the PVCs that have ResizingPVC condition. + |
+
initializingPVC+[]string + |
+
+ List of all the PVCs that are being initialized by this cluster + |
+
healthyPVC+[]string + |
+
+ List of all the PVCs not dangling nor initializing + |
+
unusablePVC+[]string + |
+
+ List of all the PVCs that are unusable because another PVC is missing + |
+
writeService+string + |
+
+ Current write pod + |
+
readService+string + |
+
+ Current list of read pods + |
+
phase+string + |
+
+ Current phase of the cluster + |
+
phaseReason+string + |
+
+ Reason for the current phase + |
+
secretsResourceVersion+SecretsResourceVersion + |
+
+ The list of resource versions of the secrets +managed by the operator. Every change here is done in the +interest of the instance manager, which will refresh the +secret data + |
+
configMapResourceVersion+ConfigMapResourceVersion + |
+
+ The list of resource versions of the configmaps, +managed by the operator. Every change here is done in the +interest of the instance manager, which will refresh the +configmap data + |
+
certificates+CertificatesStatus + |
+
+ The configuration for the CA and related certificates, initialized with defaults. + |
+
firstRecoverabilityPoint+string + |
+
+ The first recoverability point, stored as a date in RFC3339 format. +This field is calculated from the content of FirstRecoverabilityPointByMethod. +Deprecated: the field is not set for backup plugins. + |
+
firstRecoverabilityPointByMethod+map[BackupMethod]meta/v1.Time + |
+
+ The first recoverability point, stored as a date in RFC3339 format, per backup method type. +Deprecated: the field is not set for backup plugins. + |
+
lastSuccessfulBackup+string + |
+
+ Last successful backup, stored as a date in RFC3339 format. +This field is calculated from the content of LastSuccessfulBackupByMethod. +Deprecated: the field is not set for backup plugins. + |
+
lastSuccessfulBackupByMethod+map[BackupMethod]meta/v1.Time + |
+
+ Last successful backup, stored as a date in RFC3339 format, per backup method type. +Deprecated: the field is not set for backup plugins. + |
+
lastFailedBackup+string + |
+
+ Last failed backup, stored as a date in RFC3339 format. +Deprecated: the field is not set for backup plugins. + |
+
cloudNativePGCommitHash+string + |
+
+ The commit hash number of which this operator running + |
+
currentPrimaryTimestamp+string + |
+
+ The timestamp when the last actual promotion to primary has occurred + |
+
currentPrimaryFailingSinceTimestamp+string + |
+
+ The timestamp when the primary was detected to be unhealthy
+This field is reported when |
+
targetPrimaryTimestamp+string + |
+
+ The timestamp when the last request for a new primary has occurred + |
+
poolerIntegrations+PoolerIntegrations + |
+
+ The integration needed by poolers referencing the cluster + |
+
cloudNativePGOperatorHash+string + |
+
+ The hash of the binary of the operator + |
+
availableArchitectures+[]AvailableArchitecture + |
+
+ AvailableArchitectures reports the available architectures of a cluster + |
+
conditions+[]meta/v1.Condition + |
+
+ Conditions for cluster object + |
+
instanceNames+[]string + |
+
+ List of instance names in the cluster + |
+
onlineUpdateEnabled+bool + |
+
+ OnlineUpdateEnabled shows if the online upgrade is enabled inside the cluster + |
+
image+string + |
+
+ Image contains the image name used by the pods + |
+
pgDataImageInfo+ImageInfo + |
+
+ PGDataImageInfo contains the details of the latest image that has run on the current data directory. + |
+
pluginStatus+[]PluginStatus + |
+
+ PluginStatus is the status of the loaded plugins + |
+
switchReplicaClusterStatus+SwitchReplicaClusterStatus + |
+
+ SwitchReplicaClusterStatus is the status of the switch to replica cluster + |
+
demotionToken+string + |
+
+ DemotionToken is a JSON token containing the information +from pg_controldata such as Database system identifier, Latest checkpoint's +TimeLineID, Latest checkpoint's REDO location, Latest checkpoint's REDO +WAL file, and Time of latest checkpoint + |
+
systemID+string + |
+
+ SystemID is the latest detected PostgreSQL SystemID + |
+
Appears in:
+ +ConfigMapResourceVersion is the resource versions of the secrets +managed by the operator
+ +| Field | Description |
|---|---|
metrics+map[string]string + |
+
+ A map with the versions of all the config maps used to pass metrics. +Map keys are the config map names, map values are the versions + |
+
(Alias of string)
Appears in:
+ +DataDurabilityLevel specifies how strictly to enforce synchronous replication
+when cluster instances are unavailable. Options are required or preferred.
Appears in:
+ +DataSource contains the configuration required to bootstrap a +PostgreSQL cluster from an existing storage
+ +| Field | Description |
|---|---|
storage [Required]+core/v1.TypedLocalObjectReference + |
+
+ Configuration of the storage of the instances + |
+
walStorage+core/v1.TypedLocalObjectReference + |
+
+ Configuration of the storage for PostgreSQL WAL (Write-Ahead Log) + |
+
tablespaceStorage+map[string]core/v1.TypedLocalObjectReference + |
+
+ Configuration of the storage for PostgreSQL tablespaces + |
+
Appears in:
+DatabaseObjectSpec contains the fields which are common to every +database object
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name of the object (extension, schema, FDW, server) + |
+
ensure+EnsureOption + |
+
+ Specifies whether an object (e.g schema) should be present or absent
+in the database. If set to |
+
Appears in:
+ +DatabaseObjectStatus is the status of the managed database objects
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ The name of the object + |
+
applied [Required]+bool + |
+
+ True of the object has been installed successfully in +the database + |
+
message+string + |
+
+ Message is the object reconciliation message + |
+
(Alias of string)
Appears in:
+DatabaseReclaimPolicy describes a policy for end-of-life maintenance of databases.
+ +Appears in:
+ +DatabaseRoleRef is a reference an a role available inside PostgreSQL
+ +| Field | Description |
|---|---|
name+string + |
++ No description provided. | +
Appears in:
+DatabaseSpec is the specification of a Postgresql Database, built around the
+CREATE DATABASE, ALTER DATABASE, and DROP DATABASE SQL commands of
+PostgreSQL.
| Field | Description |
|---|---|
cluster [Required]+core/v1.LocalObjectReference + |
+
+ The name of the PostgreSQL cluster hosting the database. + |
+
ensure+EnsureOption + |
+
+ Ensure the PostgreSQL database is |
+
name [Required]+string + |
+
+ The name of the database to create inside PostgreSQL. This setting cannot be changed. + |
+
owner [Required]+string + |
+
+ Maps to the |
+
template+string + |
+
+ Maps to the |
+
encoding+string + |
+
+ Maps to the |
+
locale+string + |
+
+ Maps to the |
+
localeProvider+string + |
+
+ Maps to the |
+
localeCollate+string + |
+
+ Maps to the |
+
localeCType+string + |
+
+ Maps to the |
+
icuLocale+string + |
+
+ Maps to the |
+
icuRules+string + |
+
+ Maps to the |
+
builtinLocale+string + |
+
+ Maps to the |
+
collationVersion+string + |
+
+ Maps to the |
+
isTemplate+bool + |
+
+ Maps to the |
+
allowConnections+bool + |
+
+ Maps to the |
+
connectionLimit+int + |
+
+ Maps to the |
+
tablespace+string + |
+
+ Maps to the |
+
databaseReclaimPolicy+DatabaseReclaimPolicy + |
+
+ The policy for end-of-life maintenance of this database. + |
+
schemas+[]SchemaSpec + |
+
+ The list of schemas to be managed in the database + |
+
extensions+[]ExtensionSpec + |
+
+ The list of extensions to be managed in the database + |
+
fdws+[]FDWSpec + |
+
+ The list of foreign data wrappers to be managed in the database + |
+
servers+[]ServerSpec + |
+
+ The list of foreign servers to be managed in the database + |
+
Appears in:
+DatabaseStatus defines the observed state of Database
+ +| Field | Description |
|---|---|
observedGeneration+int64 + |
+
+ A sequence number representing the latest +desired state that was synchronized + |
+
applied+bool + |
+
+ Applied is true if the database was reconciled correctly + |
+
message+string + |
+
+ Message is the reconciliation output message + |
+
schemas+[]DatabaseObjectStatus + |
+
+ Schemas is the status of the managed schemas + |
+
extensions+[]DatabaseObjectStatus + |
+
+ Extensions is the status of the managed extensions + |
+
fdws+[]DatabaseObjectStatus + |
+
+ FDWs is the status of the managed FDWs + |
+
servers+[]DatabaseObjectStatus + |
+
+ Servers is the status of the managed servers + |
+
Appears in:
+EmbeddedObjectMetadata contains metadata to be inherited by all resources related to a Cluster
+ +| Field | Description |
|---|---|
labels+map[string]string + |
++ No description provided. | +
annotations+map[string]string + |
++ No description provided. | +
(Alias of string)
Appears in:
+EnsureOption represents whether we should enforce the presence or absence of +a Role in a PostgreSQL instance
+ +Appears in:
+EphemeralVolumesSizeLimitConfiguration contains the configuration of the ephemeral +storage
+ +| Field | Description |
|---|---|
shm+k8s.io/apimachinery/pkg/api/resource.Quantity + |
+
+ Shm is the size limit of the shared memory volume + |
+
temporaryData+k8s.io/apimachinery/pkg/api/resource.Quantity + |
+
+ TemporaryData is the size limit of the temporary data volume + |
+
Appears in:
+ +ExtensionConfiguration is the configuration used to add +PostgreSQL extensions to the Cluster.
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ The name of the extension, required + |
+
image [Required]+core/v1.ImageVolumeSource + |
+
+ The image containing the extension, required + |
+
extension_control_path+[]string + |
+
+ The list of directories inside the image which should be added to extension_control_path. +If not defined, defaults to "/share". + |
+
dynamic_library_path+[]string + |
+
+ The list of directories inside the image which should be added to dynamic_library_path. +If not defined, defaults to "/lib". + |
+
ld_library_path+[]string + |
+
+ The list of directories inside the image which should be added to ld_library_path. + |
+
Appears in:
+ExtensionSpec configures an extension in a database
+ +| Field | Description |
|---|---|
DatabaseObjectSpec+DatabaseObjectSpec + |
+(Members of DatabaseObjectSpec are embedded into this type.)
+ Common fields + |
+
version [Required]+string + |
+
+ The version of the extension to install. If empty, the operator will +install the default version (whatever is specified in the +extension's control file) + |
+
schema [Required]+string + |
+
+ The name of the schema in which to install the extension's objects, +in case the extension allows its contents to be relocated. If not +specified (default), and the extension's control file does not +specify a schema either, the current default object creation schema +is used. + |
+
Appears in:
+ExternalCluster represents the connection parameters to an +external cluster which is used in the other sections of the configuration
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ The server name, required + |
+
connectionParameters+map[string]string + |
+
+ The list of connection parameters, such as dbname, host, username, etc + |
+
sslCert+core/v1.SecretKeySelector + |
+
+ The reference to an SSL certificate to be used to connect to this +instance + |
+
sslKey+core/v1.SecretKeySelector + |
+
+ The reference to an SSL private key to be used to connect to this +instance + |
+
sslRootCert+core/v1.SecretKeySelector + |
+
+ The reference to an SSL CA public key to be used to connect to this +instance + |
+
password+core/v1.SecretKeySelector + |
+
+ The reference to the password to be used to connect to the server.
+If a password is provided, CloudNativePG creates a PostgreSQL
+passfile at |
+
barmanObjectStore+github.com/cloudnative-pg/barman-cloud/pkg/api.BarmanObjectStoreConfiguration + |
+
+ The configuration for the barman-cloud tool suite + |
+
plugin [Required]+PluginConfiguration + |
+
+ The configuration of the plugin that is taking care +of WAL archiving and backups for this external cluster + |
+
Appears in:
+FDWSpec configures an Foreign Data Wrapper in a database
+ +| Field | Description |
|---|---|
DatabaseObjectSpec+DatabaseObjectSpec + |
+(Members of DatabaseObjectSpec are embedded into this type.)
+ Common fields + |
+
handler+string + |
+
+ Name of the handler function (e.g., "postgres_fdw_handler"). +This will be empty if no handler is specified. In that case, +the default handler is registered when the FDW extension is created. + |
+
validator+string + |
+
+ Name of the validator function (e.g., "postgres_fdw_validator"). +This will be empty if no validator is specified. In that case, +the default validator is registered when the FDW extension is created. + |
+
owner+string + |
+
+ Owner specifies the database role that will own the Foreign Data Wrapper. +The role must have superuser privileges in the target database. + |
+
options+[]OptionSpec + |
+
+ Options specifies the configuration options for the FDW. + |
+
usage+[]UsageSpec + |
+
+ List of roles for which |
+
Appears in:
+ +FailoverQuorumStatus is the latest observed status of the failover +quorum of the PG cluster.
+ +| Field | Description |
|---|---|
method+string + |
+
+ Contains the latest reported Method value. + |
+
standbyNames+[]string + |
+
+ StandbyNames is the list of potentially synchronous +instance names. + |
+
standbyNumber+int + |
+
+ StandbyNumber is the number of synchronous standbys that transactions +need to wait for replies from. + |
+
primary+string + |
+
+ Primary is the name of the primary instance that updated +this object the latest time. + |
+
Appears in:
+ImageCatalogRef defines the reference to a major version in an ImageCatalog
+ +| Field | Description |
|---|---|
TypedLocalObjectReference+core/v1.TypedLocalObjectReference + |
+(Members of TypedLocalObjectReference are embedded into this type.)
+ No description provided. |
+
major [Required]+int + |
+
+ The major version of PostgreSQL we want to use from the ImageCatalog + |
+
Appears in:
+ImageCatalogSpec defines the desired ImageCatalog
+ +| Field | Description |
|---|---|
images [Required]+[]CatalogImage + |
+
+ List of CatalogImages available in the catalog + |
+
Appears in:
+ +ImageInfo contains the information about a PostgreSQL image
+ +| Field | Description |
|---|---|
image [Required]+string + |
+
+ Image is the image name + |
+
majorVersion [Required]+int + |
+
+ MajorVersion is the major version of the image + |
+
Appears in:
+ +Import contains the configuration to init a database from a logic snapshot of an externalCluster
+ +| Field | Description |
|---|---|
source [Required]+ImportSource + |
+
+ The source of the import + |
+
type [Required]+SnapshotType + |
+
+ The import type. Can be |
+
databases [Required]+[]string + |
+
+ The databases to import + |
+
roles+[]string + |
+
+ The roles to import + |
+
postImportApplicationSQL+[]string + |
+
+ List of SQL queries to be executed as a superuser in the application +database right after is imported - to be used with extreme care +(by default empty). Only available in microservice type. + |
+
schemaOnly+bool + |
+
+ When set to true, only the |
+
pgDumpExtraOptions+[]string + |
+
+ List of custom options to pass to the IMPORTANT: Use with caution. The operator does not validate these options, +and certain flags may interfere with its intended functionality or design. +You are responsible for ensuring that the provided options are compatible +with your environment and desired behavior. + |
+
pgRestoreExtraOptions+[]string + |
+
+ List of custom options to pass to the IMPORTANT: Use with caution. The operator does not validate these options, +and certain flags may interfere with its intended functionality or design. +You are responsible for ensuring that the provided options are compatible +with your environment and desired behavior. + |
+
pgRestorePredataOptions+[]string + |
+
+ Custom options to pass to the IMPORTANT: Use with caution. The operator does not validate these options, +and certain flags may interfere with its intended functionality or design. +You are responsible for ensuring that the provided options are compatible +with your environment and desired behavior. + |
+
pgRestoreDataOptions+[]string + |
+
+ Custom options to pass to the IMPORTANT: Use with caution. The operator does not validate these options, +and certain flags may interfere with its intended functionality or design. +You are responsible for ensuring that the provided options are compatible +with your environment and desired behavior. + |
+
pgRestorePostdataOptions+[]string + |
+
+ Custom options to pass to the IMPORTANT: Use with caution. The operator does not validate these options, +and certain flags may interfere with its intended functionality or design. +You are responsible for ensuring that the provided options are compatible +with your environment and desired behavior. + |
+
Appears in:
+ImportSource describes the source for the logical snapshot
+ +| Field | Description |
|---|---|
externalCluster [Required]+string + |
+
+ The name of the externalCluster used for import + |
+
Appears in:
+InstanceID contains the information to identify an instance
+ +| Field | Description |
|---|---|
podName+string + |
+
+ The pod name + |
+
ContainerID+string + |
+
+ The container ID + |
+
Appears in:
+ +InstanceReportedState describes the last reported state of an instance during a reconciliation loop
+ +| Field | Description |
|---|---|
isPrimary [Required]+bool + |
+
+ indicates if an instance is the primary one + |
+
timeLineID+int + |
+
+ indicates on which TimelineId the instance is + |
+
ip [Required]+string + |
+
+ IP address of the instance + |
+
Appears in:
+ +IsolationCheckConfiguration contains the configuration for the isolation check +functionality in the liveness probe
+ +| Field | Description |
|---|---|
enabled+bool + |
+
+ Whether primary isolation checking is enabled for the liveness probe + |
+
requestTimeout+int + |
+
+ Timeout in milliseconds for requests during the primary isolation check + |
+
connectionTimeout+int + |
+
+ Timeout in milliseconds for connections during the primary isolation check + |
+
Appears in:
+LDAPBindAsAuth provides the required fields to use the +bind authentication for LDAP
+ +| Field | Description |
|---|---|
prefix+string + |
+
+ Prefix for the bind authentication option + |
+
suffix+string + |
+
+ Suffix for the bind authentication option + |
+
Appears in:
+LDAPBindSearchAuth provides the required fields to use +the bind+search LDAP authentication process
+ +| Field | Description |
|---|---|
baseDN+string + |
+
+ Root DN to begin the user search + |
+
bindDN+string + |
+
+ DN of the user to bind to the directory + |
+
bindPassword+core/v1.SecretKeySelector + |
+
+ Secret with the password for the user to bind to the directory + |
+
searchAttribute+string + |
+
+ Attribute to match against the username + |
+
searchFilter+string + |
+
+ Search filter to use when doing the search+bind authentication + |
+
Appears in:
+ +LDAPConfig contains the parameters needed for LDAP authentication
+ +| Field | Description |
|---|---|
server+string + |
+
+ LDAP hostname or IP address + |
+
port+int + |
+
+ LDAP server port + |
+
scheme+LDAPScheme + |
+
+ LDAP schema to be used, possible options are |
+
bindAsAuth+LDAPBindAsAuth + |
+
+ Bind as authentication configuration + |
+
bindSearchAuth+LDAPBindSearchAuth + |
+
+ Bind+Search authentication configuration + |
+
tls+bool + |
+
+ Set to 'true' to enable LDAP over TLS. 'false' is default + |
+
(Alias of string)
Appears in:
+LDAPScheme defines the possible schemes for LDAP
+ +Appears in:
+ +LivenessProbe is the configuration of the liveness probe
+ +| Field | Description |
|---|---|
Probe+Probe + |
+(Members of Probe are embedded into this type.)
+ Probe is the standard probe configuration + |
+
isolationCheck+IsolationCheckConfiguration + |
+
+ Configure the feature that extends the liveness probe for a primary +instance. In addition to the basic checks, this verifies whether the +primary is isolated from the Kubernetes API server and from its +replicas, ensuring that it can be safely shut down if network +partition or API unavailability is detected. Enabled by default. + |
+
Appears in:
+ManagedConfiguration represents the portions of PostgreSQL that are managed +by the instance manager
+ +| Field | Description |
|---|---|
roles+[]RoleConfiguration + |
+
+ Database roles managed by the |
+
services+ManagedServices + |
+
+ Services roles managed by the |
+
Appears in:
+ +ManagedRoles tracks the status of a cluster's managed roles
+ +| Field | Description |
|---|---|
byStatus+map[RoleStatus][]string + |
+
+ ByStatus gives the list of roles in each state + |
+
cannotReconcile+map[string][]string + |
+
+ CannotReconcile lists roles that cannot be reconciled in PostgreSQL, +with an explanation of the cause + |
+
passwordStatus+map[string]PasswordState + |
+
+ PasswordStatus gives the last transaction id and password secret version for each managed role + |
+
Appears in:
+ +ManagedService represents a specific service managed by the cluster. +It includes the type of service and its associated template specification.
+ +| Field | Description |
|---|---|
selectorType [Required]+ServiceSelectorType + |
+
+ SelectorType specifies the type of selectors that the service will have. +Valid values are "rw", "r", and "ro", representing read-write, read, and read-only services. + |
+
updateStrategy+ServiceUpdateStrategy + |
+
+ UpdateStrategy describes how the service differences should be reconciled + |
+
serviceTemplate [Required]+ServiceTemplateSpec + |
+
+ ServiceTemplate is the template specification for the service. + |
+
Appears in:
+ +ManagedServices represents the services managed by the cluster.
+ +| Field | Description |
|---|---|
disabledDefaultServices+[]ServiceSelectorType + |
+
+ DisabledDefaultServices is a list of service types that are disabled by default. +Valid values are "r", and "ro", representing read, and read-only services. + |
+
additional+[]ManagedService + |
+
+ Additional is a list of additional managed services specified by the user. + |
+
Appears in:
+Metadata is a structure similar to the metav1.ObjectMeta, but still +parseable by controller-gen to create a suitable CRD for the user. +The comment of PodTemplateSpec has an explanation of why we are +not using the core data types.
+ +| Field | Description |
|---|---|
name+string + |
+
+ The name of the resource. Only supported for certain types + |
+
labels+map[string]string + |
+
+ Map of string keys and values that can be used to organize and categorize +(scope and select) objects. May match selectors of replication controllers +and services. +More info: http://kubernetes.io/docs/user-guide/labels + |
+
annotations+map[string]string + |
+
+ Annotations is an unstructured key value map stored with a resource that may be +set by external tools to store and retrieve arbitrary metadata. They are not +queryable and should be preserved when modifying objects. +More info: http://kubernetes.io/docs/user-guide/annotations + |
+
Appears in:
+MonitoringConfiguration is the type containing all the monitoring +configuration for a certain cluster
+ +| Field | Description |
|---|---|
disableDefaultQueries+bool + |
+
+ Whether the default queries should be injected.
+Set it to |
+
customQueriesConfigMap+[]github.com/cloudnative-pg/machinery/pkg/api.ConfigMapKeySelector + |
+
+ The list of config maps containing the custom queries + |
+
customQueriesSecret+[]github.com/cloudnative-pg/machinery/pkg/api.SecretKeySelector + |
+
+ The list of secrets containing the custom queries + |
+
enablePodMonitor+bool + |
+
+ Enable or disable the Deprecated: This feature will be removed in an upcoming release. If +you need this functionality, you can create a PodMonitor manually. + |
+
tls+ClusterMonitoringTLSConfiguration + |
+
+ Configure TLS communication for the metrics endpoint. +Changing tls.enabled option will force a rollout of all instances. + |
+
podMonitorMetricRelabelings+[]github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring/v1.RelabelConfig + |
+
+ The list of metric relabelings for the Deprecated: This feature will be removed in an upcoming release. If +you need this functionality, you can create a PodMonitor manually. + |
+
podMonitorRelabelings+[]github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring/v1.RelabelConfig + |
+
+ The list of relabelings for the Deprecated: This feature will be removed in an upcoming release. If +you need this functionality, you can create a PodMonitor manually. + |
+
metricsQueriesTTL+meta/v1.Duration + |
+
+ The interval during which metrics computed from queries are considered current. +Once it is exceeded, a new scrape will trigger a rerun +of the queries. +If not set, defaults to 30 seconds, in line with Prometheus scraping defaults. +Setting this to zero disables the caching mechanism and can cause heavy load on the PostgreSQL server. + |
+
Appears in:
+NodeMaintenanceWindow contains information that the operator +will use while upgrading the underlying node.
+This option is only useful when the chosen storage prevents the Pods +from being freely moved across nodes.
+ +| Field | Description |
|---|---|
reusePVC+bool + |
+
+ Reuse the existing PVC (wait for the node to come
+up again) or not (recreate it elsewhere - when |
+
inProgress+bool + |
+
+ Is there a node maintenance activity in progress? + |
+
Appears in:
+OnlineConfiguration contains the configuration parameters for the online volume snapshot
+ +| Field | Description |
|---|---|
waitForArchive+bool + |
+
+ If false, the function will return immediately after the backup is completed, +without waiting for WAL to be archived. +This behavior is only useful with backup software that independently monitors WAL archiving. +Otherwise, WAL required to make the backup consistent might be missing and make the backup useless. +By default, or when this parameter is true, pg_backup_stop will wait for WAL to be archived when archiving is +enabled. +On a standby, this means that it will wait only when archive_mode = always. +If write activity on the primary is low, it may be useful to run pg_switch_wal on the primary in order to trigger +an immediate segment switch. + |
+
immediateCheckpoint+bool + |
+
+ Control whether the I/O workload for the backup initial checkpoint will
+be limited, according to the |
+
Appears in:
+OptionSpec holds the name, value and the ensure field for an option
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name of the option + |
+
value [Required]+string + |
+
+ Value of the option + |
+
ensure+EnsureOption + |
+
+ Specifies whether an option should be present or absent in
+the database. If set to |
+
Appears in:
+PasswordState represents the state of the password of a managed RoleConfiguration
+ +| Field | Description |
|---|---|
transactionID+int64 + |
+
+ the last transaction ID to affect the role definition in PostgreSQL + |
+
resourceVersion+string + |
+
+ the resource version of the password secret + |
+
Appears in:
+ +PgBouncerIntegrationStatus encapsulates the needed integration for the pgbouncer poolers referencing the cluster
+ +| Field | Description |
|---|---|
secrets+[]string + |
++ No description provided. | +
(Alias of string)
Appears in:
+ +PgBouncerPoolMode is the mode of PgBouncer
+ +Appears in:
+ +PgBouncerSecrets contains the versions of the secrets used +by pgbouncer
+ +| Field | Description |
|---|---|
authQuery+SecretVersion + |
+
+ The auth query secret version + |
+
Appears in:
+PgBouncerSpec defines how to configure PgBouncer
+ +| Field | Description |
|---|---|
poolMode+PgBouncerPoolMode + |
+
+ The pool mode. Default: |
+
serverTLSSecret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ ServerTLSSecret, when pointing to a TLS secret, provides pgbouncer's
+ |
+
serverCASecret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ ServerCASecret provides PgBouncer’s server_tls_ca_file, the root +CA for validating PostgreSQL certificates + |
+
clientCASecret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ ClientCASecret provides PgBouncer’s client_tls_ca_file, the root +CA for validating client certificates + |
+
clientTLSSecret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ ClientTLSSecret provides PgBouncer’s client_tls_key_file (private key) +and client_tls_cert_file (certificate) used to accept client connections + |
+
authQuerySecret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ The credentials of the user that need to be used for the authentication +query. In case it is specified, also an AuthQuery +(e.g. "SELECT usename, passwd FROM pg_catalog.pg_shadow WHERE usename=$1") +has to be specified and no automatic CNPG Cluster integration will be triggered. +Deprecated. + |
+
authQuery+string + |
+
+ The query that will be used to download the hash of the password +of a certain user. Default: "SELECT usename, passwd FROM public.user_search($1)". +In case it is specified, also an AuthQuerySecret has to be specified and +no automatic CNPG Cluster integration will be triggered. + |
+
parameters+map[string]string + |
+
+ Additional parameters to be passed to PgBouncer - please check +the CNPG documentation for a list of options you can configure + |
+
pg_hba+[]string + |
+
+ PostgreSQL Host Based Authentication rules (lines to be appended +to the pg_hba.conf file) + |
+
paused+bool + |
+
+ When set to |
+
Appears in:
+PluginConfiguration specifies a plugin that need to be loaded for this +cluster to be reconciled
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name is the plugin name + |
+
enabled+bool + |
+
+ Enabled is true if this plugin will be used + |
+
isWALArchiver+bool + |
+
+ Marks the plugin as the WAL archiver. At most one plugin can be
+designated as a WAL archiver. This cannot be enabled if the
+ |
+
parameters+map[string]string + |
+
+ Parameters is the configuration of the plugin + |
+
Appears in:
+ +PluginStatus is the status of a loaded plugin
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name is the name of the plugin + |
+
version [Required]+string + |
+
+ Version is the version of the plugin loaded by the +latest reconciliation loop + |
+
capabilities+[]string + |
+
+ Capabilities are the list of capabilities of the +plugin + |
+
operatorCapabilities+[]string + |
+
+ OperatorCapabilities are the list of capabilities of the +plugin regarding the reconciler + |
+
walCapabilities+[]string + |
+
+ WALCapabilities are the list of capabilities of the +plugin regarding the WAL management + |
+
backupCapabilities+[]string + |
+
+ BackupCapabilities are the list of capabilities of the +plugin regarding the Backup management + |
+
restoreJobHookCapabilities+[]string + |
+
+ RestoreJobHookCapabilities are the list of capabilities of the +plugin regarding the RestoreJobHook management + |
+
status+string + |
+
+ Status contain the status reported by the plugin through the SetStatusInCluster interface + |
+
Appears in:
+PodTemplateSpec is a structure allowing the user to set +a template for Pod generation.
+Unfortunately we can't use the corev1.PodTemplateSpec +type because the generated CRD won't have the field for the +metadata section.
+References: +https://github.com/kubernetes-sigs/controller-tools/issues/385 +https://github.com/kubernetes-sigs/controller-tools/issues/448 +https://github.com/prometheus-operator/prometheus-operator/issues/3041
+ +| Field | Description |
|---|---|
metadata+Metadata + |
+
+ Standard object's metadata. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#metadata + |
+
spec+core/v1.PodSpec + |
+
+ Specification of the desired behavior of the pod. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
(Alias of map[string]string)
Appears in:
+PodTopologyLabels represent the topology of a Pod. map[labelName]labelValue
+ +Appears in:
+ +PoolerIntegrations encapsulates the needed integration for the poolers referencing the cluster
+ +| Field | Description |
|---|---|
pgBouncerIntegration+PgBouncerIntegrationStatus + |
++ No description provided. | +
Appears in:
+PoolerMonitoringConfiguration is the type containing all the monitoring +configuration for a certain Pooler.
+Mirrors the Cluster's MonitoringConfiguration but without the custom queries +part for now.
+ +| Field | Description |
|---|---|
enablePodMonitor+bool + |
+
+ Enable or disable the |
+
podMonitorMetricRelabelings+[]github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring/v1.RelabelConfig + |
+
+ The list of metric relabelings for the |
+
podMonitorRelabelings+[]github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring/v1.RelabelConfig + |
+
+ The list of relabelings for the |
+
Appears in:
+PoolerSecrets contains the versions of all the secrets used
+ +| Field | Description |
|---|---|
clientTLS+SecretVersion + |
+
+ The client TLS secret version + |
+
serverTLS+SecretVersion + |
+
+ The server TLS secret version + |
+
serverCA+SecretVersion + |
+
+ The server CA secret version + |
+
clientCA+SecretVersion + |
+
+ The client CA secret version + |
+
pgBouncerSecrets+PgBouncerSecrets + |
+
+ The version of the secrets used by PgBouncer + |
+
Appears in:
+PoolerSpec defines the desired state of Pooler
+ +| Field | Description |
|---|---|
cluster [Required]+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ This is the cluster reference on which the Pooler will work. +Pooler name should never match with any cluster name within the same namespace. + |
+
type+PoolerType + |
+
+ Type of service to forward traffic to. Default: |
+
instances+int32 + |
+
+ The number of replicas we want. Default: 1. + |
+
template+PodTemplateSpec + |
+
+ The template of the Pod to be created + |
+
pgbouncer [Required]+PgBouncerSpec + |
+
+ The PgBouncer configuration + |
+
deploymentStrategy+apps/v1.DeploymentStrategy + |
+
+ The deployment strategy to use for pgbouncer to replace existing pods with new ones + |
+
monitoring+PoolerMonitoringConfiguration + |
+
+ The configuration of the monitoring infrastructure of this pooler. +Deprecated: This feature will be removed in an upcoming release. If +you need this functionality, you can create a PodMonitor manually. + |
+
serviceTemplate+ServiceTemplateSpec + |
+
+ Template for the Service to be created + |
+
Appears in:
+PoolerStatus defines the observed state of Pooler
+ +| Field | Description |
|---|---|
secrets+PoolerSecrets + |
+
+ The resource version of the config object + |
+
instances+int32 + |
+
+ The number of pods trying to be scheduled + |
+
(Alias of string)
Appears in:
+PoolerType is the type of the connection pool, meaning the service
+we are targeting. Allowed values are rw and ro.
Appears in:
+PostgresConfiguration defines the PostgreSQL configuration
+ +| Field | Description |
|---|---|
parameters+map[string]string + |
+
+ PostgreSQL configuration options (postgresql.conf) + |
+
synchronous+SynchronousReplicaConfiguration + |
+
+ Configuration of the PostgreSQL synchronous replication feature + |
+
pg_hba+[]string + |
+
+ PostgreSQL Host Based Authentication rules (lines to be appended +to the pg_hba.conf file) + |
+
pg_ident+[]string + |
+
+ PostgreSQL User Name Maps rules (lines to be appended +to the pg_ident.conf file) + |
+
syncReplicaElectionConstraint+SyncReplicaElectionConstraints + |
+
+ Requirements to be met by sync replicas. This will affect how the "synchronous_standby_names" parameter will be +set up. + |
+
shared_preload_libraries+[]string + |
+
+ Lists of shared preload libraries to add to the default ones + |
+
ldap+LDAPConfig + |
+
+ Options to specify LDAP configuration + |
+
promotionTimeout+int32 + |
+
+ Specifies the maximum number of seconds to wait when promoting an instance to primary. +Default value is 40000000, greater than one year in seconds, +big enough to simulate an infinite timeout + |
+
enableAlterSystem+bool + |
+
+ If this parameter is true, the user will be able to invoke |
+
extensions+[]ExtensionConfiguration + |
+
+ The configuration of the extensions to be added + |
+
(Alias of string)
Appears in:
+PrimaryUpdateMethod contains the method to use when upgrading +the primary server of the cluster as part of rolling updates
+ +(Alias of string)
Appears in:
+PrimaryUpdateStrategy contains the strategy to follow when upgrading +the primary server of the cluster as part of rolling updates
+ +Appears in:
+Probe describes a health check to be performed against a container to determine whether it is +alive or ready to receive traffic.
+ +| Field | Description |
|---|---|
initialDelaySeconds+int32 + |
+
+ Number of seconds after the container has started before liveness probes are initiated. +More info: https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle#container-probes + |
+
timeoutSeconds+int32 + |
+
+ Number of seconds after which the probe times out. +Defaults to 1 second. Minimum value is 1. +More info: https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle#container-probes + |
+
periodSeconds+int32 + |
+
+ How often (in seconds) to perform the probe. +Default to 10 seconds. Minimum value is 1. + |
+
successThreshold+int32 + |
+
+ Minimum consecutive successes for the probe to be considered successful after having failed. +Defaults to 1. Must be 1 for liveness and startup. Minimum value is 1. + |
+
failureThreshold+int32 + |
+
+ Minimum consecutive failures for the probe to be considered failed after having succeeded. +Defaults to 3. Minimum value is 1. + |
+
terminationGracePeriodSeconds+int64 + |
+
+ Optional duration in seconds the pod needs to terminate gracefully upon probe failure. +The grace period is the duration in seconds after the processes running in the pod are sent +a termination signal and the time when the processes are forcibly halted with a kill signal. +Set this value longer than the expected cleanup time for your process. +If this value is nil, the pod's terminationGracePeriodSeconds will be used. Otherwise, this +value overrides the value provided by the pod spec. +Value must be non-negative integer. The value zero indicates stop immediately via +the kill signal (no opportunity to shut down). +This is a beta field and requires enabling ProbeTerminationGracePeriod feature gate. +Minimum value is 1. spec.terminationGracePeriodSeconds is used if unset. + |
+
(Alias of string)
Appears in:
+ +ProbeStrategyType is the type of the strategy used to declare a PostgreSQL instance +ready
+ +Appears in:
+ +ProbeWithStrategy is the configuration of the startup probe
+ +| Field | Description |
|---|---|
Probe+Probe + |
+(Members of Probe are embedded into this type.)
+ Probe is the standard probe configuration + |
+
type+ProbeStrategyType + |
+
+ The probe strategy + |
+
maximumLag+k8s.io/apimachinery/pkg/api/resource.Quantity + |
+
+ Lag limit. Used only for |
+
Appears in:
+ProbesConfiguration represent the configuration for the probes +to be injected in the PostgreSQL Pods
+ +| Field | Description |
|---|---|
startup [Required]+ProbeWithStrategy + |
+
+ The startup probe configuration + |
+
liveness [Required]+LivenessProbe + |
+
+ The liveness probe configuration + |
+
readiness [Required]+ProbeWithStrategy + |
+
+ The readiness probe configuration + |
+
(Alias of string)
Appears in:
+ +PublicationReclaimPolicy defines a policy for end-of-life maintenance of Publications.
+ +Appears in:
+PublicationSpec defines the desired state of Publication
+ +| Field | Description |
|---|---|
cluster [Required]+core/v1.LocalObjectReference + |
+
+ The name of the PostgreSQL cluster that identifies the "publisher" + |
+
name [Required]+string + |
+
+ The name of the publication inside PostgreSQL + |
+
dbname [Required]+string + |
+
+ The name of the database where the publication will be installed in +the "publisher" cluster + |
+
parameters+map[string]string + |
+
+ Publication parameters part of the |
+
target [Required]+PublicationTarget + |
+
+ Target of the publication as expected by PostgreSQL |
+
publicationReclaimPolicy+PublicationReclaimPolicy + |
+
+ The policy for end-of-life maintenance of this publication + |
+
Appears in:
+PublicationStatus defines the observed state of Publication
+ +| Field | Description |
|---|---|
observedGeneration+int64 + |
+
+ A sequence number representing the latest +desired state that was synchronized + |
+
applied+bool + |
+
+ Applied is true if the publication was reconciled correctly + |
+
message+string + |
+
+ Message is the reconciliation output message + |
+
Appears in:
+ +PublicationTarget is what this publication should publish
+ +| Field | Description |
|---|---|
allTables+bool + |
+
+ Marks the publication as one that replicates changes for all tables
+in the database, including tables created in the future.
+Corresponding to |
+
objects+[]PublicationTargetObject + |
+
+ Just the following schema objects + |
+
Appears in:
+ +PublicationTargetObject is an object to publish
+ +| Field | Description |
|---|---|
tablesInSchema+string + |
+
+ Marks the publication as one that replicates changes for all tables
+in the specified list of schemas, including tables created in the
+future. Corresponding to |
+
table+PublicationTargetTable + |
+
+ Specifies a list of tables to add to the publication. Corresponding
+to |
+
Appears in:
+ +PublicationTargetTable is a table to publish
+ +| Field | Description |
|---|---|
only+bool + |
+
+ Whether to limit to the table only or include all its descendants + |
+
name [Required]+string + |
+
+ The table name + |
+
schema+string + |
+
+ The schema name + |
+
columns+[]string + |
+
+ The columns to publish + |
+
Appears in:
+ +RecoveryTarget allows to configure the moment where the recovery process +will stop. All the target options except TargetTLI are mutually exclusive.
+ +| Field | Description |
|---|---|
backupID+string + |
+
+ The ID of the backup from which to start the recovery process. +If empty (default) the operator will automatically detect the backup +based on targetTime or targetLSN if specified. Otherwise use the +latest available backup in chronological order. + |
+
targetTLI+string + |
+
+ The target timeline ("latest" or a positive integer) + |
+
targetXID+string + |
+
+ The target transaction ID + |
+
targetName+string + |
+
+ The target name (to be previously created
+with |
+
targetLSN+string + |
+
+ The target LSN (Log Sequence Number) + |
+
targetTime+string + |
+
+ The target time as a timestamp in the RFC3339 standard + |
+
targetImmediate+bool + |
+
+ End recovery as soon as a consistent state is reached + |
+
exclusive+bool + |
+
+ Set the target to be exclusive. If omitted, defaults to false, so that
+in Postgres, |
+
Appears in:
+ReplicaClusterConfiguration encapsulates the configuration of a replica +cluster
+ +| Field | Description |
|---|---|
self+string + |
+
+ Self defines the name of this cluster. It is used to determine if this is a primary
+or a replica cluster, comparing it with |
+
primary+string + |
+
+ Primary defines which Cluster is defined to be the primary in the distributed PostgreSQL cluster, based on the +topology specified in externalClusters + |
+
source [Required]+string + |
+
+ The name of the external cluster which is the replication origin + |
+
enabled+bool + |
+
+ If replica mode is enabled, this cluster will be a replica of an +existing cluster. Replica cluster can be created from a recovery +object store or via streaming through pg_basebackup. +Refer to the Replica clusters page of the documentation for more information. + |
+
promotionToken+string + |
+
+ A demotion token generated by an external cluster used to +check if the promotion requirements are met. + |
+
minApplyDelay+meta/v1.Duration + |
+
+ When replica mode is enabled, this parameter allows you to replay +transactions only when the system time is at least the configured +time past the commit time. This provides an opportunity to correct +data loss errors. Note that when this parameter is set, a promotion +token cannot be used. + |
+
Appears in:
+ReplicationSlotsConfiguration encapsulates the configuration +of replication slots
+ +| Field | Description |
|---|---|
highAvailability+ReplicationSlotsHAConfiguration + |
+
+ Replication slots for high availability configuration + |
+
updateInterval+int + |
+
+ Standby will update the status of the local replication slots
+every |
+
synchronizeReplicas+SynchronizeReplicasConfiguration + |
+
+ Configures the synchronization of the user defined physical replication slots + |
+
Appears in:
+ +ReplicationSlotsHAConfiguration encapsulates the configuration +of the replication slots that are automatically managed by +the operator to control the streaming replication connections +with the standby instances for high availability (HA) purposes. +Replication slots are a PostgreSQL feature that makes sure +that PostgreSQL automatically keeps WAL files in the primary +when a streaming client (in this specific case a replica that +is part of the HA cluster) gets disconnected.
+ +| Field | Description |
|---|---|
enabled+bool + |
+
+ If enabled (default), the operator will automatically manage replication slots +on the primary instance and use them in streaming replication +connections with all the standby instances that are part of the HA +cluster. If disabled, the operator will not take advantage +of replication slots in streaming connections with the replicas. +This feature also controls replication slots in replica cluster, +from the designated primary to its cascading replicas. + |
+
slotPrefix+string + |
+
+ Prefix for replication slots managed by the operator for HA.
+It may only contain lower case letters, numbers, and the underscore character.
+This can only be set at creation time. By default set to |
+
synchronizeLogicalDecoding+bool + |
+
+ When enabled, the operator automatically manages synchronization of logical +decoding (replication) slots across high-availability clusters. +Requires one of the following conditions: +
|
+
Appears in:
+ +RoleConfiguration is the representation, in Kubernetes, of a PostgreSQL role +with the additional field Ensure specifying whether to ensure the presence or +absence of the role in the database
+The defaults of the CREATE ROLE command are applied +Reference: https://www.postgresql.org/docs/current/sql-createrole.html
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name of the role + |
+
comment+string + |
+
+ Description of the role + |
+
ensure+EnsureOption + |
+
+ Ensure the role is |
+
passwordSecret+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ Secret containing the password of the role (if present) +If null, the password will be ignored unless DisablePassword is set + |
+
connectionLimit+int64 + |
+
+ If the role can log in, this specifies how many concurrent
+connections the role can make. |
+
validUntil+meta/v1.Time + |
+
+ Date and time after which the role's password is no longer valid. +When omitted, the password will never expire (default). + |
+
inRoles+[]string + |
+
+ List of one or more existing roles to which this role will be +immediately added as a new member. Default empty. + |
+
inherit+bool + |
+
+ Whether a role "inherits" the privileges of roles it is a member of.
+Defaults is |
+
disablePassword+bool + |
+
+ DisablePassword indicates that a role's password should be set to NULL in Postgres + |
+
superuser+bool + |
+
+ Whether the role is a |
+
createdb+bool + |
+
+ When set to |
+
createrole+bool + |
+
+ Whether the role will be permitted to create, alter, drop, comment
+on, change the security label for, and grant or revoke membership in
+other roles. Default is |
+
login+bool + |
+
+ Whether the role is allowed to log in. A role having the |
+
replication+bool + |
+
+ Whether a role is a replication role. A role must have this
+attribute (or be a superuser) in order to be able to connect to the
+server in replication mode (physical or logical replication) and in
+order to be able to create or drop replication slots. A role having
+the |
+
bypassrls+bool + |
+
+ Whether a role bypasses every row-level security (RLS) policy.
+Default is |
+
Appears in:
+ +SQLRefs holds references to ConfigMaps or Secrets +containing SQL files. The references are processed in a specific order: +first, all Secrets are processed, followed by all ConfigMaps. +Within each group, the processing order follows the sequence specified +in their respective arrays.
+ +| Field | Description |
|---|---|
secretRefs+[]github.com/cloudnative-pg/machinery/pkg/api.SecretKeySelector + |
+
+ SecretRefs holds a list of references to Secrets + |
+
configMapRefs+[]github.com/cloudnative-pg/machinery/pkg/api.ConfigMapKeySelector + |
+
+ ConfigMapRefs holds a list of references to ConfigMaps + |
+
Appears in:
+ +ScheduledBackupSpec defines the desired state of ScheduledBackup
+ +| Field | Description |
|---|---|
suspend+bool + |
+
+ If this backup is suspended or not + |
+
immediate+bool + |
+
+ If the first backup has to be immediately start after creation or not + |
+
schedule [Required]+string + |
+
+ The schedule does not follow the same format used in Kubernetes CronJobs +as it includes an additional seconds specifier, +see https://pkg.go.dev/github.com/robfig/cron#hdr-CRON_Expression_Format + |
+
cluster [Required]+github.com/cloudnative-pg/machinery/pkg/api.LocalObjectReference + |
+
+ The cluster to backup + |
+
backupOwnerReference+string + |
+
+ Indicates which ownerReference should be put inside the created backup resources. +
|
+
target+BackupTarget + |
+
+ The policy to decide which instance should perform this backup. If empty,
+it defaults to |
+
method+BackupMethod + |
+
+ The backup method to be used, possible options are |
+
pluginConfiguration+BackupPluginConfiguration + |
+
+ Configuration parameters passed to the plugin managing this backup + |
+
online+bool + |
+
+ Whether the default type of backup with volume snapshots is
+online/hot ( |
+
onlineConfiguration+OnlineConfiguration + |
+
+ Configuration parameters to control the online/hot backup with volume snapshots +Overrides the default settings specified in the cluster '.backup.volumeSnapshot.onlineConfiguration' stanza + |
+
Appears in:
+ +ScheduledBackupStatus defines the observed state of ScheduledBackup
+ +| Field | Description |
|---|---|
lastCheckTime+meta/v1.Time + |
+
+ The latest time the schedule + |
+
lastScheduleTime+meta/v1.Time + |
+
+ Information when was the last time that backup was successfully scheduled. + |
+
nextScheduleTime+meta/v1.Time + |
+
+ Next time we will run a backup + |
+
Appears in:
+SchemaSpec configures a schema in a database
+ +| Field | Description |
|---|---|
DatabaseObjectSpec+DatabaseObjectSpec + |
+(Members of DatabaseObjectSpec are embedded into this type.)
+ Common fields + |
+
owner [Required]+string + |
+
+ The role name of the user who owns the schema inside PostgreSQL.
+It maps to the |
+
Appears in:
+SecretVersion contains a secret name and its ResourceVersion
+ +| Field | Description |
|---|---|
name+string + |
+
+ The name of the secret + |
+
version+string + |
+
+ The ResourceVersion of the secret + |
+
Appears in:
+ +SecretsResourceVersion is the resource versions of the secrets +managed by the operator
+ +| Field | Description |
|---|---|
superuserSecretVersion+string + |
+
+ The resource version of the "postgres" user secret + |
+
replicationSecretVersion+string + |
+
+ The resource version of the "streaming_replica" user secret + |
+
applicationSecretVersion+string + |
+
+ The resource version of the "app" user secret + |
+
managedRoleSecretVersion+map[string]string + |
+
+ The resource versions of the managed roles secrets + |
+
caSecretVersion+string + |
+
+ Unused. Retained for compatibility with old versions. + |
+
clientCaSecretVersion+string + |
+
+ The resource version of the PostgreSQL client-side CA secret version + |
+
serverCaSecretVersion+string + |
+
+ The resource version of the PostgreSQL server-side CA secret version + |
+
serverSecretVersion+string + |
+
+ The resource version of the PostgreSQL server-side secret version + |
+
barmanEndpointCA+string + |
+
+ The resource version of the Barman Endpoint CA if provided + |
+
externalClusterSecretVersion+map[string]string + |
+
+ The resource versions of the external cluster secrets + |
+
metrics+map[string]string + |
+
+ A map with the versions of all the secrets used to pass metrics. +Map keys are the secret names, map values are the versions + |
+
Appears in:
+ServerSpec configures a server of a foreign data wrapper
+ +| Field | Description |
|---|---|
DatabaseObjectSpec+DatabaseObjectSpec + |
+(Members of DatabaseObjectSpec are embedded into this type.)
+ Common fields + |
+
fdw [Required]+string + |
+
+ The name of the Foreign Data Wrapper (FDW) + |
+
options+[]OptionSpec + |
+
+ Options specifies the configuration options for the server +(key is the option name, value is the option value). + |
+
usage+[]UsageSpec + |
+
+ List of roles for which |
+
Appears in:
+ServiceAccountTemplate contains the template needed to generate the service accounts
+ +| Field | Description |
|---|---|
metadata [Required]+Metadata + |
+
+ Metadata are the metadata to be used for the generated +service account + |
+
(Alias of string)
Appears in:
+ServiceSelectorType describes a valid value for generating the service selectors. +It indicates which type of service the selector applies to, such as read-write, read, or read-only
+ +Appears in:
+ServiceTemplateSpec is a structure allowing the user to set +a template for Service generation.
+ +| Field | Description |
|---|---|
metadata+Metadata + |
+
+ Standard object's metadata. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#metadata + |
+
spec+core/v1.ServiceSpec + |
+
+ Specification of the desired behavior of the service. +More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status + |
+
(Alias of string)
Appears in:
+ +ServiceUpdateStrategy describes how the changes to the managed service should be handled
+ +(Alias of string)
Appears in:
+ +SnapshotOwnerReference defines the reference type for the owner of the snapshot. +This specifies which owner the processed resources should relate to.
+ +(Alias of string)
Appears in:
+SnapshotType is a type of allowed import
+ +Appears in:
+StorageConfiguration is the configuration used to create and reconcile PVCs, +usable for WAL volumes, PGDATA volumes, or tablespaces
+ +| Field | Description |
|---|---|
storageClass+string + |
+
+ StorageClass to use for PVCs. Applied after +evaluating the PVC template, if available. +If not specified, the generated PVCs will use the +default storage class + |
+
size+string + |
+
+ Size of the storage. Required if not already specified in the PVC template. +Changes to this field are automatically reapplied to the created PVCs. +Size cannot be decreased. + |
+
resizeInUseVolumes+bool + |
+
+ Resize existent PVCs, defaults to true + |
+
pvcTemplate+core/v1.PersistentVolumeClaimSpec + |
+
+ Template to be used to generate the Persistent Volume Claim + |
+
(Alias of string)
Appears in:
+ +SubscriptionReclaimPolicy describes a policy for end-of-life maintenance of Subscriptions.
+ +Appears in:
+SubscriptionSpec defines the desired state of Subscription
+ +| Field | Description |
|---|---|
cluster [Required]+core/v1.LocalObjectReference + |
+
+ The name of the PostgreSQL cluster that identifies the "subscriber" + |
+
name [Required]+string + |
+
+ The name of the subscription inside PostgreSQL + |
+
dbname [Required]+string + |
+
+ The name of the database where the publication will be installed in +the "subscriber" cluster + |
+
parameters+map[string]string + |
+
+ Subscription parameters included in the |
+
publicationName [Required]+string + |
+
+ The name of the publication inside the PostgreSQL database in the +"publisher" + |
+
publicationDBName+string + |
+
+ The name of the database containing the publication on the external +cluster. Defaults to the one in the external cluster definition. + |
+
externalClusterName [Required]+string + |
+
+ The name of the external cluster with the publication ("publisher") + |
+
subscriptionReclaimPolicy+SubscriptionReclaimPolicy + |
+
+ The policy for end-of-life maintenance of this subscription + |
+
Appears in:
+SubscriptionStatus defines the observed state of Subscription
+ +| Field | Description |
|---|---|
observedGeneration+int64 + |
+
+ A sequence number representing the latest +desired state that was synchronized + |
+
applied+bool + |
+
+ Applied is true if the subscription was reconciled correctly + |
+
message+string + |
+
+ Message is the reconciliation output message + |
+
Appears in:
+ +SwitchReplicaClusterStatus contains all the statuses regarding the switch of a cluster to a replica cluster
+ +| Field | Description |
|---|---|
inProgress+bool + |
+
+ InProgress indicates if there is an ongoing procedure of switching a cluster to a replica cluster. + |
+
Appears in:
+ +SyncReplicaElectionConstraints contains the constraints for sync replicas election.
+For anti-affinity parameters two instances are considered in the same location +if all the labels values match.
+In future synchronous replica election restriction by name will be supported.
+ +| Field | Description |
|---|---|
nodeLabelsAntiAffinity+[]string + |
+
+ A list of node labels values to extract and compare to evaluate if the pods reside in the same topology or not + |
+
enabled [Required]+bool + |
+
+ This flag enables the constraints for sync replicas + |
+
Appears in:
+ +SynchronizeReplicasConfiguration contains the configuration for the synchronization of user defined +physical replication slots
+ +| Field | Description |
|---|---|
enabled [Required]+bool + |
+
+ When set to true, every replication slot that is on the primary is synchronized on each standby + |
+
excludePatterns+[]string + |
+
+ List of regular expression patterns to match the names of replication slots to be excluded (by default empty) + |
+
Appears in:
+ +SynchronousReplicaConfiguration contains the configuration of the
+PostgreSQL synchronous replication feature.
+Important: at this moment, also .spec.minSyncReplicas and .spec.maxSyncReplicas
+need to be considered.
| Field | Description |
|---|---|
method [Required]+SynchronousReplicaConfigurationMethod + |
+
+ Method to select synchronous replication standbys from the listed +servers, accepting 'any' (quorum-based synchronous replication) or +'first' (priority-based synchronous replication) as values. + |
+
number [Required]+int + |
+
+ Specifies the number of synchronous standby servers that +transactions must wait for responses from. + |
+
maxStandbyNamesFromCluster+int + |
+
+ Specifies the maximum number of local cluster pods that can be
+automatically included in the |
+
standbyNamesPre+[]string + |
+
+ A user-defined list of application names to be added to
+ |
+
standbyNamesPost+[]string + |
+
+ A user-defined list of application names to be added to
+ |
+
dataDurability+DataDurabilityLevel + |
+
+ If set to "required", data durability is strictly enforced. Write operations
+with synchronous commit settings ( |
+
failoverQuorum+bool + |
+
+ FailoverQuorum enables a quorum-based check before failover, improving +data durability and safety during failover events in CloudNativePG-managed +PostgreSQL clusters. + |
+
(Alias of string)
Appears in:
+ +SynchronousReplicaConfigurationMethod configures whether to use +quorum based replication or a priority list
+ +Appears in:
+TablespaceConfiguration is the configuration of a tablespace, and includes +the storage specification for the tablespace
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ The name of the tablespace + |
+
storage [Required]+StorageConfiguration + |
+
+ The storage configuration for the tablespace + |
+
owner+DatabaseRoleRef + |
+
+ Owner is the PostgreSQL user owning the tablespace + |
+
temporary+bool + |
+
+ When set to true, the tablespace will be added as a |
+
Appears in:
+ +TablespaceState represents the state of a tablespace in a cluster
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name is the name of the tablespace + |
+
owner+string + |
+
+ Owner is the PostgreSQL user owning the tablespace + |
+
state [Required]+TablespaceStatus + |
+
+ State is the latest reconciliation state + |
+
error+string + |
+
+ Error is the reconciliation error, if any + |
+
(Alias of string)
Appears in:
+ +TablespaceStatus represents the status of a tablespace in the cluster
+ +Appears in:
+ +Topology contains the cluster topology
+ +| Field | Description |
|---|---|
instances+map[PodName]PodTopologyLabels + |
+
+ Instances contains the pod topology of the instances + |
+
nodesUsed+int32 + |
+
+ NodesUsed represents the count of distinct nodes accommodating the instances. +A value of '1' suggests that all instances are hosted on a single node, +implying the absence of High Availability (HA). Ideally, this value should +be the same as the number of instances in the Postgres HA cluster, implying +shared nothing architecture on the compute side. + |
+
successfullyExtracted+bool + |
+
+ SuccessfullyExtracted indicates if the topology data was extract. It is useful to enact fallback behaviors +in synchronous replica election in case of failures + |
+
Appears in:
+UsageSpec configures a usage for a foreign data wrapper
+ +| Field | Description |
|---|---|
name [Required]+string + |
+
+ Name of the usage + |
+
type+UsageSpecType + |
+
+ The type of usage + |
+
(Alias of string)
Appears in:
+UsageSpecType describes the type of usage specified in the usage field of the
+Database object.
Appears in:
+ +VolumeSnapshotConfiguration represents the configuration for the execution of snapshot backups.
+ +| Field | Description |
|---|---|
labels+map[string]string + |
+
+ Labels are key-value pairs that will be added to .metadata.labels snapshot resources. + |
+
annotations+map[string]string + |
+
+ Annotations key-value pairs that will be added to .metadata.annotations snapshot resources. + |
+
className+string + |
+
+ ClassName specifies the Snapshot Class to be used for PG_DATA PersistentVolumeClaim. +It is the default class for the other types if no specific class is present + |
+
walClassName+string + |
+
+ WalClassName specifies the Snapshot Class to be used for the PG_WAL PersistentVolumeClaim. + |
+
tablespaceClassName+map[string]string + |
+
+ TablespaceClassName specifies the Snapshot Class to be used for the tablespaces. +defaults to the PGDATA Snapshot Class, if set + |
+
snapshotOwnerReference+SnapshotOwnerReference + |
+
+ SnapshotOwnerReference indicates the type of owner reference the snapshot should have + |
+
online+bool + |
+
+ Whether the default type of backup with volume snapshots is
+online/hot ( |
+
onlineConfiguration+OnlineConfiguration + |
+
+ Configuration parameters to control the online/hot backup with volume snapshots + |
+
CloudNativePG supports mounting custom files inside the Postgres pods through
+.spec.projectedVolumeTemplate. This ability is useful for several Postgres
+features and extensions that require additional data files.
+In CloudNativePG, the .spec.projectedVolumeTemplate field is a
+projected volume
+template in Kubernetes that allows you to mount arbitrary data under the
+/projected folder in Postgres pods.
This simple example shows how to mount an existing TLS secret (named
+sample-secret) as files into Postgres pods. The values for the secret keys
+tls.crt and tls.key in sample-secret are mounted as files into the paths
+/projected/certificate/tls.crt and /projected/certificate/tls.key in the
+Postgres pod.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-projected-volumes
+spec:
+ instances: 3
+ projectedVolumeTemplate:
+ sources:
+ - secret:
+ name: sample-secret
+ items:
+ - key: tls.crt
+ path: certificate/tls.crt
+ - key: tls.key
+ path: certificate/tls.key
+ storage:
+ size: 1Gi
+
+You can find a complete example that uses a projected volume template to mount +the secret and ConfigMap in the +cluster-example-projected-volume.yaml +deployment manifest.
+CloudNativePG relies on ephemeral volumes +for part of the internal activities. Ephemeral volumes exist for the sole +duration of a pod's life, without persisting across pod restarts.
+The operator uses by default an emptyDir volume, which can be customized by using the .spec.ephemeralVolumesSizeLimit field.
+This can be overridden by specifying a volume claim template in the .spec.ephemeralVolumeSource field.
In the following example, a 1Gi ephemeral volume is set.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-ephemeral-volume-source
+spec:
+ instances: 3
+ ephemeralVolumeSource:
+ volumeClaimTemplate:
+ spec:
+ accessModes: ["ReadWriteOnce"]
+ # example storageClassName, replace with one existing in your Kubernetes cluster
+ storageClassName: "scratch-storage-class"
+ resources:
+ requests:
+ storage: 1Gi
+
+Both .spec.emphemeralVolumeSource and .spec.ephemeralVolumesSizeLimit.temporaryData cannot be specified simultaneously.
This volume is used as shared memory space for Postgres and as an ephemeral
+type but stored in memory. You can configure an upper bound on the size using
+the .spec.ephemeralVolumesSizeLimit.shm field in the cluster spec.
+Use this field only in case of
+PostgreSQL running with posix shared memory dynamic allocation.
You can customize some system behavior using environment variables. One example
+is the LDAPCONF variable, which can point to a custom LDAP configuration
+file. Another example is the TZ environment variable, which represents the
+timezone used by the PostgreSQL container.
CloudNativePG allows you to set custom environment variables using the env
+and the envFrom stanza of the cluster specification.
This example defines a PostgreSQL cluster using the Australia/Sydney
+timezone as the default cluster-level timezone:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+
+ env:
+ - name: TZ
+ value: Australia/Sydney
+
+ storage:
+ size: 1Gi
+
+The envFrom stanza can refer to ConfigMaps or secrets to use their content as
+environment variables:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+
+ envFrom:
+ - configMapRef:
+ name: config-map-name
+ - secretRef:
+ name: secret-name
+
+ storage:
+ size: 1Gi
+
+The operator doesn't allow setting the following environment variables:
+POD_NAMENAMESPACEPG.Any change in the env or in the envFrom section triggers a rolling
+update of the PostgreSQL pods.
If the env or the envFrom section refers to a secret or a ConfigMap, the
+operator doesn't detect any changes in them and doesn't trigger a rollout. The
+kubelet uses the same behavior with pods, and you must trigger the pod rollout
+manually.
Cilium is a CNCF Graduated project that was accepted as +an Incubating project in 2021 and graduated in 2023. It was originally created +by Isovalent. It is an advanced networking, security, and observability +solution for cloud native environments, built on top of +eBPF technology. Cilium manages network traffic in +Kubernetes clusters by dynamically injecting eBPF programs into the Linux +Kernel, enabling low-latency, high-performance communication, and enforcing +fine-grained security policies.
+Key features of Cilium:
+To install Cilium in your environment, follow the instructions in the documentation: +https://docs.cilium.io/en/stable/gettingstarted/k8s-install-default/
+Kubernetes’ default behavior is to allow traffic between any two Pods in the cluster network.
+Cilium provides advanced L3/L4 network security using the CiliumNetworkPolicy resource. This
+enables fine-grained control over network traffic between Pods within a Kubernetes cluster. It is
+especially useful for securing communication between application workloads and backend
+services.
In the following examples, we demonstrate how Cilium can be used to secure a +CloudNativePG PostgreSQL instance by restricting ingress traffic to only +authorized Pods.
+Important
+Before proceeding, ensure that the cluster-example Postgres cluster is up
+and running in your environment.
By default, Cilium does not deny all traffic unless explicitly configured +to do so. In contrast to Kubernetes NetworkPolicy, which uses a deny-by-default +model once a policy is present in a namespace, Cilium provides more flexible +control over default deny behavior.
+To enforce a default deny posture with Cilium, you need to explicitly create a
+policy that denies all traffic to a set of Pods unless otherwise allowed. This
+is commonly achieved by using an empty ingress section in combination
+with endpointSelector, or by enabling --enable-default-deny at the
+Cilium agent level for broader enforcement.
A minimal example of a default deny policy:
+apiVersion: cilium.io/v2
+kind: CiliumNetworkPolicy
+metadata:
+ name: default-deny
+ namespace: default
+spec:
+ description: "Default deny all ingress traffic to all Pods in this namespace"
+ endpointSelector: {}
+ ingress: []
+
+When working with a network policy, Cilium or not, the first step is to make
+sure that the operator can reach the Pods in the target namespace. This is
+important because the operator needs to be able to perform checks and actions
+on the Pods, and one of those actions requires access to the port 8000 on the
+Pods to get the current status of the PostgreSQL instance running inside.
The following CiliumNetworkPolicy allows the operator to access the Pods in
+the target default namespace:
apiVersion: cilium.io/v2
+kind: CiliumNetworkPolicy
+metadata:
+ name: cnpg-operator-access
+ namespace: default
+spec:
+ description: "Allow CloudNativePG operator access to any pod in the target namespace"
+ endpointSelector: {}
+ ingress:
+ - fromEndpoints:
+ - matchLabels:
+ io.kubernetes.pod.namespace: cnpg-system
+ toPorts:
+ - ports:
+ - port: "8000"
+ protocol: TCP
+
+Important
+The cnpg-system namespace is the default namespace for the operator when
+using the YAML manifests. If the operator was installed using a different
+process (Helm, OLM, etc.), the namespace may be different. Make sure to adjust
+the namespace properly.
Since the default policy is "deny all", we need to explicitly allow access +between the cluster Pods in the same namespace. We will improve our previous +policy by adding the required ingress rule:
+apiVersion: cilium.io/v2
+kind: CiliumNetworkPolicy
+metadata:
+ name: cnpg-cluster-internal-access
+ namespace: default
+spec:
+ description: "Allow CloudNativePG operator access and connection between pods in the same namespace"
+ endpointSelector: {}
+ ingress:
+ - fromEndpoints:
+ - matchLabels:
+ io.kubernetes.pod.namespace: cnpg-system
+ - matchLabels:
+ io.kubernetes.pod.namespace: default
+ cnpg.io/cluster: cluster-example
+ toPorts:
+ - ports:
+ - port: "8000"
+ protocol: TCP
+ - port: "5432"
+ protocol: TCP
+
+The policy allows access from cnpg-system Pods and from default namespace
+Pods that also belong to cluster-example. The matchLabels selector requires
+Pods to have the complete set of listed labels. Missing even one label means
+the Pod will not match.
In this example, we define a CiliumNetworkPolicy that allows only Pods
+labeled role=backend in the default namespace to connect to a PostgreSQL
+cluster named cluster-example. All other ingress traffic is blocked by
+default.
apiVersion: cilium.io/v2
+kind: CiliumNetworkPolicy
+metadata:
+ name: postgres-access-backend-label
+ namespace: default
+spec:
+ description: "Allow PostgreSQL access on port 5432 from Pods with role=backend"
+ endpointSelector:
+ matchLabels:
+ cnpg.io/cluster: cluster-example
+ ingress:
+ - fromEndpoints:
+ - matchLabels:
+ role: backend
+ toPorts:
+ - ports:
+ - port: "5432"
+ protocol: TCP
+
+This CiliumNetworkPolicy ensures that only Pods labeled with role=backend
+can access the PostgreSQL instance managed by CloudNativePG via port 5432 in
+the default namespace.
In the following policy, we demonstrate how to allow ingress traffic to port
+5432 of a PostgreSQL cluster named cluster-example, only from Pods with the
+label role=backend in any namespace.
apiVersion: cilium.io/v2
+kind: CiliumNetworkPolicy
+metadata:
+ name: postgres-access-backend-any-ns
+ namespace: default
+spec:
+ description: "Allow PostgreSQL access on port 5432 from Pods with role=backend in any namespace"
+ endpointSelector:
+ matchLabels:
+ cnpg.io/cluster: cluster-example
+ ingress:
+ - fromEndpoints:
+ - labelSelector:
+ matchLabels:
+ role: backend
+ matchExpressions:
+ - key: io.kubernetes.pod.namespace
+ operator: Exists
+ toPorts:
+ - ports:
+ - port: "5432"
+ protocol: TCP
+
+The following example allows ingress traffic to port 5432 of the
+cluster-example cluster (located in the default namespace) from any Pods in
+the backend namespace.
apiVersion: cilium.io/v2
+kind: CiliumNetworkPolicy
+metadata:
+ name: postgres-access-backend-namespace
+ namespace: default
+spec:
+ description: "Allow PostgreSQL access on port 5432 from any Pods in the backend namespace"
+ endpointSelector:
+ matchLabels:
+ cnpg.io/cluster: cluster-example
+ ingress:
+ - fromEndpoints:
+ - matchLabels:
+ io.kubernetes.pod.namespace: backend
+ toPorts:
+ - ports:
+ - port: "5432"
+ protocol: TCP
+
+Using Cilium’s L3/L4 policy model, we define a CiliumNetworkPolicy that
+explicitly allows ingress traffic to cluster Pods only from application Pods in
+the backend namespace. All other traffic is implicitly denied unless
+explicitly permitted by additional policies.
The following example allows ingress traffic to port 5432 of the
+cluster-example cluster (located in the default namespace) from any source
+within the Kubernetes cluster.
apiVersion: cilium.io/v2
+kind: CiliumNetworkPolicy
+metadata:
+ name: postgres-access-cluster-wide
+ namespace: default
+spec:
+ description: "Allow ingress traffic to port 5432 of the cluster-example from any pods within the Kubernetes cluster"
+ endpointSelector:
+ matchLabels:
+ cnpg.io/cluster: cluster-example
+ ingress:
+ - fromEntities:
+ - cluster
+ toPorts:
+ - ports:
+ - port: "5432"
+ protocol: TCP
+
+You may consider using editor.networkpolicy.io, +a visual and interactive tool that simplifies the creation and validation of +Cilium Network Policies. It’s especially helpful for avoiding misconfigurations +and understanding traffic rules more clearly by presenting in a visual way.
+With these policies, you've established baseline access controls for +PostgreSQL. You can layer additional egress or audit rules using Cilium's +policy language or extend to L7 enforcement with Envoy.
+ +External Secrets is a CNCF Sandbox +project, accepted in 2022 under the sponsorship of TAG Security.
+The External Secrets Operator (ESO) is a Kubernetes operator that enhances
+secret management by decoupling the storage of secrets from Kubernetes itself.
+It enables seamless synchronization between external secret management systems
+and native Kubernetes Secret resources.
ESO supports a wide range of backends, including:
+…and many more. For a full and up-to-date list of supported providers, refer to +the official External Secrets documentation.
+When it comes to PostgreSQL databases, External Secrets integrates seamlessly +with CloudNativePG in two major use cases:
+Automated password management: ESO can handle the automatic generation
+ and rotation of database user passwords stored in Kubernetes Secret
+ resources, ensuring that applications running inside the cluster always have
+ access to up-to-date credentials.
Cross-platform secret access: It enables transparent synchronization of
+ those passwords with an external Key Management Service (KMS) via a
+ SecretStore resources. This allows applications and developers outside the
+ Kubernetes cluster—who may not have access to Kubernetes secrets—to retrieve
+ the database credentials directly from the external KMS.
Let’s walk through how to automatically rotate the password of the app user
+every 24 hours in the cluster-example Postgres cluster from the
+quickstart guide.
Important
+Before proceeding, ensure that the cluster-example Postgres cluster is up
+and running in your environment.
By default, CloudNativePG generates and manages a Kubernetes Secret named
+cluster-example-app, which contains the credentials for the app user in the
+cluster-example cluster. You can read more about this in the
+“Connecting from an application” section.
With External Secrets, the goal is to:
+Password generator that specifies how to generate the password.ExternalSecret resource that keeps the cluster-example-app
+ secret in sync by updating only the password and pgpass fields.The following example creates a
+Password generator
+resource named pg-password-generator in the default namespace. You can
+customize the name and properties to suit your needs:
apiVersion: generators.external-secrets.io/v1alpha1
+kind: Password
+metadata:
+ name: pg-password-generator
+spec:
+ length: 42
+ digits: 5
+ symbols: 5
+ symbolCharacters: "-_$@"
+ noUpper: false
+ allowRepeat: true
+
+This specification defines the characteristics of the generated password, +including its length and the inclusion of digits, symbols, and uppercase +letters.
+The example below creates an ExternalSecret resource named
+cluster-example-app-secret, which refreshes the password every 24 hours. It
+uses a Merge policy to update only the specified fields (password, pgpass,
+jdbc-uri and uri) in the cluster-example-app secret.
apiVersion: external-secrets.io/v1
+kind: ExternalSecret
+metadata:
+ name: cluster-example-app-secret
+spec:
+ refreshInterval: "24h"
+ target:
+ name: cluster-example-app
+ creationPolicy: Merge
+ template:
+ metadata:
+ labels:
+ cnpg.io/reload: "true"
+ data:
+ password: "{{ .password }}"
+ pgpass: "cluster-example-rw:5432:app:app:{{ .password }}"
+ jdbc-uri: "jdbc:postgresql://cluster-example-rw.default:5432/app?password={{ .password }}&user=app"
+ uri: "postgresql://app:{{ .password }}@cluster-example-rw.default:5432/app"
+ dataFrom:
+ - sourceRef:
+ generatorRef:
+ apiVersion: generators.external-secrets.io/v1alpha1
+ kind: Password
+ name: pg-password-generator
+
+The label cnpg.io/reload: "true" ensures that CloudNativePG triggers a reload
+of the user password in the database when the secret changes.
To check that the ExternalSecret is correctly synchronizing:
kubectl get es cluster-example-app-secret
+
+To observe the password being refreshed in real time, temporarily reduce the
+refreshInterval to 30s and run the following command repeatedly:
kubectl get secret cluster-example-app \
+ -o jsonpath="{.data.password}" | base64 -d
+
+You should see the password change every 30 seconds, confirming that the +rotation is working correctly.
+While the example above focuses on the default cluster-example-app secret
+created by CloudNativePG, the same approach can be extended to manage any
+custom secrets or PostgreSQL users you create to regularly rotate their
+password.
One of the most widely used Key Management Service (KMS) providers in the CNCF +ecosystem is HashiCorp Vault. Although Vault is +licensed under the Business Source License (BUSL), a fully compatible and +actively maintained open source alternative is available: OpenBao. +OpenBao supports all the same interfaces as HashiCorp Vault, making it a true +drop-in replacement.
+In this example, we'll demonstrate how to integrate CloudNativePG, +External Secrets Operator, and HashiCorp Vault to automatically rotate +a PostgreSQL password and securely store it in Vault.
+Important
+This example assumes that HashiCorp Vault is already installed and properly +configured in your environment, and that your team has the necessary expertise +to operate it. There are various ways to deploy Vault, and detailing them is +outside the scope of CloudNativePG. While it's possible to run Vault inside +Kubernetes, it is more commonly deployed externally. For detailed instructions, +consult the HashiCorp Vault documentation.
+Continuing from the previous example, we will now create the necessary
+SecretStore and PushSecret resources to complete the integration with
+Vault.
SecretStoreIn this example, we assume that HashiCorp Vault is accessible from within the
+namespace at http://vault.vault.svc:8200, and that a Kubernetes Secret
+named vault-token exists in the same namespace, containing the token used to
+authenticate with Vault.
apiVersion: external-secrets.io/v1
+kind: SecretStore
+metadata:
+ name: vault-backend
+spec:
+ provider:
+ vault:
+ server: "http://vault.vault.svc:8200"
+ path: "secrets"
+ # Specifies the Vault KV secret engine version ("v1" or "v2").
+ # Defaults to "v2" if not set.
+ version: "v2"
+ auth:
+ # References a Kubernetes Secret that contains the Vault token.
+ # See: https://www.vaultproject.io/docs/auth/token
+ tokenSecretRef:
+ name: "vault-token"
+ key: "token"
+---
+apiVersion: v1
+kind: Secret
+metadata:
+ name: vault-token
+data:
+ token: aHZzLioqKioqKio= # hvs.*******
+
+This configuration creates a SecretStore resource named vault-backend.
Important
+This example uses basic token-based authentication, which is suitable for +testing API, and CLI use cases. While it is the default method enabled in +Vault, it is not recommended for production environments. For production, +consider using more secure authentication methods. +Refer to the External Secrets Operator documentation +for a full list of supported authentication mechanisms.
+Info
+HashiCorp Vault must have a KV secrets engine enabled at the secrets path
+with version v2. If your Vault instance uses a different path or
+version, be sure to update the path and version fields accordingly.
PushSecretThe PushSecret resource is used to push a Kubernetes Secret to HashiCorp
+Vault. In this simplified example, we'll push the credentials for the app
+user of the sample cluster cluster-example.
For more details on configuring PushSecret, refer to the
+External Secrets Operator documentation.
apiVersion: external-secrets.io/v1alpha1
+kind: PushSecret
+metadata:
+ name: pushsecret-example
+spec:
+ deletionPolicy: Delete
+ refreshInterval: 24h
+ secretStoreRefs:
+ - name: vault-backend
+ kind: SecretStore
+ selector:
+ secret:
+ name: cluster-example-app
+ data:
+ - match:
+ remoteRef:
+ remoteKey: cluster-example-app
+
+In this example, the PushSecret resource instructs the External Secrets
+Operator to push the Kubernetes Secret named cluster-example-app to
+HashiCorp Vault (from the previous example). The remoteKey defines the name
+under which the secret will be stored in Vault, using the SecretStore named
+vault-backend.
To verify that the PushSecret is functioning correctly, navigate to the
+HashiCorp Vault UI. In the kv secrets engine at the path secrets, you
+should find a secret named cluster-example-app, corresponding to the
+remoteKey defined above.
The CloudNativePG Interface (CNPG-I) +is a standard way to extend and customize CloudNativePG without modifying its +core codebase.
+CloudNativePG supports a wide range of use cases, but sometimes its built-in +functionality isn’t enough, or adding certain features directly to the main +project isn’t practical.
+Before CNPG-I, users had two main options:
+Both approaches created maintenance overhead, slowed upgrades, and delayed delivery of critical features.
+CNPG-I solves these problems by providing a stable, gRPC-based integration +point for extending CloudNativePG at key points in a cluster’s lifecycle —such +as backups, recovery, and sub-resource reconciliation— without disrupting the +core project.
+CNPG-I can extend:
+CNPG-I is inspired by the Kubernetes +Container Storage Interface (CSI). +The operator communicates with registered plugins using gRPC, following the +CNPG-I protocol.
+CloudNativePG discovers plugins at startup. You can register them in one of two ways:
+In both cases, the plugin must be packaged as a container image.
+When running as a sidecar, the plugin must expose its gRPC server via a Unix
+domain socket. This socket must be placed in a directory shared with the
+operator container, mounted at the path set in PLUGIN_SOCKET_DIR (default:
+/plugin).
Example:
+apiVersion: apps/v1
+kind: Deployment
+metadata:
+ name: controller-manager
+spec:
+ template:
+ spec:
+ containers:
+ - image: cloudnative-pg:latest
+ [...]
+ name: manager
+ volumeMounts:
+ - mountPath: /plugins
+ name: cnpg-i-plugins
+
+ - image: cnpg-i-plugin-example:latest
+ name: cnpg-i-plugin-example
+ volumeMounts:
+ - mountPath: /plugins
+ name: cnpg-i-plugins
+ volumes:
+ - name: cnpg-i-plugins
+ emptyDir: {}
+
+Running a plugin as its own Deployment decouples its lifecycle from the +operator’s and allows independent scaling. In this setup, the plugin exposes a +TCP gRPC endpoint behind a Service, with mTLS for secure communication.
+Warning
+CloudNativePG does not discover plugins dynamically. If you deploy a new +plugin, you must restart the operator to detect it.
+Example Deployment:
+apiVersion: apps/v1
+kind: Deployment
+metadata:
+ name: cnpg-i-plugin-example
+spec:
+ template:
+ [...]
+ spec:
+ containers:
+ - name: cnpg-i-plugin-example
+ image: cnpg-i-plugin-example:latest
+ ports:
+ - containerPort: 9090
+ protocol: TCP
+
+The related Service for the plugin must include:
+cnpg.io/plugin: <plugin-name> — required for CloudNativePG to
+ discover the plugincnpg.io/pluginPort: <port> — specifies the port where the
+ plugin’s gRPC server is exposedExample Service:
+apiVersion: v1
+kind: Service
+metadata:
+ annotations:
+ cnpg.io/pluginPort: "9090"
+ labels:
+ cnpg.io/pluginName: cnpg-i-plugin-example.my-org.io
+ name: cnpg-i-plugin-example
+spec:
+ ports:
+ - port: 9090
+ protocol: TCP
+ targetPort: 9090
+ selector:
+ app: cnpg-i-plugin-example
+
+When a plugin runs as a Deployment, communication with CloudNativePG happens
+over the network. To secure it, mTLS is enforced, requiring TLS
+certificates for both sides.
Certificates must be stored as Kubernetes TLS Secrets
+and referenced in the plugin’s Service annotations
+(cnpg.io/pluginClientSecret and cnpg.io/pluginServerSecret):
apiVersion: v1
+kind: Service
+metadata:
+ annotations:
+ cnpg.io/pluginClientSecret: cnpg-i-plugin-example-client-tls
+ cnpg.io/pluginServerSecret: cnpg-i-plugin-example-server-tls
+ cnpg.io/pluginPort: "9090"
+ name: barman-cloud
+ namespace: postgresql-operator-system
+spec:
+ [...]
+
+Note
+You can provide your own certificate bundles, but the recommended method is +to use Cert-manager.
+To enable a plugin, configure the .spec.plugins section in your Cluster
+resource. Refer to the CloudNativePG API Reference for the full
+PluginConfiguration
+specification.
Example:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-with-plugins
+spec:
+ instances: 1
+ storage:
+ size: 1Gi
+ plugins:
+ - name: cnpg-i-plugin-example.my-org.io
+ enabled: true
+ parameters:
+ key1: value1
+ key2: value2
+
+Each plugin may have its own parameters—check the plugin’s documentation for
+details. The name field in spec.plugins depends on how the plugin is
+deployed:
cnpg.io/pluginName labelThe CNPG-I protocol has quickly become a proven and reliable pattern for +extending CloudNativePG while keeping the core project maintainable. +Over time, the community has built and shared plugins that address real-world +needs and serve as examples for developers.
+For a complete and up-to-date list of plugins built with CNPG-I, please refer to the +CNPG-I GitHub page.
+ +CloudNativePG provides native support for connection pooling with
+PgBouncer, one of the most popular open source
+connection poolers for PostgreSQL, through the Pooler custom resource definition (CRD).
In brief, a pooler in CloudNativePG is a deployment of PgBouncer pods that sits
+between your applications and a PostgreSQL service, for example, the rw
+service. It creates a separate, scalable, configurable, and highly available
+database access layer.
Warning
+CloudNativePG requires the auth_dbname feature in PgBouncer.
+Make sure to use a PgBouncer container image version 1.19 or higher.
The following diagram highlights how introducing a database access layer based +on PgBouncer changes the architecture of CloudNativePG. Instead of directly +connecting to the PostgreSQL primary service, applications can connect to the +equivalent service for PgBouncer. This ability enables reuse of existing +connections for faster performance and better resource management on the +PostgreSQL side.
+
This example helps to show how CloudNativePG implements a PgBouncer +pooler:
+apiVersion: postgresql.cnpg.io/v1
+kind: Pooler
+metadata:
+ name: pooler-example-rw
+spec:
+ cluster:
+ name: cluster-example
+
+ instances: 3
+ type: rw
+ pgbouncer:
+ poolMode: session
+ parameters:
+ max_client_conn: "1000"
+ default_pool_size: "10"
+
+Important
+The pooler name can't be the same as any cluster name in the same namespace.
+This example creates a Pooler resource called pooler-example-rw
+that's strictly associated with the Postgres Cluster resource called
+cluster-example. It points to the primary, identified by the read/write
+service (rw, therefore cluster-example-rw).
The Pooler resource must live in the same namespace as the Postgres cluster.
+It consists of a Kubernetes deployment of 3 pods running the
+latest stable image of PgBouncer,
+configured with the session pooling mode
+and accepting up to 1000 connections each. The default pool size is 10
+user/database pairs toward PostgreSQL.
Important
+The Pooler resource sets only the * fallback database in PgBouncer. This setting means that
+that all parameters in the connection strings passed from the client are
+relayed to the PostgreSQL server. For details, see "Section [databases]"
+in the PgBouncer documentation.
CloudNativePG also creates a secret with the same name as the pooler containing +the configuration files used with PgBouncer.
+API reference
+For details, see PgBouncerSpec
+in the API reference.
Pooler resources are not managed automatically by the operator. You create
+them manually when needed, and you can deploy multiple poolers for the same
+PostgreSQL cluster.
The key point to understand is that the lifecycles of the Cluster and
+Pooler resources are independent. Deleting a cluster does not automatically
+remove its poolers, and deleting a pooler does not affect the cluster.
Info
+Once you are familiar with how poolers work, you have complete flexibility +in designing your architecture. You can run clusters without poolers, clusters +with a single pooler, or clusters with multiple poolers (for example, one per +application).
+Important
+When the operator itself is upgraded, pooler pods will also undergo a +rolling upgrade. This ensures that the instance manager inside the pooler +pods is upgraded consistently.
+Any PgBouncer pooler is transparently integrated with CloudNativePG support for +in-transit encryption by way of TLS connections, both on the client +(application) and server (PostgreSQL) side of the pool.
+Containers run as the pgbouncer system user, and access to the pgbouncer
+administration database is allowed only by way of local connections, through
+peer authentication.
By default, a PgBouncer pooler reuses the same certificates as the PostgreSQL
+cluster. It relies on TLS client certificate authentication to connect to the
+PostgreSQL server and run the auth_query used for client password
+authentication (see "Authentication").
Supplying your own secrets disables the built-in integration. From that point, +you gain complete control (and responsibility) for managing authentication. +Supported secret formats are:
+For Opaque secrets, the Pooler resource expects the following keys:
tls.crttls.keyIn practice, this means you can treat an Opaque secret as a TLS secret, +starting from the same structure.
+By default, CloudNativePG natively supports password-based authentication for +PgBouncer clients connecting to the PostgreSQL database.
+This built-in mechanism leverages PgBouncer’s auth_dbname (introduced in
+version 1.19), together with the auth_user and auth_query options.
Important
+If you provide your own certificate secrets, the built-in integration is +disabled. In that case, you are fully responsible for configuring and +managing PgBouncer authentication.
+The built-in integration performs the following tasks:
+cnpg_pooler_pgbouncer in the PostgreSQL
+ serverpostgres database and grants execution
+ privileges to cnpg_pooler_pgbouncer (following PoLA principles)cnpg_pooler_pgbouncer as the auth_user and
+ postgres as the auth_dbnamecnpg_pooler_pgbouncer against
+ PostgreSQL using the issued TLS certificateAs part of the built-in integration, CloudNativePG automatically executes a set
+of SQL statements during reconciliation. These statements are run by the
+instance manager using the postgres user against the postgres database.
Role creation:
+CREATE ROLE cnpg_pooler_pgbouncer WITH LOGIN;
+
+Grant access to the postgres database:
GRANT CONNECT ON DATABASE postgres TO cnpg_pooler_pgbouncer;
+
+Create the lookup function for password verification. This function is created
+in the postgres database with SECURITY DEFINER privileges and is used by
+PgBouncer’s auth_query option:
CREATE OR REPLACE FUNCTION public.user_search(uname TEXT)
+ RETURNS TABLE (usename name, passwd text)
+ LANGUAGE sql SECURITY DEFINER AS
+ 'SELECT usename, passwd FROM pg_catalog.pg_shadow WHERE usename=$1;';
+
+Restrict and grant permissions on the lookup function:
+REVOKE ALL ON FUNCTION public.user_search(text)
+ FROM public;
+
+GRANT EXECUTE ON FUNCTION public.user_search(text)
+ TO cnpg_pooler_pgbouncer;
+
+Providing your own certificate secrets disables the built-in integration.
+This gives you the flexibility — and responsibility — to manage the +authentication process yourself. You can follow the instructions above to +replicate similar behavior to the default setup.
+You can take advantage of pod templates specification in the template
+section of a Pooler resource. For details, see
+PoolerSpec in the API reference.
Using templates, you can configure pods as you like, including fine control
+over affinity and anti-affinity rules for pods and nodes. By default,
+containers use images from ghcr.io/cloudnative-pg/pgbouncer.
This example shows Pooler specifying `PodAntiAffinity``:
apiVersion: postgresql.cnpg.io/v1
+kind: Pooler
+metadata:
+ name: pooler-example-rw
+spec:
+ cluster:
+ name: cluster-example
+ instances: 3
+ type: rw
+
+ template:
+ metadata:
+ labels:
+ app: pooler
+ spec:
+ containers: []
+ affinity:
+ podAntiAffinity:
+ requiredDuringSchedulingIgnoredDuringExecution:
+ - labelSelector:
+ matchExpressions:
+ - key: app
+ operator: In
+ values:
+ - pooler
+ topologyKey: "kubernetes.io/hostname"
+
+Note
+Explicitly set .spec.template.spec.containers to [] when not modified,
+as it's a required field for a PodSpec. If .spec.template.spec.containers
+isn't set, the Kubernetes api-server returns the following error when trying to
+apply the manifest:error validating "pooler.yaml": error validating data:
+ValidationError(Pooler.spec.template.spec): missing required field
+"containers"
This example sets resources and changes the used image:
+apiVersion: postgresql.cnpg.io/v1
+kind: Pooler
+metadata:
+ name: pooler-example-rw
+spec:
+ cluster:
+ name: cluster-example
+ instances: 3
+ type: rw
+
+ template:
+ metadata:
+ labels:
+ app: pooler
+ spec:
+ containers:
+ - name: pgbouncer
+ image: my-pgbouncer:latest
+ resources:
+ requests:
+ cpu: "0.1"
+ memory: 100Mi
+ limits:
+ cpu: "0.5"
+ memory: 500Mi
+
+Sometimes, your pooler will require some different labels, annotations, or even change
+the type of the service, you can achieve that by using the serviceTemplate field:
apiVersion: postgresql.cnpg.io/v1
+kind: Pooler
+metadata:
+ name: pooler-example-rw
+spec:
+ cluster:
+ name: cluster-example
+ instances: 3
+ type: rw
+ serviceTemplate:
+ metadata:
+ labels:
+ app: pooler
+ spec:
+ type: LoadBalancer
+ pgbouncer:
+ poolMode: session
+ parameters:
+ max_client_conn: "1000"
+ default_pool_size: "10"
+
+The operator by default adds a ServicePort with the following data:
ports:
+ - name: pgbouncer
+ port: 5432
+ protocol: TCP
+ targetPort: pgbouncer
+
+Warning
+Specifying a ServicePort with the name pgbouncer or the port 5432 will prevent the default ServicePort from being added.
+This because ServicePort entries with the same name or port are not allowed on Kubernetes and result in errors.
Because of Kubernetes' deployments, you can configure your pooler to run on a
+single instance or over multiple pods. The exposed service makes sure that your
+clients are randomly distributed over the available pods running PgBouncer,
+which then manages and reuses connections toward the underlying server (if
+using the rw service) or servers (if using the ro service with multiple
+replicas).
Warning
+If your infrastructure spans multiple availability zones with high latency +across them, be aware of network hops. Consider, for example, the case of your +application running in zone 2, connecting to PgBouncer running in zone 3, and +pointing to the PostgreSQL primary in zone 1.
+The operator manages most of the configuration options for PgBouncer, +allowing you to modify only a subset of them.
+Warning
+You are responsible for correctly setting the value of each option, as the +operator doesn't validate them.
+These are the PgBouncer options you can customize, with links to the PgBouncer +documentation for each parameter. Unless stated otherwise, the default values +are the ones directly set by PgBouncer.
+auth_typeapplication_name_add_hostautodb_idle_timeoutcancel_wait_timeoutclient_idle_timeoutclient_login_timeoutclient_tls_sslmodedefault_pool_sizedisable_pqexecdns_max_ttldns_nxdomain_ttlidle_transaction_timeoutignore_startup_parameters:
+ to be appended to extra_float_digits,options - required by CloudNativePGlisten_backloglog_connectionslog_disconnectionslog_pooler_errorslog_stats: by default
+ disabled (0), given that statistics are already collected by the Prometheus
+ export as described in the "Monitoring" section belowmax_client_connmax_db_connectionsmax_packet_sizemax_prepared_statementsmax_user_connectionsmin_pool_sizepkt_bufquery_timeoutquery_wait_timeoutreserve_pool_sizereserve_pool_timeoutsbuf_loopcntserver_check_delayserver_check_queryserver_connect_timeoutserver_fast_closeserver_idle_timeoutserver_lifetimeserver_login_retryserver_reset_queryserver_reset_query_alwaysserver_round_robinserver_tls_ciphersserver_tls_protocolsserver_tls_sslmodestats_periodsuspend_timeouttcp_defer_accepttcp_keepalivetcp_keepcnttcp_keepidletcp_keepintvltcp_user_timeouttcp_socket_buffertrack_extra_parametersverboseCustomizations of the PgBouncer configuration are written declaratively in the
+.spec.pgbouncer.parameters map.
The operator reacts to the changes in the pooler specification, and every +PgBouncer instance reloads the updated configuration without disrupting the +service.
+Warning
+Every PgBouncer pod has the same configuration, aligned +with the parameters in the specification. A mistake in these +parameters might disrupt the operability of the whole pooler. +The operator doesn't validate the value of any option.
+The PgBouncer implementation of the Pooler comes with a default
+Prometheus exporter. It makes available several
+metrics having the cnpg_pgbouncer_ prefix by running:
SHOW LISTS (prefix: cnpg_pgbouncer_lists)SHOW POOLS (prefix: cnpg_pgbouncer_pools)SHOW STATS (prefix: cnpg_pgbouncer_stats)Like the CloudNativePG instance, the exporter runs on port
+9127 of each pod running PgBouncer and also provides metrics related to the
+Go runtime (with the prefix go_*).
Info
+You can inspect the exported metrics on a pod running PgBouncer. For instructions, see
+How to inspect the exported metrics.
+Make sure that you use the correct IP and the 9127 port.
This example shows the output for cnpg_pgbouncer metrics:
# HELP cnpg_pgbouncer_collection_duration_seconds Collection time duration in seconds
+# TYPE cnpg_pgbouncer_collection_duration_seconds gauge
+cnpg_pgbouncer_collection_duration_seconds{collector="Collect.up"} 0.002338805
+# HELP cnpg_pgbouncer_collection_errors_total Total errors occurred accessing PostgreSQL for metrics.
+# TYPE cnpg_pgbouncer_collection_errors_total counter
+cnpg_pgbouncer_collection_errors_total{collector="sql: Scan error on column index 16, name \"load_balance_hosts\": converting NULL to int is unsupported"} 5
+# HELP cnpg_pgbouncer_collections_total Total number of times PostgreSQL was accessed for metrics.
+# TYPE cnpg_pgbouncer_collections_total counter
+cnpg_pgbouncer_collections_total 5
+# HELP cnpg_pgbouncer_last_collection_error 1 if the last collection ended with error, 0 otherwise.
+# TYPE cnpg_pgbouncer_last_collection_error gauge
+cnpg_pgbouncer_last_collection_error 0
+# HELP cnpg_pgbouncer_lists_databases Count of databases.
+# TYPE cnpg_pgbouncer_lists_databases gauge
+cnpg_pgbouncer_lists_databases 1
+# HELP cnpg_pgbouncer_lists_dns_names Count of DNS names in the cache.
+# TYPE cnpg_pgbouncer_lists_dns_names gauge
+cnpg_pgbouncer_lists_dns_names 0
+# HELP cnpg_pgbouncer_lists_dns_pending Not used.
+# TYPE cnpg_pgbouncer_lists_dns_pending gauge
+cnpg_pgbouncer_lists_dns_pending 0
+# HELP cnpg_pgbouncer_lists_dns_queries Count of in-flight DNS queries.
+# TYPE cnpg_pgbouncer_lists_dns_queries gauge
+cnpg_pgbouncer_lists_dns_queries 0
+# HELP cnpg_pgbouncer_lists_dns_zones Count of DNS zones in the cache.
+# TYPE cnpg_pgbouncer_lists_dns_zones gauge
+cnpg_pgbouncer_lists_dns_zones 0
+# HELP cnpg_pgbouncer_lists_free_clients Count of free clients.
+# TYPE cnpg_pgbouncer_lists_free_clients gauge
+cnpg_pgbouncer_lists_free_clients 49
+# HELP cnpg_pgbouncer_lists_free_servers Count of free servers.
+# TYPE cnpg_pgbouncer_lists_free_servers gauge
+cnpg_pgbouncer_lists_free_servers 0
+# HELP cnpg_pgbouncer_lists_login_clients Count of clients in login state.
+# TYPE cnpg_pgbouncer_lists_login_clients gauge
+cnpg_pgbouncer_lists_login_clients 0
+# HELP cnpg_pgbouncer_lists_pools Count of pools.
+# TYPE cnpg_pgbouncer_lists_pools gauge
+cnpg_pgbouncer_lists_pools 1
+# HELP cnpg_pgbouncer_lists_used_clients Count of used clients.
+# TYPE cnpg_pgbouncer_lists_used_clients gauge
+cnpg_pgbouncer_lists_used_clients 1
+# HELP cnpg_pgbouncer_lists_used_servers Count of used servers.
+# TYPE cnpg_pgbouncer_lists_used_servers gauge
+cnpg_pgbouncer_lists_used_servers 0
+# HELP cnpg_pgbouncer_lists_users Count of users.
+# TYPE cnpg_pgbouncer_lists_users gauge
+cnpg_pgbouncer_lists_users 2
+# HELP cnpg_pgbouncer_pools_cl_active Client connections that are linked to server connection and can process queries.
+# TYPE cnpg_pgbouncer_pools_cl_active gauge
+cnpg_pgbouncer_pools_cl_active{database="pgbouncer",user="pgbouncer"} 1
+# HELP cnpg_pgbouncer_pools_cl_active_cancel_req Client connections that have forwarded query cancellations to the server and are waiting for the server response.
+# TYPE cnpg_pgbouncer_pools_cl_active_cancel_req gauge
+cnpg_pgbouncer_pools_cl_active_cancel_req{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_cl_cancel_req Client connections that have not forwarded query cancellations to the server yet.
+# TYPE cnpg_pgbouncer_pools_cl_cancel_req gauge
+cnpg_pgbouncer_pools_cl_cancel_req{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_cl_waiting Client connections that have sent queries but have not yet got a server connection.
+# TYPE cnpg_pgbouncer_pools_cl_waiting gauge
+cnpg_pgbouncer_pools_cl_waiting{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_cl_waiting_cancel_req Client connections that have not forwarded query cancellations to the server yet.
+# TYPE cnpg_pgbouncer_pools_cl_waiting_cancel_req gauge
+cnpg_pgbouncer_pools_cl_waiting_cancel_req{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_load_balance_hosts Number of hosts not load balancing between hosts
+# TYPE cnpg_pgbouncer_pools_load_balance_hosts gauge
+cnpg_pgbouncer_pools_load_balance_hosts{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_maxwait How long the first (oldest) client in the queue has waited, in seconds. If this starts increasing, then the current pool of servers does not handle requests quickly enough. The reason may be either an overloaded server or just too small of a pool_size setting.
+# TYPE cnpg_pgbouncer_pools_maxwait gauge
+cnpg_pgbouncer_pools_maxwait{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_maxwait_us Microsecond part of the maximum waiting time.
+# TYPE cnpg_pgbouncer_pools_maxwait_us gauge
+cnpg_pgbouncer_pools_maxwait_us{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_pool_mode The pooling mode in use. 1 for session, 2 for transaction, 3 for statement, -1 if unknown
+# TYPE cnpg_pgbouncer_pools_pool_mode gauge
+cnpg_pgbouncer_pools_pool_mode{database="pgbouncer",user="pgbouncer"} 3
+# HELP cnpg_pgbouncer_pools_sv_active Server connections that are linked to a client.
+# TYPE cnpg_pgbouncer_pools_sv_active gauge
+cnpg_pgbouncer_pools_sv_active{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_sv_active_cancel Server connections that are currently forwarding a cancel request
+# TYPE cnpg_pgbouncer_pools_sv_active_cancel gauge
+cnpg_pgbouncer_pools_sv_active_cancel{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_sv_idle Server connections that are unused and immediately usable for client queries.
+# TYPE cnpg_pgbouncer_pools_sv_idle gauge
+cnpg_pgbouncer_pools_sv_idle{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_sv_login Server connections currently in the process of logging in.
+# TYPE cnpg_pgbouncer_pools_sv_login gauge
+cnpg_pgbouncer_pools_sv_login{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_sv_tested Server connections that are currently running either server_reset_query or server_check_query.
+# TYPE cnpg_pgbouncer_pools_sv_tested gauge
+cnpg_pgbouncer_pools_sv_tested{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_sv_used Server connections that have been idle for more than server_check_delay, so they need server_check_query to run on them before they can be used again.
+# TYPE cnpg_pgbouncer_pools_sv_used gauge
+cnpg_pgbouncer_pools_sv_used{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_pools_sv_wait_cancels Servers that normally could become idle, but are waiting to do so until all in-flight cancel requests have completed that were sent to cancel a query on this server.
+# TYPE cnpg_pgbouncer_pools_sv_wait_cancels gauge
+cnpg_pgbouncer_pools_sv_wait_cancels{database="pgbouncer",user="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_bind_count Average number of prepared statements readied for execution by clients and forwarded to PostgreSQL by pgbouncer.
+# TYPE cnpg_pgbouncer_stats_avg_bind_count gauge
+cnpg_pgbouncer_stats_avg_bind_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_client_parse_count Average number of prepared statements created by clients.
+# TYPE cnpg_pgbouncer_stats_avg_client_parse_count gauge
+cnpg_pgbouncer_stats_avg_client_parse_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_query_count Average queries per second in last stat period.
+# TYPE cnpg_pgbouncer_stats_avg_query_count gauge
+cnpg_pgbouncer_stats_avg_query_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_query_time Average query duration, in microseconds.
+# TYPE cnpg_pgbouncer_stats_avg_query_time gauge
+cnpg_pgbouncer_stats_avg_query_time{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_recv Average received (from clients) bytes per second.
+# TYPE cnpg_pgbouncer_stats_avg_recv gauge
+cnpg_pgbouncer_stats_avg_recv{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_sent Average sent (to clients) bytes per second.
+# TYPE cnpg_pgbouncer_stats_avg_sent gauge
+cnpg_pgbouncer_stats_avg_sent{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_server_parse_count Average number of prepared statements created by pgbouncer on a server.
+# TYPE cnpg_pgbouncer_stats_avg_server_parse_count gauge
+cnpg_pgbouncer_stats_avg_server_parse_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_wait_time Time spent by clients waiting for a server, in microseconds (average per second).
+# TYPE cnpg_pgbouncer_stats_avg_wait_time gauge
+cnpg_pgbouncer_stats_avg_wait_time{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_xact_count Average transactions per second in last stat period.
+# TYPE cnpg_pgbouncer_stats_avg_xact_count gauge
+cnpg_pgbouncer_stats_avg_xact_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_avg_xact_time Average transaction duration, in microseconds.
+# TYPE cnpg_pgbouncer_stats_avg_xact_time gauge
+cnpg_pgbouncer_stats_avg_xact_time{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_bind_count Total number of prepared statements readied for execution by clients and forwarded to PostgreSQL by pgbouncer
+# TYPE cnpg_pgbouncer_stats_total_bind_count gauge
+cnpg_pgbouncer_stats_total_bind_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_client_parse_count Total number of prepared statements created by clients.
+# TYPE cnpg_pgbouncer_stats_total_client_parse_count gauge
+cnpg_pgbouncer_stats_total_client_parse_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_query_count Total number of SQL queries pooled by pgbouncer.
+# TYPE cnpg_pgbouncer_stats_total_query_count gauge
+cnpg_pgbouncer_stats_total_query_count{database="pgbouncer"} 15
+# HELP cnpg_pgbouncer_stats_total_query_time Total number of microseconds spent by pgbouncer when actively connected to PostgreSQL, executing queries.
+# TYPE cnpg_pgbouncer_stats_total_query_time gauge
+cnpg_pgbouncer_stats_total_query_time{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_received Total volume in bytes of network traffic received by pgbouncer.
+# TYPE cnpg_pgbouncer_stats_total_received gauge
+cnpg_pgbouncer_stats_total_received{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_sent Total volume in bytes of network traffic sent by pgbouncer.
+# TYPE cnpg_pgbouncer_stats_total_sent gauge
+cnpg_pgbouncer_stats_total_sent{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_server_parse_count Total number of prepared statements created by pgbouncer on a server.
+# TYPE cnpg_pgbouncer_stats_total_server_parse_count gauge
+cnpg_pgbouncer_stats_total_server_parse_count{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_wait_time Time spent by clients waiting for a server, in microseconds.
+# TYPE cnpg_pgbouncer_stats_total_wait_time gauge
+cnpg_pgbouncer_stats_total_wait_time{database="pgbouncer"} 0
+# HELP cnpg_pgbouncer_stats_total_xact_count Total number of SQL transactions pooled by pgbouncer.
+# TYPE cnpg_pgbouncer_stats_total_xact_count gauge
+cnpg_pgbouncer_stats_total_xact_count{database="pgbouncer"} 15
+# HELP cnpg_pgbouncer_stats_total_xact_time Total number of microseconds spent by pgbouncer when connected to PostgreSQL in a transaction, either idle in transaction or executing queries.
+# TYPE cnpg_pgbouncer_stats_total_xact_time gauge
+cnpg_pgbouncer_stats_total_xact_time{database="pgbouncer"} 0
+
+Info
+For a better understanding of the metrics please refer to the PgBouncer documentation.
+As for clusters, a specific pooler can be monitored using the
+Prometheus operator's
+PodMonitor resource.
You can deploy a PodMonitor for a specific pooler using the following basic example, and change it as needed:
apiVersion: monitoring.coreos.com/v1
+kind: PodMonitor
+metadata:
+ name: <POOLER_NAME>
+spec:
+ selector:
+ matchLabels:
+ cnpg.io/poolerName: <POOLER_NAME>
+ podMetricsEndpoints:
+ - port: metrics
+
+PodMonitor CreationFeature Deprecation Notice
+The .spec.monitoring.enablePodMonitor field in the Pooler resource is
+now deprecated and will be removed in a future version of the operator.
If you are currently using this feature, we strongly recommend you either
+remove or set .spec.monitoring.enablePodMonitor to false and manually
+create a PodMonitor resource for your pooler as described above.
+This change ensures that you have complete ownership of your monitoring
+configuration, preventing it from being managed or overwritten by the operator.
Logs are directly sent to standard output, in JSON format, like in the +following example:
+{
+ "level": "info",
+ "ts": SECONDS.MICROSECONDS,
+ "msg": "record",
+ "pipe": "stderr",
+ "record": {
+ "timestamp": "YYYY-MM-DD HH:MM:SS.MS UTC",
+ "pid": "<PID>",
+ "level": "LOG",
+ "msg": "kernel file descriptor limit: 1048576 (hard: 1048576); max_client_conn: 100, max expected fd use: 112"
+ }
+}
+
+The Pooler specification allows you to take advantage of PgBouncer's PAUSE
+and RESUME commands, using only declarative configuration. You can ado this
+using the paused option, which by default is set to false. When set to
+true, the operator internally invokes the PAUSE command in PgBouncer,
+which:
When the paused option is reset to false, the operator invokes the
+RESUME command in PgBouncer, reopening the taps toward the PostgreSQL
+service defined in the Pooler resource.
PAUSE
+For more information, see
+PAUSE in the PgBouncer documentation.
Important
+In future versions, the switchover operation will be fully integrated
+with the PgBouncer pooler and take advantage of the PAUSE/RESUME
+features to reduce the perceived downtime by client applications.
+Currently, you can achieve the same results by setting the paused
+attribute to true, issuing the switchover command through the
+cnpg plugin, and then restoring the paused
+attribute to false.
The current implementation of the pooler is designed to work as part of a +specific CloudNativePG cluster (a service). It isn't currently possible to +create a pooler that spans multiple clusters.
+CloudNativePG transparently manages several configuration options that are used
+for the PgBouncer layer to communicate with PostgreSQL. Such options aren't
+configurable from outside and include TLS certificates, authentication
+settings, the databases section, and the users section. Also, considering
+the specific use case for the single PostgreSQL cluster, the adopted criteria
+is to explicitly list the options that can be configured by users.
Note
+The adopted solution likely addresses the majority of +use cases. It leaves room for the future implementation of a separate +operator for PgBouncer to complete the gamma with more advanced and customized +scenarios.
+The CloudNativePG operator for Kubernetes is designed to work with any +compatible PostgreSQL container image that meets the following requirements:
+initdbpostgrespg_ctlpg_controldatapg_basebackupOptional Components:
+du (used for kubectl cnpg status)Important
+Only PostgreSQL versions officially supported by PGDG are allowed.
+Info
+Barman Cloud executables are no longer required in CloudNativePG. The +recommended approach is to use the dedicated Barman Cloud Plugin.
+No entry point or command is required in the image definition. CloudNativePG +automatically overrides it with its instance manager.
+Warning
+CloudNativePG only supports Primary with multiple/optional Hot Standby +Servers architecture for PostgreSQL application container images.
+The CloudNativePG community provides and maintains +public PostgreSQL container images +that are fully compatible with CloudNativePG. These images are published on +ghcr.io.
+To ensure the operator makes informed decisions, it must accurately detect the +PostgreSQL major version. This detection can occur in two ways:
+major field of the imageCatalogRef, if defined.imageName if
+ not explicitly specified.For auto-detection to work, the image tag must adhere to a specific format. It +should commence with a valid PostgreSQL major version number (e.g., 15.6 or +16), optionally followed by a dot and the patch level.
+Following this, the tag can include any character combination valid and +accepted in a Docker tag, preceded by a dot, an underscore, or a minus sign.
+Examples of accepted image tags:
+12.113.3.2.1-113.41415.5-1016.0Warning
+latest is not considered a valid tag for the image.
Note
+Image tag requirements do not apply for images defined in a catalog.
+Kubernetes uses the +Controller pattern +to align the current cluster state with the desired one.
+Stateful applications are usually managed with the
+StatefulSet
+controller, which creates and reconciles a set of Pods built from the same
+specification, and assigns them a sticky identity.
CloudNativePG implements its own custom controller to manage PostgreSQL
+instances, instead of relying on the StatefulSet controller.
+While bringing more complexity to the implementation, this design choice
+provides the operator with more flexibility on how we manage the cluster,
+while being transparent on the topology of PostgreSQL clusters.
Like many choices in the design realm, different ones lead to other +compromises. The following sections discuss a few points where we believe +this design choice has made the implementation of CloudNativePG +more reliable, and easier to understand.
+This is a well known limitation of StatefulSet: it does not support resizing
+PVCs. This is inconvenient for a database. Resizing volumes requires
+convoluted workarounds.
In contrast, CloudNativePG leverages the configured storage class to +manage the underlying PVCs directly, and can handle PVC resizing if +the storage class supports it.
+The StatefulSet controller is designed to create a set of Pods
+from just one template. Given that we use one Pod per PostgreSQL instance,
+we have two kinds of Pods:
This difference is relevant when deciding the correct deployment strategy to +execute for a given operation.
+Some operations should be performed on the replicas first,
+and then on the primary, but only after an updated replica is promoted
+as the new primary.
+For example, when you want to apply a different PostgreSQL image version,
+or when you increase configuration parameters like max_connections (which are
+treated specially by PostgreSQL because CloudNativePG uses hot standby
+replicas).
While doing that, CloudNativePG considers the PostgreSQL instance's +role - and not just its serial number.
+Sometimes the operator needs to follow the opposite process: work on the
+primary first and then on the replicas. For example, when you
+lower max_connections. In that case, CloudNativePG will:
The StatefulSet controller, being application-independent, can't
+incorporate this behavior, which is specific to PostgreSQL's native
+replication technology.
PostgreSQL instances can be configured to work with multiple PVCs: this is how
+WAL storage can be separated from PGDATA.
The two data stores need to be coherent from the PostgreSQL point of view,
+as they're used simultaneously. If you delete the PVC corresponding to
+the WAL storage of an instance, the PVC where PGDATA is stored will not be
+usable anymore.
This behavior is specific to PostgreSQL and is not implemented in the
+StatefulSet controller - the latter not being application specific.
After the user dropped a PVC, a StatefulSet would just recreate it, leading
+to a corrupted PostgreSQL instance.
CloudNativePG would instead classify the remaining PVC as unusable, and +start creating a new pair of PVCs for another instance to join the cluster +correctly.
+Sometimes you need to take down a Kubernetes node to do an upgrade. +After the upgrade, depending on your upgrade strategy, the updated node +could go up again, or a new node could replace it.
+Supposing the unavailable node was hosting a PostgreSQL instance, +depending on your database size and your cloud infrastructure, you +may prefer to choose one of the following actions:
+drop the PVC and the Pod residing on the downed node; + create a new PVC cloning the data from another PVC; + after that, schedule a Pod for it
+drop the Pod, schedule the Pod in a different node, and mount + the PVC from there
+leave the Pod and the PVC as they are, and wait for the node to + be back up.
+The first solution is practical when your database size permits, allowing +you to immediately bring back the desired number of replicas.
+The second solution is only feasible when you're not using the storage of the +local node, and re-mounting the PVC in another host is possible in a reasonable +amount of time (which only you and your organization know).
+The third solution is appropriate when the database is big and uses local +node storage for maximum performance and data durability.
+The CloudNativePG controller implements all these strategies so that the +user can select the preferred behavior at the cluster level (read the +"Kubernetes upgrade" section for details).
+Being generic, the StatefulSet doesn't allow this level of
+customization.
This section describes how to import one or more existing PostgreSQL +databases inside a brand new CloudNativePG cluster.
+The import operation is based on the concept of online logical backups in PostgreSQL,
+and relies on pg_dump via a network connection to the origin host, and pg_restore.
+Thanks to native Multi-Version Concurrency Control (MVCC) and snapshots,
+PostgreSQL enables taking consistent backups over the network, in a concurrent
+manner, without stopping any write activity.
Logical backups are also the most common, flexible and reliable technique to +perform major upgrades of PostgreSQL versions.
+As a result, the instructions in this section are suitable for both:
+Warning
+When performing major upgrades of PostgreSQL you are responsible for making +sure that applications are compatible with the new version and that the +upgrade path of the objects contained in the database (including extensions) is +feasible.
+In both cases, the operation is performed on a consistent snapshot of the +origin database.
+Important
+For this reason we suggest to stop write operations on the source before
+the final import in the Cluster resource, as changes done to the source
+database after the start of the backup will not be in the destination cluster -
+hence why this feature is referred to as "offline import" or "offline major
+upgrade".
Conceptually, the import requires you to create a new cluster from scratch
+(destination cluster), using the initdb bootstrap method,
+and then complete the initdb.import subsection to import objects from an
+existing Postgres cluster (source cluster). As per PostgreSQL recommendation,
+we suggest that the PostgreSQL major version of the destination cluster is
+greater or equal than the one of the source cluster.
CloudNativePG provides two main ways to import objects from the source cluster +into the destination cluster:
+microservice approach: the destination cluster is designed to host a + single application database owned by the specified application user, as + recommended by the CloudNativePG project
+monolith approach: the destination cluster is designed to host multiple + databases and different users, imported from the source cluster
+The first import method is available via the microservice type, the
+second via the monolith type.
Warning
+It is your responsibility to ensure that the destination cluster can
+access the source cluster with a superuser or a user having enough
+privileges to take a logical backup with pg_dump. Please refer to the
+PostgreSQL documentation on pg_dump
+for further information.
microservice typeWith the microservice approach, you can specify a single database you want to +import from the source cluster into the destination cluster. The operation is +performed in 4 steps:
+initdb bootstrap of the new clusterinitdb.import.databases) using
+ pg_dump -Fdpg_restore --no-acl --no-owner into the
+ initdb.database (application database) owned by the initdb.owner userpostImportApplicationSQL parameterANALYZE VERBOSE on the imported databaseIn the figure below, a single PostgreSQL cluster containing N databases is +imported into separate CloudNativePG clusters, with each cluster using a +microservice import for one of the N source databases.
+
For example, the YAML below creates a new 3 instance PostgreSQL cluster (latest
+available major version at the time the operator was released) called
+cluster-microservice that imports the angus database from the
+cluster-pg96 cluster (with the unsupported PostgreSQL 9.6), by connecting to
+the postgres database using the postgres user, via the password stored in
+the cluster-pg96-superuser secret.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-microservice
+spec:
+ instances: 3
+
+ bootstrap:
+ initdb:
+ import:
+ type: microservice
+ databases:
+ - angus
+ source:
+ externalCluster: cluster-pg96
+ #postImportApplicationSQL:
+ #- |
+ # INSERT YOUR SQL QUERIES HERE
+ storage:
+ size: 1Gi
+ externalClusters:
+ - name: cluster-pg96
+ connectionParameters:
+ # Use the correct IP or host name for the source database
+ host: pg96.local
+ user: postgres
+ dbname: postgres
+ password:
+ name: cluster-pg96-superuser
+ key: password
+
+Warning
+The example above deliberately uses a source database running a version of
+PostgreSQL that is not supported anymore by the Community, and consequently by
+CloudNativePG.
+Data export from the source instance is performed using the version of
+pg_dump in the destination cluster, which must be a supported one, and
+equal or greater than the source one.
+Based on our experience, this way of exporting data should work on older
+and unsupported versions of Postgres too, giving you the chance to move your
+legacy data to a better system, inside Kubernetes.
+This is the main reason why we used 9.6 in the examples of this section.
+We'd be interested to hear from you, should you experience any issues in this area.
There are a few things you need to be aware of when using the microservice type:
externalCluster that points to an existing PostgreSQL
+ instance containing the data to import (for more information, please refer to
+ "The externalClusters section")externalCluster during the operationpg_dump and read roles information (superuser is OK)pg_dump -Fd result is stored temporarily inside the dumps
+ folder in the PGDATA volume, so there should be enough available space to
+ temporarily contain the dump result on the assigned node, as well as the
+ restored data and indexes. Once the import operation is completed, this
+ folder is automatically deleted by the operator.initdb.import.databases arrayinitdb.import.rolesHint
+The microservice approach adheres to CloudNativePG conventions and defaults
+for the destination cluster. If you do not set initdb.database or
+initdb.owner for the destination cluster, both parameters will default to
+app.
monolith typeWith the monolith approach, you can specify a set of roles and databases you +want to import from the source cluster into the destination cluster. +The operation is performed in the following steps:
+initdb bootstrap of the new clusterinitdb.import.databases), one at a time,
+ using pg_dump -Fdpg_restoreANALYZE on each imported database
For example, the YAML below creates a new 3 instance PostgreSQL cluster (latest
+available major version at the time the operator was released) called
+cluster-monolith that imports the accountant and the bank_user roles,
+as well as the accounting, banking, resort databases from the
+cluster-pg96 cluster (with the unsupported PostgreSQL 9.6), by connecting to
+the postgres database using the postgres user, via the password stored in
+the cluster-pg96-superuser secret.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-monolith
+spec:
+ instances: 3
+ bootstrap:
+ initdb:
+ import:
+ type: monolith
+ databases:
+ - accounting
+ - banking
+ - resort
+ roles:
+ - accountant
+ - bank_user
+ source:
+ externalCluster: cluster-pg96
+ storage:
+ size: 1Gi
+ externalClusters:
+ - name: cluster-pg96
+ connectionParameters:
+ # Use the correct IP or host name for the source database
+ host: pg96.local
+ user: postgres
+ dbname: postgres
+ sslmode: require
+ password:
+ name: cluster-pg96-superuser
+ key: password
+
+There are a few things you need to be aware of when using the monolith type:
externalCluster that points to an existing PostgreSQL
+ instance containing the data to import (for more information, please refer to
+ "The externalClusters section")externalCluster during the operationpg_dump and retrieve roles information (superuser is
+ OK)pg_dump -Fd result is stored temporarily inside the dumps
+ folder in the PGDATA volume of the destination cluster's instances, so
+ there should be enough available space to
+ temporarily contain the dump result on the assigned node, as well as the
+ restored data and indexes. Once the import operation is completed, this
+ folder is automatically deleted by the operator.initdb.import.databases arrayinitdb.import.roles, with the limitations below:postgres, streaming_replica, cnpg_pooler_pgbouncerSUPERUSER option is removed from any imported role"*" can be used as the only element in the databases and/or
+ roles arrays to import every object of the kind; When matching databases
+ the wildcard will ignore the postgres database, template databases
+ and those databases not allowing connectionsANALYZE VERBOSE is executed for every
+ database.postImportApplicationSQL field is not supportedHint
+The databases and their owners are preserved exactly as they exist in the
+source cluster—no app database or user will be created during import. If your
+bootstrap.initdb stanza specifies custom database and owner values that
+do not match any of the databases or users being imported, the instance
+manager will create a new, empty application database and owner role with those
+specified names, while leaving the imported databases and owners unchanged.
There is nothing to stop you from using the monolith approach to import a
+single database. It is interesting to see how the results of doing so would
+differ from using the microservice approach.
Given a source cluster, for example the following, with a database named
+mydb owned by role me:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 1
+
+ postgresql:
+ pg_hba:
+ - host all all all trust
+
+ storage:
+ size: 1Gi
+
+ bootstrap:
+ initdb:
+ database: mydb
+ owner: me
+
+We can import it via microservice:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-microservice
+spec:
+ instances: 1
+
+ storage:
+ size: 1Gi
+
+ bootstrap:
+ initdb:
+ import:
+ type: microservice
+ databases:
+ - mydb
+ source:
+ externalCluster: cluster-example
+
+ externalClusters:
+ - name: cluster-example
+ connectionParameters:
+ host: cluster-example-rw
+ dbname: postgres
+
+as well as via monolith:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-monolith
+spec:
+ instances: 1
+
+ storage:
+ size: 1Gi
+
+ bootstrap:
+ initdb:
+ import:
+ type: monolith
+ databases:
+ - mydb
+ roles:
+ - me
+ source:
+ externalCluster: cluster-example
+
+ externalClusters:
+ - name: cluster-example
+ connectionParameters:
+ host: cluster-example-rw
+ dbname: postgres
+
+In both cases, the database's contents will be imported, but:
+app, or whichever configuration for the fields database and owner are
+ set in the bootstrap.initdb stanza.mydb and me respectively. No app database nor user will be
+ created. If there are custom settings for database and owner in the
+ bootstrap.initdb stanza that don't match the source databases/owners to
+ import, the instance manager will create a new empty application database and
+ owner role, but will leave the imported databases/owners intact.During the logical import of a database, CloudNativePG optimizes the +configuration of PostgreSQL in order to prioritize speed versus data +durability, by forcing:
+archive_mode to offfsync to offfull_page_writes to offmax_wal_senders to 0wal_level to minimalBefore completing the import job, CloudNativePG restores the expected
+configuration, then runs initdb --sync-only to ensure that data is
+permanently written on disk.
Important
+WAL archiving, if requested, and WAL level will be honored after the +database import process has completed. Similarly, replicas will be cloned +after the bootstrap phase, when the actual cluster resource starts.
+There are other optimizations you can do during the import phase. Although this
+topic is beyond the scope of CloudNativePG, we recommend that you reduce
+unnecessary writes in the checkpoint area by tuning Postgres GUCs like
+shared_buffers, max_wal_size, checkpoint_timeout directly in the
+Cluster configuration.
pg_dump and pg_restore behaviorYou can customize the behavior of pg_dump and pg_restore by specifying additional
+options using the pgDumpExtraOptions and pgRestoreExtraOptions parameters.
+This is especially useful for improving performance or managing import/export complexity.
For example, enabling parallel jobs can significantly speed up data transfer:
+bootstrap:
+ initdb:
+ import:
+ type: microservice
+ databases:
+ - app
+ source:
+ externalCluster: cluster-example
+ pgDumpExtraOptions:
+ - '--jobs=2'
+ pgRestoreExtraOptions:
+ - '--jobs=2'
+
+pg_restore optionsFor more granular control over the import process, CloudNativePG supports
+stage-specific pg_restore options for the following phases:
pre-data – e.g., schema definitionsdata – e.g., table contentspost-data – e.g., indexes, constraints and triggersBy specifying options for each phase, you can optimize parallelism and apply +flags tailored to the nature of the objects being restored.
+bootstrap:
+ initdb:
+ import:
+ type: microservice
+ schemaOnly: false
+ databases:
+ - mynewdb
+ source:
+ externalCluster: sourcedb-external
+ pgRestorePredataOptions:
+ - '--jobs=1'
+ pgRestoreDataOptions:
+ - '--jobs=4'
+ pgRestorePostdataOptions:
+ - '--jobs=2'
+
+In the example above:
+--jobs=1 is applied to the pre-data stage to preserve the ordering of
+ schema creation.--jobs=4 increases parallelism during the data stage, speeding up large
+ data imports.--jobs=2 balances performance and dependency handling in the post-data
+ stage.These stage-specific settings are particularly valuable for large databases or +resource-sensitive environments where tuning concurrency can significantly +improve performance.
+Note
+When provided, stage-specific options take precedence over the general
+pgRestoreExtraOptions.
Warning
+The pgDumpExtraOptions, pgRestoreExtraOptions, and all stage-specific
+restore options (pgRestorePredataOptions, pgRestoreDataOptions,
+pgRestorePostdataOptions) are passed directly to the underlying PostgreSQL
+tools without validation by the operator. Certain flags may conflict with the
+operator’s intended functionality or design. Use these options with caution
+and always test them thoroughly in a safe, controlled environment before
+applying them in production.
Logical replication offers a powerful way to import any PostgreSQL database +accessible over the network using the following approach:
+Subscription Resource: Set up continuous replication to synchronize
+ data changes.This technique can also be leveraged for performing major PostgreSQL upgrades +with minimal downtime, making it ideal for seamless migrations and system +upgrades.
+For more details, including limitations and best practices, refer to the +Logical Replication section in the documentation.
+ +CloudNativePG simplifies PostgreSQL database provisioning by automatically
+creating an application database named app by default. This default behavior
+is explained in the "Bootstrap an Empty Cluster"
+section.
For more advanced use cases, CloudNativePG introduces declarative database
+management, which empowers users to define and control the lifecycle of
+PostgreSQL databases using the Database Custom Resource Definition (CRD).
+This method seamlessly integrates with Kubernetes, providing a scalable,
+automated, and consistent approach to managing PostgreSQL databases.
Important
+CloudNativePG manages global objects in PostgreSQL clusters, including +databases, roles, and tablespaces. However, it does not manage database content +beyond extensions and schemas (e.g., tables). To manage database content, use specialized +tools or rely on the applications themselves.
+Database ManifestThe following example demonstrates how a Database resource interacts with a
+Cluster:
apiVersion: postgresql.cnpg.io/v1
+kind: Database
+metadata:
+ name: cluster-example-one
+spec:
+ name: one
+ owner: app
+ cluster:
+ name: cluster-example
+ extensions:
+ - name: bloom
+ ensure: present
+
+When applied, this manifest creates a Database object called
+cluster-example-one requesting a database named one, owned by the app
+role, in the cluster-example PostgreSQL cluster.
Info
+Please refer to the API reference
+the full list of attributes you can define for each Database object.
Database Manifestmetadata.name: Unique name of the Kubernetes object within its namespace.spec.name: Name of the database as it will appear in PostgreSQL.spec.owner: PostgreSQL role that owns the database.spec.cluster.name: Name of the target PostgreSQL cluster.The Database object must reference a specific Cluster, determining where
+the database will be created. It is managed by the cluster's primary instance,
+ensuring the database is created or updated as needed.
Info
+The distinction between metadata.name and spec.name allows multiple
+Database resources to reference databases with the same name across different
+CloudNativePG clusters in the same Kubernetes namespace.
PostgreSQL automatically creates databases such as postgres, template0, and
+template1. These names are reserved and cannot be used for new Database
+objects in CloudNativePG.
Important
+Creating a Database with spec.name set to postgres, template0, or
+template1 is not allowed.
Once a Database object is reconciled successfully:
status.applied will be set to true.status.observedGeneration will match the metadata.generation of the last
+ applied configuration.Example of a reconciled Database object:
apiVersion: postgresql.cnpg.io/v1
+kind: Database
+metadata:
+ generation: 1
+ name: cluster-example-one
+spec:
+ cluster:
+ name: cluster-example
+ name: one
+ owner: app
+status:
+ observedGeneration: 1
+ applied: true
+
+If an error occurs during reconciliation, status.applied will be false, and
+an error message will be included in the status.message field.
CloudNativePG supports two methods for database deletion:
+delete reclaim policyensure field to absentdelete Reclaim PolicyThe databaseReclaimPolicy field determines the behavior when a Database
+object is deleted:
retain (default): The database remains in PostgreSQL for manual management.delete: The database is automatically removed from PostgreSQL.Example:
+apiVersion: postgresql.cnpg.io/v1
+kind: Database
+metadata:
+ name: cluster-example-two
+spec:
+ databaseReclaimPolicy: delete
+ name: two
+ owner: app
+ cluster:
+ name: cluster-example
+
+Deleting this Database object will automatically remove the two database
+from the cluster-example cluster.
ensure: absentTo remove a database, set the ensure field to absent like in the following
+example:.
apiVersion: postgresql.cnpg.io/v1
+kind: Database
+metadata:
+ name: cluster-example-database-to-drop
+spec:
+ cluster:
+ name: cluster-example
+ name: database-to-drop
+ owner: app
+ ensure: absent
+
+This manifest ensures that the database-to-drop database is removed from the
+cluster-example cluster.
Info
+While extensions are database-scoped rather than global objects, +CloudNativePG provides a declarative interface for managing them. This approach +is necessary because installing certain extensions may require superuser +privileges, which CloudNativePG recommends disabling by default. By leveraging +this API, users can efficiently manage extensions in a scalable and controlled +manner without requiring elevated privileges.
+CloudNativePG simplifies and automates the management of PostgreSQL extensions within the +target database.
+To enable this feature, define the spec.extensions field
+with a list of extension specifications, as shown in the following example:
# ...
+spec:
+ extensions:
+ - name: bloom
+ ensure: present
+# ...
+
+Each extension entry supports the following properties:
+name (mandatory): The name of the extension.ensure: Specifies whether the extension should be present or absent in the
+ database:present: Ensures that the extension is installed (default).absent: Ensures that the extension is removed.version: The specific version of the extension to install or
+ upgrade to.schema: The schema in which the extension should be installed.Info
+CloudNativePG manages extensions using the following PostgreSQL’s SQL commands:
+CREATE EXTENSION,
+DROP EXTENSION,
+ALTER EXTENSION
+(limited to UPDATE TO and SET SCHEMA).
The operator reconciles only the extensions explicitly listed in
+spec.extensions. Any existing extensions not specified in this list remain
+unchanged.
Warning
+Before the introduction of declarative extension management, CloudNativePG
+did not offer a straightforward way to create extensions through configuration.
+To address this, the "managed extensions"
+feature was introduced, enabling the automated and transparent management
+of key extensions like pg_stat_statements. Currently, it is your
+responsibility to ensure there are no conflicts between extension support in
+the Database CRD and the managed extensions feature.
Info
+Schema management in PostgreSQL is an exception to CloudNativePG's primary +focus on managing global objects. Since schemas exist within a database, they +are typically managed as part of the application development process. However, +CloudNativePG provides a declarative interface for schema management, primarily +to complete the support of extensions deployment within schemas.
+CloudNativePG simplifies and automates the management of PostgreSQL schemas within the +target database.
+To enable this feature, define the spec.schemas field
+with a list of schema specifications, as shown in the following example:
# ...
+spec:
+ schemas:
+ - name: app
+ owner: app
+# ...
+
+Each schema entry supports the following properties:
+name (mandatory): The name of the schema.owner: The owner of the schema.ensure: Specifies whether the schema should be present or absent in the
+ database:present: Ensures that the schema is installed (default).absent: Ensures that the schema is removed.Info
+CloudNativePG manages schemas using the following PostgreSQL’s SQL commands:
+CREATE SCHEMA,
+DROP SCHEMA,
+ALTER SCHEMA.
Info
+Foreign Data Wrappers (FDWs) are database-scoped objects that typically +require superuser privileges to create or modify. CloudNativePG provides a +declarative API for managing FDWs, enabling users to define and maintain them +in a controlled, Kubernetes-native way without directly executing SQL commands +or escalating privileges.
+CloudNativePG enables seamless and automated management of PostgreSQL foreign +data wrappers in the target database using declarative configuration.
+To enable this feature, define the spec.fdws field with a list of FDW
+specifications, as shown in the following example:
# ...
+spec:
+ fdws:
+ - name: postgres_fdw
+ usage:
+ - name: app
+ type: grant
+# ...
+
+Each FDW entry supports the following properties:
+name: The name of the foreign data wrapper (mandatory).ensure: Indicates whether the FDW should be present or absent in the
+ database (default is present).handler: The name of the handler function used by the FDW. If not
+ specified, the default handler defined by the FDW extension (if any) will be
+ used.validator: The name of the validator function used by the FDW. If not
+ specified, the default validator defined by the FDW extension (if any) will
+ be used.owner: The owner of the FDW (must be a superuser).usage: The list of USAGE permissions of the FDW, with the following fields:name : The name of the role to which the usage permission should be
+ granted or from which it should be revoked (mandatory).type : The type of the usage permission. Supports grant and revoke.options: A map of FDW-specific options to manage, where each key is the
+ name of an option. Each option supports the following fields:value: The string value of the option.ensure: Indicates whether the option should be present or absent.Info
+Both handler and validator are optional, and if not specified, the
+default handler and validator defined by the FDW extension (if any) will be
+used. Setting handler or validator to "-" will remove the handler or
+validator from the FDW respectively. This follows the PostgreSQL convention,
+where "-" denotes the absence of a handler or validator.
Warning
+PostgreSQL restricts ownership of foreign data wrappers to roles with
+superuser privileges only. Attempting to assign ownership to a non-superuser
+(e.g., an app role) will be ignored or rejected, as PostgreSQL does not allow
+non-superuser ownership of foreign data wrappers. By default, they are
+owned by the postgres user.
The operator reconciles only the FDWs explicitly listed in spec.fdws. Any
+existing FDWs not declared in this list are left untouched.
Info
+CloudNativePG manages FDWs using PostgreSQL's native SQL commands:
+ CREATE FOREIGN DATA WRAPPER,
+ ALTER FOREIGN DATA WRAPPER,
+ and DROP FOREIGN DATA WRAPPER.
+ The ALTER command supports option updates.
CloudNativePG provides seamless, automated management of PostgreSQL foreign +servers in a target database using declarative configuration.
+A foreign server encapsulates the connection details that a foreign data +wrapper (FDW) uses to access an external data source. For user-specific +connection details, you can define user mappings.
+Important
+CloudNativePG does not currently support declarative configuration of user mappings. +However, once an FDW and its foreign server are defined, you can grant +usage privileges to a standard database role. This allows you to manage user +mappings as part of your SQL schema, without requiring superuser privileges.
+To enable this feature, declare the spec.servers field in a Database
+resource with a list of foreign server specifications, for example:
# ...
+spec:
+ servers:
+ - name: angus
+ fdw: postgres_fdw
+ ensure: present
+ usage:
+ - name: app
+ type: grant
+ options:
+ - name: host
+ value: angus-rw
+ - name: dbname
+ value: app
+# ...
+
+Each foreign server entry supports the following properties:
+name: The name of the foreign server (mandatory).fdw: The name of the foreign data wrapper the server belongs to
+ (mandatory).ensure: Whether the foreign server should be present or absent in the
+ database (default: present).usage: The list of USAGE permissions of the foreign server, with the
+ following fields:name : The name of the role to which the usage permission should be
+ granted or from which it should be revoked (mandatory).type : The type of the usage permission. Supports grant and revoke.options: A list of FDW-specific option specifications.
+ Each entry in the list supports the following keys:name: The name of the option (mandatory).value: The string value of the option.ensure: Indicates whether the option should be present or absent.Important
+The fdw field must reference an existing foreign data wrapper already defined in the database.
+If the specified FDW does not exist, the foreign server will not be created.
Info
+CloudNativePG manages foreign servers using PostgreSQL’s native SQL commands:
+CREATE SERVER,
+ALTER SERVER, and
+DROP SERVER.
+The ALTER SERVER command is used to update server options.
The operator reconciles only the foreign servers explicitly listed in
+spec.servers. Any existing servers not included in this list are left
+unchanged.
While CloudNativePG adheres to PostgreSQL’s
+CREATE DATABASE and
+ALTER DATABASE
+commands, renaming databases is not supported.
+Attempting to modify spec.name in an existing Database object will result
+in rejection by Kubernetes.
CREATE DATABASE statement.ALTER DATABASE is used to apply changes.It is important to note that there are some differences between these two
+Postgres commands: in particular, the options accepted by ALTER are a subset
+of those accepted by CREATE.
Warning
+Some fields, such as encoding and collation settings, are immutable in +PostgreSQL. Attempts to modify these fields on existing databases will be +ignored.
+Database objects declared on replica clusters cannot be enforced, as replicas +lack write privileges. These objects will remain in a pending state until the +replica is promoted.
+If two Database objects in the same namespace manage the same PostgreSQL
+database (i.e., identical spec.name and spec.cluster.name), the second
+object will be rejected.
Example status message:
+status:
+ applied: false
+ message: 'reconciliation error: database "one" is already managed by Database object "cluster-example-one"'
+
+CloudNativePG adheres to PostgreSQL's capabilities. For example, features like
+ICU_RULES introduced in PostgreSQL 16 are unavailable in earlier versions.
+Errors from PostgreSQL will be reflected in the Database object's status.
CloudNativePG does not overwrite manual changes to databases. Once reconciled,
+a Database object will not be reapplied unless its metadata.generation
+changes, giving flexibility for direct PostgreSQL modifications.
CloudNativePG is designed to keep PostgreSQL clusters up, running and available +anytime.
+There are some kinds of workloads that require the database to be up only when +the workload is active. Batch-driven solutions are one such case.
+In batch-driven solutions, the database needs to be up only when the batch +process is running.
+The declarative hibernation feature enables saving CPU power by removing the +database Pods, while keeping the database PVCs.
+To hibernate a cluster, set the cnpg.io/hibernation=on annotation:
$ kubectl annotate cluster <cluster-name> --overwrite cnpg.io/hibernation=on
+
+A hibernated cluster won't have any running Pods, while the PVCs are retained +so that the cluster can be rehydrated at a later time. Replica PVCs will be +kept in addition to the primary's PVC.
+The hibernation procedure will delete the primary Pod and then the replica +Pods, avoiding switchover, to ensure the replicas are kept in sync.
+The hibernation status can be monitored by looking for the cnpg.io/hibernation
+condition:
$ kubectl get cluster <cluster-name> -o "jsonpath={.status.conditions[?(.type==\"cnpg.io/hibernation\")]}"
+
+{
+ "lastTransitionTime":"2023-03-05T16:43:35Z",
+ "message":"Cluster has been hibernated",
+ "reason":"Hibernated",
+ "status":"True",
+ "type":"cnpg.io/hibernation"
+}
+
+The hibernation status can also be read with the status sub-command of the
+cnpg plugin for kubectl:
$ kubectl cnpg status <cluster-name>
+Cluster Summary
+Name: cluster-example
+Namespace: default
+PostgreSQL Image: ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie
+Primary instance: cluster-example-2
+Status: Cluster in healthy state
+Instances: 3
+Ready instances: 0
+
+Hibernation
+Status Hibernated
+Message Cluster has been hibernated
+Time 2023-03-05 16:43:35 +0000 UTC
+[..]
+
+To rehydrate a cluster, either set the cnpg.io/hibernation annotation to off:
$ kubectl annotate cluster <cluster-name> --overwrite cnpg.io/hibernation=off
+
+Or, just unset it altogether:
+$ kubectl annotate cluster <cluster-name> cnpg.io/hibernation-
+
+The Pods will be recreated and the cluster will resume operation.
+ +From its inception, CloudNativePG has managed the creation of specific roles +required in PostgreSQL instances:
+postgres superuser, streaming_replica
+ and cnpg_pooler_pgbouncer (when the PgBouncer Pooler is used)This process is described in the "Bootstrap" section.
+With the managed stanza in the cluster spec, CloudNativePG now provides full
+lifecycle management for roles specified in .spec.managed.roles.
This feature enables declarative management of existing roles, as well as the +creation of new roles if they are not already present in the database. Role +creation will occur after the database bootstrapping is complete.
+An example manifest for a cluster with declarative role management can be found
+in the file cluster-example-with-roles.yaml.
Here is an excerpt from that file:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+spec:
+ managed:
+ roles:
+ - name: dante
+ ensure: present
+ comment: Dante Alighieri
+ login: true
+ superuser: false
+ inRoles:
+ - pg_monitor
+ - pg_signal_backend
+
+The role specification in .spec.managed.roles adheres to the
+PostgreSQL structure and naming conventions.
+Please refer to the API reference for
+the full list of attributes you can define for each role.
A few points are worth noting:
+ensure attribute is not part of PostgreSQL. It enables declarative
+ role management to create and remove roles. The two possible values are
+ present (the default) and absent.inherit attribute is true by default, following PostgreSQL conventions.connectionLimit attribute defaults to -1, in line with PostgreSQL conventions.inRoles defaults to no memberships.Declarative role management ensures that PostgreSQL instances align with the +spec. If a user modifies role attributes directly in the database, the +CloudNativePG operator will revert those changes during the next reconciliation +cycle.
+The declarative role management feature includes reconciling of role passwords. +Passwords are managed in fundamentally different ways in the Kubernetes world +and in PostgreSQL, and as a result there are a few things to note.
+Managed role configurations may optionally specify the name of a +Secret where the username and password are stored (encoded in Base64 +as is usual in Kubernetes). For example:
+ managed:
+ roles:
+ - name: dante
+ ensure: present
+ [… snipped …]
+ passwordSecret:
+ name: cluster-example-dante
+
+This would assume the existence of a Secret called cluster-example-dante,
+containing a username and password. The username should match the role we
+are setting the password for. For example, :
apiVersion: v1
+data:
+ username: ZGFudGU=
+ password: ZGFudGU=
+kind: Secret
+metadata:
+ name: cluster-example-dante
+ labels:
+ cnpg.io/reload: "true"
+type: kubernetes.io/basic-auth
+
+If there is no passwordSecret specified for a role, the instance manager will
+not try to CREATE / ALTER the role with a password. This keeps with PostgreSQL
+conventions, where ALTER will not update passwords unless directed to with
+WITH PASSWORD.
If a role was initially created with a password, and we would like to set the
+password to NULL in PostgreSQL, this necessitates being explicit on the part of
+the user of CloudNativePG.
+To distinguish "no password provided in spec" from "set the password to NULL",
+the field DisablePassword should be used.
Imagine we decided we would like to have no password on the dante role in the
+database. In such case we would specify the following:
managed:
+ roles:
+ - name: dante
+ ensure: present
+ [… snipped …]
+ disablePassword: true
+
+NOTE: it is considered an error to set both passwordSecret and
+disablePassword on a given role.
+This configuration will be rejected by the validation webhook.
VALID UNTILThe VALID UNTIL role attribute in PostgreSQL controls password expiry. Roles
+created without VALID UNTIL specified get NULL by default in PostgreSQL,
+meaning that their password will never expire.
PostgreSQL uses a timestamp type for VALID UNTIL, which includes support for
+the value 'infinity' indicating that the password never expires. Please see the
+PostgreSQL documentation
+for reference.
With declarative role management, the validUntil attribute for managed roles
+controls password expiry. validUntil can only take:
null)In the first case, the given validUntil timestamp will be set in the database
+as the VALID UNTIL attribute of the role.
In the second case (omitted validUntil) the operator will ensure password
+never expires, mirroring the behavior of PostgreSQL. Specifically:
VALID UNTIL clause in the role
+ creation statementVALID UNTIL to infinity if VALID
+ UNTIL was not set to NULL in the database (this is due to PostgreSQL not
+ allowing VALID UNTIL NULL in the ALTER ROLE SQL statement)Warning
+New roles created without passwordSecret will have a NULL password
+inside PostgreSQL.
You can also provide pre-encrypted passwords by specifying the password +in MD5/SCRAM-SHA-256 hash format:
+kind: Secret
+type: kubernetes.io/basic-auth
+metadata:
+ name: cluster-example-cavalcanti
+ labels:
+ cnpg.io/reload: "true"
+apiVersion: v1
+stringData:
+ username: cavalcanti
+ password: SCRAM-SHA-256$<iteration count>:<salt>$<StoredKey>:<ServerKey>
+
+In PostgreSQL, in some cases, commands cannot be honored by the database and +will be rejected. Please refer to the +PostgreSQL documentation on error codes +for details.
+Role operations can produce such fundamental errors. +Two examples:
+petrarca as a member of the role
+ (group) poets, but poets does not exist.dante, but the role dante is the owner
+ of the database inferno.These fundamental errors cannot be fixed by the database, nor the CloudNativePG
+operator, without clarification from the human administrator. The two examples
+above could be fixed by creating the role poets or dropping the database
+inferno respectively, but they might have originated due to human error, and
+in such case, the "fix" proposed might be the wrong thing to do.
CloudNativePG will record when such fundamental errors occur, and will display +them in the cluster Status. Which segues into…
+The Cluster status includes a section for the managed roles' status, as shown +below:
+status:
+ […snipped…]
+ managedRolesStatus:
+ byStatus:
+ not-managed:
+ - app
+ pending-reconciliation:
+ - dante
+ - petrarca
+ reconciled:
+ - ariosto
+ reserved:
+ - postgres
+ - streaming_replica
+ cannotReconcile:
+ dante:
+ - 'could not perform DELETE on role dante: owner of database inferno'
+ petrarca:
+ - 'could not perform UPDATE_MEMBERSHIPS on role petrarca: role "poets" does not exist'
+
+Note the special sub-section cannotReconcile for operations the database (and
+CloudNativePG) cannot honor, and which require human intervention.
This section covers roles reserved for operator use and those that are not +under declarative management, providing a comprehensive view of the roles in +the database instances.
+The kubectl plugin also shows the status of managed roles
+in its status sub-command:
Managed roles status
+Status Roles
+------ -----
+pending-reconciliation petrarca
+reconciled app,dante
+reserved postgres,streaming_replica
+
+Irreconcilable roles
+Role Errors
+---- ------
+petrarca could not perform UPDATE_MEMBERSHIPS on role petrarca: role "poets" does not exist
+
+Important
+In terms of backward compatibility, declarative role management is designed +to ignore roles that exist in the database but are not included in the spec. +The lifecycle of these roles will continue to be managed within PostgreSQL, +allowing CloudNativePG users to adopt this feature at their convenience.
+CloudNativePG is automatically tested after each +commit via a suite of End-to-end (E2E) tests (or integration tests) +which ensure that the operator correctly deploys and manages PostgreSQL +clusters.
+Kubernetes versions 1.27 through 1.32, and PostgreSQL versions 13 through 17, +are tested for each commit, helping detect bugs at an early stage of the +development process.
+For each tested version of Kubernetes and PostgreSQL, a Kubernetes +cluster is created using kind, run on the GitHub +Actions platform, +and the following suite of E2E tests are performed on that cluster:
+Basic:
+Service connectivity:
+Self-healing:
+Backup and Restore:
+Operator:
+Observability:
+Replication:
+Replica clusters
+Plugin:
+Postgres Configuration:
+Pod Scheduling:
+NodeSelectorCluster Metadata:
+Recovery:
+Importing Databases:
+Storage:
+Security:
+Maintenance:
+Hibernation
+Volume snapshots
+Managed Roles
+Tablespaces
+Declarative databases
+Declarative creation of databases with delete reclaim policy
+Major version upgrade
+In the case of unexpected errors on the primary for longer than the
+.spec.failoverDelay (by default 0 seconds), the cluster will go into
+failover mode. This may happen, for example, when:
postgres container on the primary has any kind of sustained failureIn the failover scenario, the primary cannot be assumed to be working properly.
+After cases like the ones above, the readiness probe for the primary pod will start +failing. This will be picked up in the controller's reconciliation loop. The +controller will initiate the failover process, in two steps:
+TargetPrimary as pending. This change of state will
+ force the primary pod to shutdown, to ensure the WAL receivers on the replicas
+ will stop. The cluster will be marked in failover phase ("Failing over").Important
+The two-phase procedure helps ensure the WAL receivers can stop in an orderly +fashion, and that the failing primary will not start streaming WALs again upon +restart. These safeguards prevent timeline discrepancies between the new primary +and the replicas.
+During the time the failing primary is being shut down:
+.spec.switchoverDelay seconds as timeout. This graceful shutdown will attempt
+ to archive pending WALs.Info
+"Fast" mode does not wait for PostgreSQL clients to disconnect and will +terminate an online backup in progress. All active transactions are rolled back +and clients are forcibly disconnected, then the server is shut down. +"Immediate" mode will abort all PostgreSQL server processes immediately, +without a clean shutdown.
+Failover may result in the service being impacted (RTO) +and/or data being lost (RPO):
+Note
+The timeout that controls fast shutdown is set by .spec.switchoverDelay,
+as in the case of a switchover. Increasing the time for fast shutdown is safer
+from an RPO point of view, but possibly delays the return to normal operation -
+negatively affecting RTO.
Warning
+As already mentioned in the "Instance Manager" section
+when explaining the switchover process, the .spec.switchoverDelay option
+affects the RPO and RTO of your PostgreSQL database. Setting it to a low value,
+might favor RTO over RPO but lead to data loss at cluster level and/or backup
+level. On the contrary, setting it to a high value, might remove the risk of
+data loss while leaving the cluster without an active primary for a longer time
+during the switchover.
As anticipated above, the .spec.failoverDelay option allows you to delay the start
+of the failover procedure by a number of seconds after the primary has been
+detected to be unhealthy. By default, this setting is set to 0, triggering the
+failover procedure immediately.
Sometimes failing over to a new primary can be more disruptive than waiting +for the primary to come back online. This is especially true of network +disruptions where multiple tiers are affected (i.e., downstream logical +subscribers) or when the time to perform the failover is longer than the +expected outage.
+Enabling a new configuration option to delay failover provides a mechanism to +prevent premature failover for short-lived network or node instability.
+Warning
+Failover quorum is an experimental feature introduced in version 1.27.0. +Use with caution in production environments.
+Failover quorum is a mechanism that enhances data durability and safety during +failover events in CloudNativePG-managed PostgreSQL clusters.
+Quorum-based failover allows the controller to determine whether to promote a replica +to primary based on the state of a quorum of replicas. +This is useful when stronger data durability is required than the one offered +by synchronous replication and +default automated failover procedures.
+When synchronous replication is not enabled, some data loss is expected and +accepted during failover, as a replica may lag behind the primary when +promoted.
+With synchronous replication enabled, the guarantee is that the application +will not receive explicit acknowledgment of the successful commit of a +transaction until the WAL data is known to be safely received by all required +synchronous standbys. +This is not enough to guarantee that the operator is able to promote the most +advanced replica.
+For example, in a three-node cluster with synchronous replication set to ANY 1
+(...), data is written to the primary and one standby before a commit is
+acknowledged. If both the primary and the aligned standby become unavailable
+(such as during a network partition), the remaining replica may not have the
+latest data. Promoting it could lose some data that the application considered
+committed.
Quorum-based failover addresses this risk by ensuring that failover only occurs +if the operator can confirm the presence of all synchronously committed data in +the instance to promote, and it does not occur otherwise.
+This feature allows users to choose their preferred trade-off between data +durability and data availability.
+Failover quorum can be enabled by setting the
+.spec.postgresql.synchronous.failoverQuorum field to true:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+
+ postgresql:
+ synchronous:
+ method: any
+ number: 1
+ failoverQuorum: true
+
+ storage:
+ size: 1Gi
+
+For backward compatibility, the legacy annotation
+alpha.cnpg.io/failoverQuorum is still supported by the admission webhook and
+takes precedence over the Cluster spec option:
"true" and a synchronous replication stanza
+ is present, the webhook automatically sets
+ .spec.postgresql.synchronous.failoverQuorum to true."false", the feature is always disabledImportant
+Because the annotation overrides the spec, we recommend that users of this
+experimental feature migrate to the native
+.spec.postgresql.synchronous.failoverQuorum option and remove the annotation
+from their manifests. The annotation is deprecated and will be removed in a
+future release.
Before promoting a replica to primary, the operator performs a quorum check,
+following the principles of the Dynamo R + W > N consistency model1.
In the quorum failover, these values assume the following meaning:
+R is the number of promotable replicas (read quorum);W is the number of replicas that must acknowledge the write before the
+ COMMIT is returned to the client (write quorum);N is the total number of potentially synchronous replicas;Promotable replicas are replicas that have these properties:
+If R + W > N, then we can be sure that among the promotable replicas there is
+at least one that has confirmed all the synchronous commits, and we can safely
+promote it to primary. If this is not the case, the controller will not promote
+any replica to primary, and will wait for the situation to change.
Users can force a promotion of a replica to primary through the
+kubectl cnpg promote command even if the quorum check is failing.
Warning
+Manual promotion should only be used as a last resort. Before proceeding, +make sure you fully understand the risk of data loss and carefully consider the +consequences of prioritizing the resumption of write workloads for your +applications.
+An additional CRD is used to track the quorum state of the cluster. A Cluster
+with the quorum failover enabled will have a FailoverQuorum resource with the same
+name as the Cluster resource. The FailoverQuorum CR is created by the
+controller when the quorum failover is enabled, and it is updated by the primary
+instance during its reconciliation loop, and read by the operator during quorum
+checks. It is used to track the latest known configuration of the synchronous
+replication.
Important
+Users should not modify the FailoverQuorum resource directly. During
+PostgreSQL configuration changes, when it is not possible to determine the
+configuration, the FailoverQuorum resource will be reset, preventing any
+failover until the new configuration is applied.
The FailoverQuorum resource works in conjunction with PostgreSQL synchronous
+replication.
Warning
+There is no guarantee that COMMIT operations returned to the
+client but that have not been performed synchronously, such as those made
+explicitly disabling synchronous replication with
+SET synchronous_commit TO local, will be present on a promoted replica.
In the following scenarios, R is the number of promotable replicas, W is
+the number of replicas that must acknowledge a write before commit, and N is
+the total number of potentially synchronous replicas. The "Failover" column
+indicates whether failover is allowed under quorum failover rules.
A cluster with instances: 3, synchronous.number=1, and
+dataDurability=required.
R + W > N (2 + 1 > 2), failover is allowed and safe.R + W = N (1 + 1 = 2), failover is not allowed to
+ prevent possible data loss.| R | +W | +N | +Failover | +
|---|---|---|---|
| 2 | +1 | +2 | +✅ | +
| 1 | +1 | +2 | +❌ | +
A cluster with instances: 3, synchronous.number: 1, and
+dataDurability: required experiences a network partition.
| R | +W | +N | +Failover | +
|---|---|---|---|
| 2 | +1 | +2 | +✅ | +
| 1 | +1 | +2 | +❌ | +
A cluster with instances: 5, synchronous.number=2, and
+dataDurability=required experiences a network partition.
| R | +W | +N | +Failover | +
|---|---|---|---|
| 3 | +2 | +4 | +✅ | +
| 2 | +2 | +4 | +❌ | +
A cluster with instances: 3 and remote synchronous replicas defined in
+standbyNamesPre or standbyNamesPost. We assume that the primary is failing.
This scenario requires an important consideration. Replicas listed in
+standbyNamesPre or standbyNamesPost are not counted in
+R (they cannot be promoted), but are included in N (they may have received
+synchronous writes). So, if
+synchronous.number <= len(standbyNamesPre) + len(standbyNamesPost), failover
+is not possible, as no local replica can be guaranteed to have the required
+data. The operator prevents such configurations during validation, but some
+invalid configurations are shown below for clarity.
Example configurations:
+Configuration #1 (valid):
+instances: 3
+postgresql:
+ synchronous:
+ method: any
+ number: 2
+ standbyNamesPre:
+ - angus
+
+In this configuration, when the primary fails, R = 2 (the local replicas),
+W = 2, and N = 3 (2 local replicas + 1 remote), allowing failover.
+In case of an additional replica failing (R = 1) failover is not allowed.
| R | +W | +N | +Failover | +
|---|---|---|---|
| 3 | +2 | +4 | +✅ | +
| 2 | +2 | +4 | +❌ | +
Configuration #2 (invalid):
+instances: 3
+postgresql:
+ synchronous:
+ method: any
+ number: 1
+ maxStandbyNamesFromCluster: 1
+ standbyNamesPre:
+ - angus
+
+In this configuration, R = 2 (the local replicas), W = 1, and N = 3
+(2 local replicas + 1 remote).
+Failover is not possible in this setup, so quorum failover can not be
+enabled with this configuration.
| R | +W | +N | +Failover | +
|---|---|---|---|
| 1 | +1 | +2 | +❌ | +
Configuration #3 (invalid):
+instances: 3
+postgresql:
+ synchronous:
+ method: any
+ number: 1
+ maxStandbyNamesFromCluster: 0
+ standbyNamesPre:
+ - angus
+ - malcolm
+
+In this configuration, R = 0 (the local replicas), W = 1, and N = 2
+(0 local replicas + 2 remote).
+Failover is not possible in this setup, so quorum failover can not be
+enabled with this configuration.
| R | +W | +N | +Failover | +
|---|---|---|---|
| 0 | +1 | +2 | +❌ | +
Consider a cluster with instances: 3, synchronous.number=1, and
+dataDurability=preferred that experiences a network partition.
synchronous_standby_names set.FailoverQuorum status cannot have changed, as the primary cannot
+ have received new configuration. If the operator can reach both replicas,
+ failover is allowed (R=2). If only one replica is reachable (R=1),
+ failover is not allowed.| R | +W | +N | +Failover | +
|---|---|---|---|
| 2 | +1 | +2 | +✅ | +
| 1 | +1 | +2 | +❌ | +
Note
+In previous versions of CloudNativePG, this page included specific failure +scenarios. Since these largely follow standard Kubernetes behavior, we have +streamlined the content to avoid duplication of information that belongs to the +underlying Kubernetes stack and is not specific to CloudNativePG.
+CloudNativePG adheres to standard Kubernetes principles for self-healing and +high availability. We assume familiarity with core Kubernetes concepts such as +storage classes, PVCs, nodes, and Pods. For CloudNativePG-specific details, +refer to the "Postgres Instance Manager" section, which +covers startup, liveness, and readiness probes, as well as the +self-healing section below.
+Important
+If you are running CloudNativePG in production, we strongly recommend +seeking professional support.
+If the primary Pod fails:
+-rw service is updated to point to the new primary.-r and -rw services.pg_rewind to re-synchronize if its PVC is available;
+ otherwise, a new standby is created from a backup of the new primary.If a standby Pod fails:
+-r and -ro services.-r and -ro services.For failure scenarios not covered by automated recovery, manual intervention +may be required.
+Important
+Do not perform manual operations without professional support.
+To temporarily disable the reconciliation loop for a PostgreSQL cluster, use
+the cnpg.io/reconciliationLoop annotation:
metadata:
+ name: cluster-example-no-reconcile
+ annotations:
+ cnpg.io/reconciliationLoop: "disabled"
+spec:
+ # ...
+
+Use this annotation with extreme caution and only during emergency +operations.
+Warning
+This annotation should be removed as soon as the issue is resolved. Leaving +it in place prevents the operator from executing self-healing actions, +including failover.
+Everyone knows that stateful workloads like PostgreSQL cannot run in +Kubernetes. Why do you say the contrary?
+An independent research survey commissioned by the Data on Kubernetes +Community in September 2021 +revealed that half of the respondents run most of their production +workloads on Kubernetes. 90% of them believe that Kubernetes is ready +for stateful workloads, and 70% of them run databases in production. +Databases like Postgres. However, according to them, significant +challenges remain, such as the knowledge gap (Kubernetes and Cloud +Native, in general, have a steep learning curve) and the quality of +Kubernetes operators. The latter is the reason why we believe that an +operator like CloudNativePG highly contributes to the success +of your project.
+For database fanatics like us, a real game-changer has been the +introduction of the support for local persistent volumes in +Kubernetes 1.14 in April 2019.
+CloudNativePG is built on immutable application containers. +What does it mean?
+According to the microservice architectural pattern, a container is +designed to run a single application or process. As a result, such +container images are built to run the main application as the +single entry point (the so-called PID 1 process).
+In Kubernetes terms, the application is referred to as workload. +Workloads can be stateless like a web application server or stateful like a +database. Mapping this concept to PostgreSQL, an immutable application +container is a single "postgres" process that is running and +tied to a single and specific version - the one in the immutable +container image.
+No other processes such as SSH or systemd, or syslog are allowed.
+Immutable Application Containers are in contrast with Mutable System +Containers, which are still a very common way to interpret and use +containers.
+Immutable means that a container won't be modified during its life: no +updates, no patches, no configuration changes. If you must update the +application code or apply a patch, you build a new image and redeploy +it. Immutability makes deployments safer and more repeatable.
+For more information, please refer to +"Why EDB chose immutable application containers".
+What does Cloud Native mean?
+The Cloud Native Computing Foundation defines the term +"Cloud Native". +However, since the start of the Cloud Native PostgreSQL/CloudNativePG operator +at 2ndQuadrant, the development team has been interpreting Cloud Native +as three main concepts:
+Currently, the standard de facto for container orchestration is +Kubernetes, which automates the deployment, administration and +scalability of Cloud Native Applications.
+Another definition of Cloud Native that resonates with us is the one +defined by Ibryam and Huß in +"Kubernetes Patterns", published by O'Reilly:
+++Principles, Patterns, Tools to automate containerized microservices at scale
+
Can I run CloudNativePG on bare metal Kubernetes?
+Yes, definitely. You can run Kubernetes on bare metal. And you can dedicate one +or more physical worker nodes with locally attached storage to PostgreSQL +workloads for maximum and predictable I/O performance.
+The actual Cloud Native PostgreSQL project, from which CloudNativePG +originated, was born after a pilot project in 2019 that benchmarked storage and +PostgreSQL on the same bare metal server, first directly in Linux, and then +inside Kubernetes. As expected, the experiment showed only negligible +performance impact introduced by the container running in Kubernetes through +local persistent volumes, allowing the Cloud Native initiative to continue.
+Why should I use PostgreSQL replication instead of file system +replication?
+Please read the "Architecture: Synchronizing the state" +section.
+Why should I use an operator instead of running PostgreSQL as a +container?
+The most basic approach to running PostgreSQL in Kubernetes is to have a +pod, which is the smallest unit of deployment in Kubernetes, running a +Postgres container with no replica. The volume hosting the Postgres data +directory is mounted on the pod, and it usually resides on network +storage. In this case, Kubernetes restarts the pod in case of a +problem or moves it to another Kubernetes node.
+The most sophisticated approach is to run PostgreSQL using an operator. +An operator is an extension of the Kubernetes controller and defines how +a complex application works in business continuity contexts. The +operator pattern is currently state of the art in Kubernetes for +this purpose. An operator simulates the work of a human operator in an +automated and programmatic way.
+Postgres is a complex application, and an operator not only needs to +deploy a cluster (the first step), but also properly react after +unexpected events. The typical example is that of a failover.
+An operator relies on Kubernetes for capabilities like self-healing, +scalability, replication, high availability, backup, recovery, updates, +access, resource control, storage management, and so on. It also +facilitates the integration of a PostgreSQL cluster in the log +management and monitoring infrastructure.
+CloudNativePG enables the definition of the desired state of a +PostgreSQL cluster via declarative configuration. Kubernetes +continuously makes sure that the current state of the infrastructure +matches the desired one through reconciliation loops initiated by the +Kubernetes controller. If the desired state and the actual state don't +match, reconciliation loops trigger self-healing procedures. That's +where an operator like CloudNativePG comes into play.
+Are there any other operators for Postgres out there?
+Yes, of course. And our advice is that you look at all of them and compare +them with CloudNativePG before making your decision. You will see that +most of these operators use an external failover management tool (Patroni +or similar) and rely on StatefulSets.
+Here is a non exhaustive list, in chronological order from their +publication on GitHub:
+Feel free to report any relevant missing entry as a PR.
+Info
+The Data on Kubernetes Community +(which includes some of our maintainers) is working on an independent and +vendor neutral project to list the operators called +Operator Feature Matrix.
+You say that CloudNativePG is a fully declarative operator. +What do you mean by that?
+The easiest way is to explain declarative configuration through an +example that highlights the differences with imperative configuration. +In an imperative context, the state is defined as a series of tasks to +be executed in sequence. So, we can get a three-node PostgreSQL cluster +by creating the first instance, configuring the replication, cloning a +second instance, and the third one.
+In a declarative approach, the state of a system is defined using +configuration, namely: there's a PostgreSQL 13 cluster with two replicas. +This approach highly simplifies change management operations, and when +these are stored in source control systems like Git, it enables the +Infrastructure as Code capability. And Kubernetes takes it farther than +deployment, as it makes sure that our request is fulfilled at any time.
+What are the required skills to run PostgreSQL on Kubernetes?
+Running PostgreSQL on Kubernetes requires both PostgreSQL and Kubernetes +skills in your DevOps team. The best experience is when database +administrators familiarize themselves with Kubernetes core concepts +and are able to interact with Kubernetes administrators.
+Our advice is for everyone that wants to fully exploit Cloud Native +PostgreSQL to acquire the "Certified Kubernetes Administrator (CKA)" +status from the CNCF certification program.
+Why isn't CloudNativePG using StatefulSets?
+CloudNativePG does not rely on StatefulSet resources, and
+instead manages the underlying PVCs directly by leveraging the selected
+storage class for dynamic provisioning. Please refer to the
+"Custom Pod Controller" section for details and reasons behind
+this decision.
What happens to the PostgreSQL clusters when the operator pod dies or it is +not available for a certain amount of time?
+The CloudNativePG operator, among other things, is responsible for self-healing +capabilities. As such, they might not be available during an outage of the +operator.
+However, assuming that the outage does not affect the nodes where PostgreSQL +clusters are running, the database will continue to serve normal operations, +through the relevant Kubernetes services. Moreover, the instance manager, +which runs inside each PostgreSQL pod will still work, making sure that the +database server is up, including accessory services like logging, export of +metrics, continuous archiving of WAL files, etc.
+To summarize:
+an outage of the operator does not necessarily imply a PostgreSQL +database outage; it's like running a database without a DBA or system +administrator.
+What are the reasons behind CloudNativePG not relying on a failover +management tool like Patroni, repmgr, or Stolon?
+Although part of the team that develops CloudNativePG has been heavily +involved in repmgr in the past, we decided to take a different approach +and directly extend the Kubernetes controller and rely on the Kubernetes API +server to hold the status of a Postgres cluster, and use it as the only source +of truth to:
+Should I manually resync a former primary with the new one following a +failover?
+No. The operator does that automatically for you, and relies on pg_rewind to
+synchronize the former primary with the new one.
Why should I use PostgreSQL?
+We believe that PostgreSQL is the equivalent in the database area of +what Linux represents in the operating system space. The current latest +major version of Postgres is version 16, which ships out of the box:
+And so on ...
+How many databases should be hosted in a single PostgreSQL instance?
+Our recommendation is to dedicate a single PostgreSQL cluster +(intended as primary and multiple standby servers) to a single database, +entirely managed by a single microservice application. However, by +leveraging the "postgres" superuser, it is possible to create as many +users and databases as desired (subject to the available resources).
+The reason for this recommendation lies in the Cloud Native concept, +based on microservices. In a pure microservice architecture, the +microservice itself should own the data it manages exclusively. +These could be flat files, queues, key-value stores, or, in our case, a +PostgreSQL relational database containing both structured and +unstructured data. The general idea is that only the microservice can +access the database, including schema management and migrations.
+CloudNativePG has been designed to work this way out of the +box, by default creating an application user and an application database +owned by the aforementioned application user.
+Reserving a PostgreSQL instance to a single microservice owned database, +enhances:
+Is there an upper limit in database size for not considering Kubernetes?
+No, as Kubernetes is no different from virtual machines and bare metal as far +as this is regarded. +Practically, however, it depends on the available resources of your Kubernetes +cluster. Our advice with very large databases (VLDB) is to consider a shared +nothing architecture, where a Kubernetes worker node is dedicated to a single +Postgres instance, with dedicated storage. +We proved that this extreme architectural pattern works when we benchmarked +running PostgreSQL on bare metal Kubernetes with local persistent +volumes. +Tablespaces and horizontal partitioning are data modeling techniques that you +can use to improve the vertical scalability of you databases.
+How can I specify a time zone in the PostgreSQL cluster?
+PostgreSQL has an extensive support for time zones, as explained in the official +documentation:
+ +Although time zones can even be used at session, transaction and even as part
+of a query in PostgreSQL, a very common way is to set them up globally. With
+CloudNativePG you can configure the cluster level time zone in the
+.spec.postgresql.parameters section as in the following example:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: pg-italy
+spec:
+ instances: 1
+
+ postgresql:
+ parameters:
+ timezone: "Europe/Rome"
+
+ storage:
+ size: 1Gi
+
+The time zone can be verified with:
+$ kubectl exec -ti pg-italy-1 -c postgres -- psql -x -c "SHOW timezone"
+-[ RECORD 1 ]---------
+TimeZone | Europe/Rome
+
+What is the recommended architecture for best business continuity +outcomes?
+As covered in the "Architecture" section, the main +recommendation is to adopt shared nothing architectures as much as possible, by +leveraging the native capabilities and resources that Kubernetes provides in a +single cluster, namely:
+Use at least one standby, preferably at least two, so that you can configure +synchronous replication in the cluster, introducing RPO=0 +for high availability.
+If you do not have availability zones - normally the case of on-premise +installations - separate on worker nodes and storage.
+Properly setup continuous backup on a local/regional object store.
+The same architecture that is in a single Kubernetes cluster can be replicated +in another Kubernetes cluster (normally in another geographical area or region) +through the replica cluster feature, providing disaster +recovery and high availability at global scale.
+You can use the WAL archive in the primary object store to feed the replica in +the other region, without having to provide a streaming connection, if the default +maximum RPO of 5 minutes is enough for you.
+How can instances be stopped or started?
+Please look at "Fencing" or "Hibernation".
+What are the global objects such as roles and databases that are +automatically created by CloudNativePG?
+The operator automatically creates a user for the application (by default
+called app) and a database for the application (by default called app)
+which is owned by the aforementioned user.
This way, the database is ready for a microservice adoption, as developers
+can control migrations using the app user, without requiring superuser
+access.
Teams can then create another user for read-write operations through the
+"Declarative role management" feature
+and assign the required GRANT to the tables.
Fencing in CloudNativePG is the ultimate process of protecting the
+data in one, more, or even all instances of a PostgreSQL cluster when they
+appear to be malfunctioning. When an instance is fenced, the PostgreSQL server
+process (postmaster) is guaranteed to be shut down, while the pod is kept running.
+This makes sure that, until the fence is lifted, data on the pod is not modified by
+PostgreSQL and that the file system can be investigated for debugging and
+troubleshooting purposes.
In CloudNativePG you can fence:
+ClusterFencing is controlled through the content of the cnpg.io/fencedInstances
+annotation, which expects a JSON formatted list of instance names.
+If the annotation is set to '["*"]', a singleton list with a wildcard, the
+whole cluster is fenced.
+If the annotation is set to an empty JSON list, the operator behaves as if the
+annotation was not set.
For example:
+cnpg.io/fencedInstances: '["cluster-example-1"]' will fence just
+ the cluster-example-1 instance
cnpg.io/fencedInstances: '["cluster-example-1","cluster-example-2"]'
+ will fence the cluster-example-1 and cluster-example-2 instances
cnpg.io/fencedInstances: '["*"]' will fence every instance in
+ the cluster.
The annotation can be manually set on the Kubernetes object, for example via
+the kubectl annotate command, or in a transparent way using the
+kubectl cnpg fencing on subcommand:
# to fence only one instance
+kubectl cnpg fencing on cluster-example 1
+
+# to fence all the instances in a Cluster
+kubectl cnpg fencing on cluster-example "*"
+
+Here is an example of a Cluster with an instance that was previously fenced:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ annotations:
+ cnpg.io/fencedInstances: '["cluster-example-1"]'
+[...]
+
+Fencing can be lifted by clearing the annotation, or set it to a different value.
+As for fencing, this can be done either manually with kubectl annotate, or
+using the kubectl cnpg fencing subcommand as follows:
# to lift the fencing only for one instance
+# N.B.: at the moment this won't work if the whole cluster was fenced previously,
+# in that case you will have to manually set the annotation as explained above
+kubectl cnpg fencing off cluster-example 1
+
+# to lift the fencing for all the instances in a Cluster
+kubectl cnpg fencing off cluster-example "*"
+
+Once an instance is set for fencing, the procedure to shut down the
+postmaster process is initiated, identical to the one of the switchover.
+This consists of an initial fast shutdown with a timeout set to
+.spec.switchoverDelay, followed by an immediate shutdown. Then:
the Pod will be kept alive
+the Pod won't be marked as Ready
+all the changes that don't require the Postgres instance to be up will be + reconciled, including:
+metrics will not be collected, except cnpg_collector_fencing_on which will be
+ set to 1
Warning
+If a primary instance is fenced, its postmaster process +is shut down but no failover is performed, interrupting the operativity of +the applications. When the fence will be lifted, the primary instance will be +started up again without performing a failover.
+Given that, we advise users to fence primary instances only if strictly required.
+If a fenced instance is deleted, the pod will be recreated normally, but the
+postmaster won't be started. This can be extremely helpful when instances
+are Crashlooping.
ImageCatalog and ClusterImageCatalog are essential resources that empower
+you to define images for creating a Cluster.
The key distinction lies in their scope: an ImageCatalog is namespaced, while
+a ClusterImageCatalog is cluster-scoped.
Both share a common structure, comprising a list of images, each equipped with
+a major field indicating the major version of the image.
Warning
+The operator places trust in the user-defined major version and refrains +from conducting any PostgreSQL version detection. It is the user's +responsibility to ensure alignment between the declared major version in +the catalog and the PostgreSQL image.
+The major field's value must remain unique within a catalog, preventing
+duplication across images. Distinct catalogs, however, may
+expose different images under the same major value.
Example of a Namespaced ImageCatalog:
apiVersion: postgresql.cnpg.io/v1
+kind: ImageCatalog
+metadata:
+ name: postgresql
+ namespace: default
+spec:
+ images:
+ - major: 15
+ image: ghcr.io/cloudnative-pg/postgresql:15.14-system-trixie
+ - major: 16
+ image: ghcr.io/cloudnative-pg/postgresql:16.10-system-trixie
+ - major: 17
+ image: ghcr.io/cloudnative-pg/postgresql:17.6-system-trixie
+ - major: 18
+ image: ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie
+
+Example of a Cluster-Wide Catalog using ClusterImageCatalog Resource:
apiVersion: postgresql.cnpg.io/v1
+kind: ClusterImageCatalog
+metadata:
+ name: postgresql
+spec:
+ images:
+ - major: 15
+ image: ghcr.io/cloudnative-pg/postgresql:15.14-system-trixie
+ - major: 16
+ image: ghcr.io/cloudnative-pg/postgresql:16.10-system-trixie
+ - major: 17
+ image: ghcr.io/cloudnative-pg/postgresql:17.6-system-trixie
+ - major: 18
+ image: ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie
+
+A Cluster resource has the flexibility to reference either an ImageCatalog
+(like in the following example) or a ClusterImageCatalog to precisely specify
+the desired image.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ imageCatalogRef:
+ apiGroup: postgresql.cnpg.io
+ # Change the following to `ClusterImageCatalog` if needed
+ kind: ImageCatalog
+ name: postgresql
+ major: 16
+ storage:
+ size: 1Gi
+
+Clusters utilizing these catalogs maintain continuous monitoring. +Any alterations to the images within a catalog trigger automatic updates for +all associated clusters referencing that specific entry.
+The CloudNativePG project maintains ClusterImageCatalog manifests for all
+supported images.
These catalogs are regularly updated and published in the +artifacts repository.
+Each catalog corresponds to a specific combination of image type (e.g.
+minimal) and Debian release (e.g. trixie). It lists the most up-to-date
+container images for every supported PostgreSQL major version.
By installing these catalogs, cluster administrators can ensure that their +PostgreSQL clusters are automatically updated to the latest patch release +within a given PostgreSQL major version, for the selected Debian distribution +and image type.
+For example, to install the latest catalog for the minimal PostgreSQL
+container images on Debian trixie, run:
kubectl apply -f \
+ https://raw.githubusercontent.com/cloudnative-pg/artifacts/refs/heads/main/image-catalogs/catalog-minimal-trixie.yaml
+
+You can install all the available catalogs by using the kustomization file
+present in the image-catalogs directory:
kubectl apply -k https://github.com/cloudnative-pg/artifacts//image-catalogs?ref=main
+
+You can then view all the catalogs deployed with:
+kubectl get clusterimagecatalogs.postgresql.cnpg.io
+
+For example, you can create a cluster with the latest minimal image for PostgreSQL 18 on trixie with:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: angus
+spec:
+ instances: 3
+ imageCatalogRef:
+ apiGroup: postgresql.cnpg.io
+ kind: ClusterImageCatalog
+ name: postgresql-minimal-trixie
+ major: 18
+ storage:
+ size: 1Gi
+
+
+ CloudNativePG supports the dynamic loading of PostgreSQL extensions into a
+Cluster at Pod startup using the Kubernetes ImageVolume feature
+and the extension_control_path GUC introduced in PostgreSQL 18, to which this
+project contributed.
This feature allows you to mount a PostgreSQL extension, +packaged as an OCI-compliant container image, as a read-only and immutable +volume inside a running pod at a known filesystem path.
+You can make the extension available either globally, using the
+shared_preload_libraries option,
+or at the database level through the CREATE EXTENSION command. For the
+latter, you can use the Database resource’s declarative extension management
+to ensure consistent, automated extension setup within your PostgreSQL
+databases.
Image volume extensions decouple the distribution of PostgreSQL operand +container images from the distribution of extensions. This eliminates the +need to define and embed extensions at build time within your PostgreSQL +images—a major adoption blocker for PostgreSQL as a containerized workload, +including from a security and supply chain perspective.
+As a result, you can:
+minimal operand images
+ provided by CloudNativePG.Cluster definitions,
+ without rebuilding or maintaining custom PostgreSQL images.Extension images must be built according to the +documented specifications.
+To use image volume extensions with CloudNativePG, you need:
+extension_control_path.ImageVolume feature gate enabled.Cluster resource.Cluster resource.Cluster resource.Extension images are defined in the .spec.postgresql.extensions stanza of a
+Cluster resource, which accepts an ordered list of extensions to be added to
+the PostgreSQL cluster.
Info
+For field-level details, see the
+API reference for ExtensionConfiguration.
Each image volume is mounted at /extensions/<EXTENSION_NAME>.
By default, CloudNativePG automatically manages the relevant GUCs, setting:
+extension_control_path to /extensions/<EXTENSION_NAME>/share, allowing
+ PostgreSQL to locate any extension control file within /extensions/<EXTENSION_NAME>/share/extensiondynamic_library_path to /extensions/<EXTENSION_NAME>/libThese values are appended in the order in which the extensions are defined in
+the extensions list, ensuring deterministic path resolution within
+PostgreSQL. This allows PostgreSQL to discover and load the extension without
+requiring manual configuration inside the pod.
Info
+Depending on how your extension container images are built and their layout,
+you may need to adjust the default extension_control_path and
+dynamic_library_path values to match the image structure.
Important
+If the extension image includes shared libraries, they must be compiled +with the same PostgreSQL major version, operating system distribution, and CPU +architecture as the PostgreSQL container image used by your cluster, to ensure +compatibility and prevent runtime issues.
+Adding an extension to a database in CloudNativePG involves a few steps:
+Cluster resource so that PostgreSQL can
+ discover and load it.shared_preload_libraries
+ if the extension requires it.Database resource where you want it
+ installed, if the extension supports CREATE EXTENSION.Warning
+Avoid making changes to extension images and PostgreSQL configuration
+settings (such as shared_preload_libraries) simultaneously.
+First, allow the pod to roll out with the new extension image, then update
+the PostgreSQL configuration.
+This limitation will be addressed in a future release of CloudNativePG.
For illustration purposes, this guide uses a simple, fictitious extension named
+foo that supports CREATE EXTENSION.
Cluster resourceYou can add an ImageVolume-based extension to a Cluster using the
+.spec.postgresql.extensions stanza. For example:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: foo-18
+spec:
+ # ...
+ postgresql:
+ extensions:
+ - name: foo
+ image:
+ reference: # registry path for your extension image
+ # ...
+
+The name field is mandatory and must be unique within the cluster, as
+it determines the mount path (/extensions/foo in this example). It must
+consist of lowercase alphanumeric characters or hyphens (-) and must start
+and end with an alphanumeric character.
The image stanza follows the Kubernetes ImageVolume API.
+The reference must point to a valid container registry path for the extension
+image.
Important
+When a new extension is added to a running Cluster, CloudNativePG will
+automatically trigger a rolling update to attach the new
+image volume to each pod. Before adding a new extension in production,
+ensure you have thoroughly tested it in a staging environment to prevent
+configuration issues that could leave your PostgreSQL cluster in an unhealthy
+state.
Once mounted, CloudNativePG will automatically configure PostgreSQL by appending:
+/extensions/foo/share to extension_control_path/extensions/foo/lib to dynamic_library_pathThis ensures that the PostgreSQL container is ready to serve the foo
+extension when requested by a database, as described in the next section. The
+CREATE EXTENSION foo command, triggered automatically during the
+reconciliation of the Database resource,
+will work without additional configuration, as PostgreSQL will locate:
/extensions/foo/share/extension/foo.control/extensions/foo/lib/foo.soDatabase resourceOnce the extension is available in the PostgreSQL instance, you can leverage +declarative databases to manage the lifecycle of your extensions +within the target database.
+Continuing with the foo example, you can request the installation of the
+foo extension in the app database of the foo-18 cluster using the
+following resource definition:
apiVersion: postgresql.cnpg.io/v1
+kind: Database
+metadata:
+ name: foo-app
+spec:
+ name: app
+ owner: app
+ cluster:
+ name: foo-18
+ extensions:
+ - name: foo
+ version: 1.0
+
+CloudNativePG will automatically reconcile this resource, executing the
+CREATE EXTENSION foo command inside the app database if it is not
+already installed, ensuring your desired state is maintained without manual
+intervention.
In some cases, the default expected structure may be insufficient for your +extension image, particularly when:
+Following the "convention over configuration" paradigm, CloudNativePG allows +you to finely control the configuration of each extension image through the +following fields:
+extension_control_path: A list of relative paths within the container image
+ to be appended to PostgreSQL’s extension_control_path, allowing it to
+ locate extension control files.dynamic_library_path: A list of relative paths within the container image
+ to be appended to PostgreSQL’s dynamic_library_path, enabling it to locate
+ shared library files for extensions.ld_library_path: A list of relative paths within the container image to be
+ appended to the LD_LIBRARY_PATH environment variable of the instance
+ manager process, allowing PostgreSQL to locate required system libraries at
+ runtime.This flexibility enables you to support complex or non-standard extension +images while maintaining clarity and predictability.
+If your extension image does not use the default lib and share directories
+for its libraries and control files, you can override the defaults by
+explicitly setting extension_control_path and dynamic_library_path.
For example:
+spec:
+ postgresql:
+ extensions:
+ - name: my-extension
+ extension_control_path:
+ - my/share/path
+ dynamic_library_path:
+ - my/lib/path
+ image:
+ reference: # registry path for your extension image
+
+CloudNativePG will configure PostgreSQL with:
+/extensions/my-extension/my/share/path appended to extension_control_path/extensions/my-extension/my/lib/path appended to dynamic_library_pathThis allows PostgreSQL to discover your extension’s control files and shared +libraries correctly, even with a non-standard layout.
+You may need to include multiple extensions within the same container image, +adopting a structure where each extension’s files reside in their own +subdirectory.
+For example, to package PostGIS and pgRouting together in a single image, each +in its own subdirectory:
+# ...
+spec:
+ # ...
+ postgresql:
+ extensions:
+ - name: geospatial
+ extension_control_path:
+ - postgis/share
+ - pgrouting/share
+ dynamic_library_path:
+ - postgis/lib
+ - pgrouting/lib
+ # ...
+ image:
+ reference: # registry path for your geospatial image
+ # ...
+ # ...
+ # ...
+
+Some extensions, such as PostGIS, require system libraries that may not be
+present in the base PostgreSQL image. To support these requirements, you can
+package the necessary libraries within your extension container image and make
+them available to PostgreSQL using the ld_library_path field.
For example, if your extension image includes a system directory with the
+required libraries:
# ...
+spec:
+ # ...
+ postgresql:
+ extensions:
+ - name: postgis
+ # ...
+ ld_library_path:
+ - syslib
+ image:
+ reference: # registry path for your PostGIS image
+ # ...
+ # ...
+ # ...
+
+CloudNativePG will set the LD_LIBRARY_PATH environment variable to include
+/extensions/postgis/syslib, allowing PostgreSQL to locate and load these
+system libraries at runtime.
Important
+Since ld_library_path must be set when the PostgreSQL process starts,
+changing this value requires a cluster restart for the new value to take effect.
+CloudNativePG does not currently trigger this restart automatically; you will need to
+manually restart the cluster (e.g., using cnpg restart) after modifying ld_library_path.
A standard extension container image for CloudNativePG includes two +required directories at its root:
+/share/: contains an extension subdirectory with the extension control
+ file (e.g. <EXTENSION>.control) and the corresponding SQL files./lib/: contains the extension’s shared library (e.g. <EXTENSION>.so) as
+ well as any other required libraries.Following this structure ensures that the extension will be automatically +discoverable and usable by PostgreSQL within CloudNativePG without requiring +manual configuration.
+Important
+We encourage PostgreSQL extension developers to publish OCI-compliant extension +images following this layout as part of their artifact distribution, making +their extensions easily consumable within Kubernetes environments. +Ideally, extension images should target a specific operating system +distribution and architecture, be tied to a particular PostgreSQL version, and +be built using the distribution’s native packaging system (for example, using +Debian or RPM packages). This approach ensures consistency, security, and +compatibility with the PostgreSQL images used in your clusters.
+Currently, adding, removing, or updating an extension image triggers a +restart of the PostgreSQL pods. This behavior is inherited from how +image volumes +work in Kubernetes.
+Before performing an extension update, ensure you have:
+version field for the extension in the relevant Database
+ resource definition to align with the new version in the image.These steps help prevent downtime or data inconsistencies in your PostgreSQL +clusters during extension updates.
+ +CloudNativePG (CNPG) is an open-source +operator +designed to manage PostgreSQL workloads on any +supported Kubernetes cluster. +It fosters cloud-neutrality through seamless deployment in private, public, +hybrid, and multi-cloud environments via its +distributed topology feature.
+Built around DevOps principles, CloudNativePG embraces declarative +configuration and immutable infrastructure, ensuring reliability and automation +in database management.
+At its core, CloudNativePG introduces a custom Kubernetes resource called
+Cluster, representing a PostgreSQL cluster with:
These instances reside within a Kubernetes namespace, allowing applications to +connect seamlessly using operator-managed services. Failovers and switchovers +occur transparently, eliminating the need for manual intervention.
+For applications inside the Kubernetes cluster, CNPG provides a microservice
+database approach, enabling co-location of PostgreSQL clusters and applications
+in the same namespace for optimized access.
+For applications outside the cluster, CNPG offers flexible connectivity through
+service templates and LoadBalancer services for direct TCP exposure.
+Additionally, web applications can take advantage of the native connection
+pooler based on PgBouncer.
CloudNativePG was originally built by EDB, then +released open source under Apache License 2.0. +The source code repository is in GitHub.
+Note
+Based on the Operator Capability Levels model, +users can expect a "Level V - Auto Pilot" subset of capabilities from the +CloudNativePG Operator.
+Each minor release of CloudNativePG is designed to work with a range of +Kubernetes versions, usually the ones supported by the CNCF at the time the +minor version was first released.
+Please refer to the "Supported releases" page for details.
+The CloudNativePG community maintains +container images for both the operator and PostgreSQL (the operand).
+The CloudNativePG operator container images are available on the
+cloudnative-pg project's GitHub Container Registry
+in three different flavors:
-ubi9)Red Hat UBI images are primarily intended for OLM consumption.
+All container images are signed and include SBOM and provenance attestations, +provided separately for each architecture.
+The CloudNativePG project provides and maintains PostgreSQL operand container
+images, built on top of the official Debian slim base image,
+for both linux/amd64 and linux/arm64 architectures.
Images are published for all Debian supported releases
+(stable,
+oldstable) and for
+PostgreSQL versions supported by PGDG.
+They are distributed via the postgres-containers GitHub Container Registry.
Three image flavors are available, each extending the previous one:
+ +Important
+The system images are deprecated and will be removed once in-core
+Barman Cloud support is phased out. They remain usable for now, but you may
+want to plan a future migration to minimal or standard images with the
+Barman Cloud plugin, or another supported backup solution.
By default, this version of CloudNativePG deploys ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie.
All images are signed and shipped with SBOM and provenance attestations. +Weekly automated builds ensure that critical vulnerabilities (CVEs) are promptly fixed.
+For details and support, see the postgres-containers project.
.spec.backup.barmanObjectStore (deprecated since v1.26).Subscription resource.cert-manager).metrics port 9187) for custom
+ monitoring.cnpg plugin for kubectl to simplify cluster operations.Info
+CloudNativePG does not use StatefulSets for managing data persistence.
+Instead, it directly manages Persistent Volume Claims (PVCs).
+See "Custom Pod Controller" for more details.
Follow the instructions in the "Quickstart" to test +CloudNativePG on a local Kubernetes cluster using Kind, or Minikube.
+In case you are not familiar with some basic terminology on Kubernetes and PostgreSQL, +please consult the "Before you start" section.
+The CloudNativePG documentation is licensed under a Creative Commons +Attribution 4.0 International License.
+Postgres, PostgreSQL, and the Slonik Logo +are trademarks or registered trademarks of the PostgreSQL Community Association +of Canada, and used with their permission.
+CloudNativePG is a +Cloud Native Computing Foundation Sandbox project.
+
The operator can be installed like any other resource in Kubernetes,
+through a YAML manifest applied via kubectl.
You can install the latest operator manifest +for this minor release as follows:
+kubectl apply --server-side -f \
+ https://raw.githubusercontent.com/cloudnative-pg/cloudnative-pg/main/releases/cnpg-1.28.0-rc1.yaml
+
+You can verify that with:
+kubectl rollout status deployment \
+ -n cnpg-system cnpg-controller-manager
+
+cnpg plugin for kubectlYou can use the cnpg plugin to override the default configuration options
+that are in the static manifests.
For example, to generate the default latest manifest but change the watch +namespaces to only be a specific namespace, you could run:
+kubectl cnpg install generate \
+ --watch-namespace "specific-namespace" \
+ > cnpg_for_specific_namespace.yaml
+
+Please refer to "cnpg plugin" documentation
+for a more comprehensive example.
Warning
+If you are deploying CloudNativePG on GKE and get an error (... failed to
+call webhook...), be aware that by default traffic between worker nodes
+and control plane is blocked by the firewall except for a few specific
+ports, as explained in the official
+docs
+and by this
+issue.
+You'll need to either change the targetPort in the webhook service, to be
+one of the allowed ones, or open the webhooks' port (9443) on the
+firewall.
If you want to test or evaluate the latest development snapshot of
+CloudNativePG before the next official patch release, you can download the
+manifests from the
+cloudnative-pg/artifacts
+which provides easy access to the current trunk (main) as well as to each
+supported release.
For example, you can install the latest snapshot of the operator with:
+curl -sSfL \
+ https://raw.githubusercontent.com/cloudnative-pg/artifacts/main/manifests/operator-manifest.yaml | \
+ kubectl apply --server-side -f -
+
+If you are instead looking for the latest snapshot of the operator for this +specific minor release, you can just run:
+curl -sSfL \
+ https://raw.githubusercontent.com/cloudnative-pg/artifacts/release-1.28/manifests/operator-manifest.yaml | \
+ kubectl apply --server-side -f -
+
+Important
+Snapshots are not supported by the CloudNativePG Community, and are not +intended for use in production.
+The operator can be installed using the provided Helm chart.
+CloudNativePG can also be installed via the Operator Lifecycle Manager (OLM) +directly from OperatorHub.io.
+For deployments on Red Hat OpenShift, EDB offers and fully supports a certified +version of CloudNativePG, available through the +Red Hat OpenShift Container Platform.
+In Kubernetes, the operator is by default installed in the cnpg-system
+namespace as a Kubernetes Deployment. The name of this deployment
+depends on the installation method.
+When installed through the manifest or the cnpg plugin, it is called
+cnpg-controller-manager by default. When installed via Helm, the default name
+is cnpg-cloudnative-pg.
Note
+With Helm you can customize the name of the deployment via the
+fullnameOverride field in the "values.yaml" file.
You can get more information using the describe command in kubectl:
$ kubectl get deployments -n cnpg-system
+NAME READY UP-TO-DATE AVAILABLE AGE
+<deployment-name> 1/1 1 1 18m
+
+kubectl describe deploy \
+ -n cnpg-system \
+ <deployment-name>
+
+As with any Deployment, it sits on top of a ReplicaSet and supports rolling +upgrades. The default configuration of the CloudNativePG operator +comes with a Deployment of a single replica, which is suitable for most +installations. In case the node where the pod is running is not reachable +anymore, the pod will be rescheduled on another node.
+If you require high availability at the operator level, it is possible to +specify multiple replicas in the Deployment configuration - given that the +operator supports leader election. Also, you can take advantage of taints and +tolerations to make sure that the operator does not run on the same nodes where +the actual PostgreSQL clusters are running (this might even include the control +plane for self-managed Kubernetes installations).
+Operator configuration
+You can change the default behavior of the operator by overriding +some default options. For more information, please refer to the +"Operator configuration" section.
+Important
+Please carefully read the release notes +before performing an upgrade as some versions might require +extra steps.
+Upgrading CloudNativePG operator is a two-step process:
+Unless differently stated in the release notes, the first step is normally done +by applying the manifest of the newer version for plain Kubernetes +installations, or using the native package manager of the used distribution +(please follow the instructions in the above sections).
+The second step is automatically triggered after updating the controller. By
+default, this initiates a rolling update of every deployed PostgreSQL cluster,
+upgrading one instance at a time to use the new instance manager. The rolling
+update concludes with a switchover, which is governed by the
+primaryUpdateStrategy option. The default value, unsupervised, completes
+the switchover automatically. If set to supervised, the user must manually
+promote the new primary instance using the cnpg plugin for kubectl.
Rolling updates
+This process is discussed in-depth on the Rolling Updates page.
+Important
+In case primaryUpdateStrategy is set to the default value of unsupervised,
+an upgrade of the operator will trigger a switchover on your PostgreSQL cluster,
+causing a (normally negligible) downtime. If your PostgreSQL Cluster has only one
+instance, the instance will be automatically restarted as supervised value is
+not supported for primaryUpdateStrategy. In either case, your applications will
+have to reconnect to PostgreSQL.
The default rolling update behavior can be replaced with in-place updates of +the instance manager. This approach does not require a restart of the +PostgreSQL instance, thereby avoiding a switchover within the cluster. This +feature, which is disabled by default, is described in detail below.
+By default, all PostgreSQL clusters are rolled out simultaneously, which may +lead to a spike in resource usage, especially when managing multiple clusters. +CloudNativePG provides two configuration options at the operator level +that allow you to introduce delays between cluster roll-outs or even between +instances within the same cluster, helping to distribute resource usage over +time:
+CLUSTERS_ROLLOUT_DELAY: Defines the number of seconds to wait between
+ roll-outs of different PostgreSQL clusters (default: 0).INSTANCES_ROLLOUT_DELAY: Defines the number of seconds to wait between
+ roll-outs of individual instances within the same PostgreSQL cluster (default:
+ 0).By default, CloudNativePG issues a rolling update of the cluster +every time the operator is updated. The new instance manager shipped with the +operator is added to each PostgreSQL pod via an init container.
+However, this behavior can be changed via configuration to enable in-place +updates of the instance manager, which is the PID 1 process that keeps the +container alive.
+Internally, each instance manager in CloudNativePG supports the injection of a +new executable that replaces the existing one after successfully completing an +integrity verification phase and gracefully terminating all internal processes. +Upon restarting with the new binary, the instance manager seamlessly adopts the +already running postmaster.
+As a result, the PostgreSQL process is unaffected by the update, refraining +from the need to perform a switchover. The other side of the coin, is that +the Pod is changed after the start, breaking the pure concept of immutability.
+You can enable this feature by setting the ENABLE_INSTANCE_MANAGER_INPLACE_UPDATES
+environment variable to 'true' in the
+operator configuration.
The in-place upgrade process will not change the init container image inside the +Pods. Therefore, the Pod definition will not reflect the current version of the +operator.
+CloudNativePG follows semantic versioning. Every release of the +operator within the same API version is compatible with the previous one. +The current API version is v1, corresponding to versions 1.x.y of the operator.
+In addition to new features, new versions of the operator contain bug fixes and +stability enhancements. Because of this, we strongly encourage users to upgrade +to the latest version of the operator, as each version is released in order to +maintain the most secure and stable Postgres environment.
+CloudNativePG currently releases new versions of the operator at +least monthly. If you are unable to apply updates as each version becomes +available, we recommend upgrading through each version in sequential order to +come current periodically and not skipping versions.
+The release notes page contains a detailed list of the +changes introduced in every released version of CloudNativePG, +and it must be read before upgrading to a newer version of the software.
+Most versions are directly upgradable and in that case, applying the newer +manifest for plain Kubernetes installations or using the native package +manager of the chosen distribution is enough.
+When versions are not directly upgradable, the old version needs to be +removed before installing the new one. This won't affect user data but +only the operator itself.
+ + +Important
+We strongly recommend that all CloudNativePG users upgrade to version +1.27.0, or at least to the latest stable version of your current minor release +(e.g., 1.26.1).
+Version 1.27 introduces a change in the default behavior of the
+liveness probe: it now enforces the
+shutdown of an isolated primary
+within the livenessProbeTimeout (30 seconds).
If this behavior is not suitable for your environment, you can disable the +isolation check in the liveness probe with the following configuration:
+spec:
+ probes:
+ liveness:
+ isolationCheck:
+ enabled: false
+
+Warning
+Due to changes in the startup probe for the manager component
+(#6623),
+upgrading the operator will trigger a restart of your PostgreSQL clusters,
+even if in-place updates are enabled (ENABLE_INSTANCE_MANAGER_INPLACE_UPDATES=true).
+Your applications will need to reconnect to PostgreSQL after the upgrade.
Cluster .statusWith the transition to a backup and recovery agnostic approach based on CNPG-I
+plugins in CloudNativePG, which began with version 1.26.0 for Barman Cloud, we
+are starting the deprecation period for the following fields in the .status
+section of the Cluster resource:
firstRecoverabilityPointfirstRecoverabilityPointByMethodlastSuccessfulBackuplastSuccessfulBackupByMethodlastFailedBackupThe following Prometheus metrics are also deprecated:
+cnpg_collector_first_recoverability_pointcnpg_collector_last_failed_backup_timestampcnpg_collector_last_available_backup_timestampWarning
+If you have migrated to a plugin-based backup and recovery solution such as +Barman Cloud, these fields and metrics are no longer synchronized and will +not be updated. Users still relying on the in-core support for Barman Cloud +and volume snapshots can continue to use these fields for the time being.
+Under the new plugin-based approach, multiple backup methods can operate +simultaneously, each with its own timeline for backup and recovery. For +example, some plugins may provide snapshots without WAL archiving, while others +support continuous archiving.
+Because of this flexibility, maintaining centralized status fields in the
+Cluster resource could be misleading or confusing, as they would not
+accurately represent the state across all configured backup methods.
+For this reason, these fields are being deprecated.
Instead, each plugin is responsible for exposing its own backup status +information and providing metrics back to the instance manager for monitoring +and operational awareness.
+cnpg pluginIn this release, the cnpg plugin for kubectl transitions from an imperative
+to a declarative approach for cluster hibernation.
+The hibernate on and hibernate off commands are now convenient shortcuts
+that apply declarative changes to enable or disable hibernation.
+The hibernate status command has been removed, as its purpose is now
+fulfilled by the standard status command.
CloudNativePG does not rely on an external tool for failover management. +It simply relies on the Kubernetes API server and a native key component called: +the Postgres instance manager.
+The instance manager takes care of the entire lifecycle of the PostgreSQL
+server process (also known as postmaster).
When you create a new cluster, the operator makes a Pod per instance.
+The field .spec.instances specifies how many instances to create.
Each Pod will start the instance manager as the parent process (PID 1) for the +main container, which in turn runs the PostgreSQL instance. During the lifetime +of the Pod, the instance manager acts as a backend to handle the +startup, liveness and readiness probes.
+The startup probe ensures that a PostgreSQL instance, whether a primary or +standby, has fully started.
+Info
+By default, the startup probe uses
+pg_isready.
+However, the behavior can be customized by specifying a different startup
+strategy.
While the startup probe is running, the liveness and readiness probes remain +disabled. Following Kubernetes standards, if the startup probe fails, the +kubelet will terminate the container, which will then be restarted.
+The .spec.startDelay parameter specifies the maximum time, in seconds,
+allowed for the startup probe to succeed.
By default, the startDelay is set to 3600 seconds. It is recommended to
+adjust this setting based on the time PostgreSQL needs to fully initialize in
+your specific environment.
Warning
+Setting .spec.startDelay too low can cause the liveness probe to activate
+prematurely, potentially resulting in unnecessary Pod restarts if PostgreSQL
+hasn’t fully initialized.
CloudNativePG configures the startup probe with the following default parameters:
+failureThreshold: FAILURE_THRESHOLD
+periodSeconds: 10
+successThreshold: 1
+timeoutSeconds: 5
+
+The failureThreshold value is automatically calculated by dividing
+startDelay by periodSeconds.
You can customize any of the probe settings in the .spec.probes.startup
+section of your configuration.
Warning
+Be sure that any custom probe settings are tailored to your cluster's +operational requirements to avoid unintended disruptions.
+Info
+For more details on probe configuration, refer to the +probe API documentation.
+If you manually specify .spec.probes.startup.failureThreshold, it will
+override the default behavior and disable the automatic use of startDelay.
For example, the following configuration explicitly sets custom probe
+parameters, bypassing startDelay:
# ... snip
+spec:
+ probes:
+ startup:
+ periodSeconds: 3
+ timeoutSeconds: 3
+ failureThreshold: 10
+
+In certain scenarios, you may need to customize the startup strategy for your +PostgreSQL cluster. For example, you might delay marking a replica as started +until it begins streaming from the primary or define a replication lag +threshold that must be met before considering the replica ready.
+To accommodate these requirements, CloudNativePG extends the
+.spec.probes.startup stanza with two optional parameters:
type: specifies the criteria for considering the probe successful. Accepted
+ values, in increasing order of complexity/depth, include:
pg_isready: marks the probe as successful when the pg_isready command
+ exits with 0. This is the default for primary instances and replicas.query: marks the probe as successful when a basic query is executed on
+ the postgres database locally.streaming: marks the probe as successful when the replica begins
+ streaming from its source and meets the specified lag requirements (details
+ below).maximumLag: defines the maximum acceptable replication lag, measured in bytes
+ (expressed as Kubernetes quantities). This parameter is only applicable when
+ type is set to streaming. If maximumLag is not specified, the replica is
+ considered successfully started as soon as it begins streaming.
Important
+The .spec.probes.startup.maximumLag option is validated and enforced only
+during the startup phase of the pod, meaning it applies exclusively when the
+replica is starting.
Warning
+Incorrect configuration of the maximumLag option can cause continuous
+failures of the startup probe, leading to repeated replica restarts. Ensure
+you understand how this option works and configure appropriate values for
+failureThreshold and periodSeconds to give the replica enough time to
+catch up with its source.
The following example requires a replica to have a maximum lag of 16Mi from the +source to be considered started:
+# <snip>
+probes:
+ startup:
+ type: streaming
+ maximumLag: 16Mi
+
+The liveness probe begins after the startup probe successfully completes. Its +primary role is to ensure the PostgreSQL instance manager is operating +correctly.
+Following Kubernetes standards, if the liveness probe fails, the kubelet will +terminate the container, which will then be restarted.
+The amount of time before a Pod is classified as not alive is configurable via
+the .spec.livenessProbeTimeout parameter.
CloudNativePG configures the liveness probe with the following default +parameters:
+failureThreshold: FAILURE_THRESHOLD
+periodSeconds: 10
+successThreshold: 1
+timeoutSeconds: 5
+
+The failureThreshold value is automatically calculated by dividing
+livenessProbeTimeout by periodSeconds.
By default, .spec.livenessProbeTimeout is set to 30 seconds. This means the
+liveness probe will report a failure if it detects three consecutive probe
+failures, with a 10-second interval between each check.
You can customize any of the probe settings in the .spec.probes.liveness
+section of your configuration.
Warning
+Be sure that any custom probe settings are tailored to your cluster's +operational requirements to avoid unintended disruptions.
+Info
+For more details on probe configuration, refer to the +probe API documentation.
+If you manually specify .spec.probes.liveness.failureThreshold, it will
+override the default behavior and disable the automatic use of
+livenessProbeTimeout.
For example, the following configuration explicitly sets custom probe
+parameters, bypassing livenessProbeTimeout:
# ... snip
+spec:
+ probes:
+ liveness:
+ periodSeconds: 3
+ timeoutSeconds: 3
+ failureThreshold: 10
+
+CloudNativePG 1.27 introduces an additional behavior for the liveness +probe of a PostgreSQL primary, which will report a failure if both of the +following conditions are met:
+The effect of this behavior is to consider an isolated primary to be not alive and subsequently shut it down when the liveness probe fails.
+It is enabled by default and can be disabled by adding the following:
+spec:
+ probes:
+ liveness:
+ isolationCheck:
+ enabled: false
+
+Important
+Be aware that the default liveness probe settings—automatically derived from livenessProbeTimeout—might
+be aggressive (30 seconds). As such, we recommend explicitly setting the
+liveness probe configuration to suit your environment.
The spec also accepts two optional network settings: requestTimeout
+and connectionTimeout, both defaulting to 1000 (in milliseconds).
+In cloud environments, you may need to increase these values.
+For example:
spec:
+ probes:
+ liveness:
+ isolationCheck:
+ enabled: true
+ requestTimeout: "2000"
+ connectionTimeout: "2000"
+
+The readiness probe starts once the startup probe has successfully completed. +Its primary purpose is to check whether the PostgreSQL instance is ready to +accept traffic and serve requests at any point during the pod's lifecycle.
+Info
+By default, the readiness probe uses
+pg_isready.
+However, the behavior can be customized by specifying a different readiness
+strategy.
Following Kubernetes standards, if the readiness probe fails, the pod will be +marked unready and will not receive traffic from any services. An unready pod +is also ineligible for promotion during automated failover scenarios.
+CloudNativePG uses the following default configuration for the readiness probe:
+failureThreshold: 3
+periodSeconds: 10
+successThreshold: 1
+timeoutSeconds: 5
+
+If the default settings do not suit your requirements, you can fully customize
+the readiness probe by specifying parameters in the .spec.probes.readiness
+stanza. For example:
# ... snip
+spec:
+ probes:
+ readiness:
+ periodSeconds: 3
+ timeoutSeconds: 3
+ failureThreshold: 10
+
+Warning
+Ensure that any custom probe settings are aligned with your cluster’s +operational requirements to prevent unintended disruptions.
+Info
+For more information on configuring probes, see the +probe API.
+In certain scenarios, you may need to customize the readiness strategy for your +cluster. For example, you might delay marking a replica as ready until it +begins streaming from the primary or define a maximum replication lag threshold +before considering the replica ready.
+To accommodate these requirements, CloudNativePG extends the
+.spec.probes.readiness stanza with two optional parameters: type and
+maximumLag. Please refer to the Startup Probe Strategy
+section for detailed information on these options.
Important
+Unlike the startup probe, the .spec.probes.readiness.maximumLag option is
+continuously monitored. A lagging replica may become unready if this setting is
+not appropriately tuned.
Warning
+Incorrect configuration of the maximumLag option can lead to repeated
+readiness probe failures, causing serious consequences, such as:
Recommendation
+Use the streaming and maximumLag options with extreme caution. If
+you're unfamiliar with PostgreSQL replication, rely on the default
+strategy. Seek professional advice if unsure.
The following example requires a replica to have a maximum lag of 64Mi from the +source to be considered ready. It also provides approximately 300 seconds (30 +failures × 10 seconds) for the startup probe to succeed:
+# <snip>
+probes:
+ readiness:
+ type: streaming
+ maximumLag: 64Mi
+ failureThreshold: 30
+ periodSeconds: 10
+
+When a Pod running Postgres is deleted, either manually or by Kubernetes
+following a node drain operation, the kubelet will send a termination signal to the
+instance manager, and the instance manager will take care of shutting down
+PostgreSQL in an appropriate way.
+The .spec.smartShutdownTimeout and .spec.stopDelay options, expressed in seconds,
+control the amount of time given to PostgreSQL to shut down. The values default
+to 180 and 1800 seconds, respectively.
The shutdown procedure is composed of two steps:
+The instance manager first issues a CHECKPOINT, then initiates a smart
+shut down, disallowing any new connection to PostgreSQL. This step will last
+for up to .spec.smartShutdownTimeout seconds.
If PostgreSQL is still up, the instance manager requests a fast
+shut down, terminating any existing connection and exiting promptly.
+If the instance is archiving and/or streaming WAL files, the process
+will wait for up to the remaining time set in .spec.stopDelay to complete the
+operation and then forcibly shut down. Such a timeout needs to be at least 15
+seconds.
Important
+In order to avoid any data loss in the Postgres cluster, which impacts +the database RPO, don't delete the Pod where +the primary instance is running. In this case, perform a switchover to +another instance first.
+During a switchover, the shutdown procedure slightly differs from the general
+case. The instance manager of the former primary first issues a CHECKPOINT,
+then initiates a fast shutdown of PostgreSQL before the designated new
+primary is promoted, ensuring that all data are safely available on the new
+primary.
For this reason, the .spec.switchoverDelay, expressed in seconds, controls
+the time given to the former primary to shut down gracefully and archive all
+the WAL files. By default it is set to 3600 (1 hour).
Warning
+The .spec.switchoverDelay option affects the RPO
+and RTO of your PostgreSQL database. Setting it to
+a low value, might favor RTO over RPO but lead to data loss at cluster level
+and/or backup level. On the contrary, setting it to a high value, might remove
+the risk of data loss while leaving the cluster without an active primary for a
+longer time during the switchover.
In case of primary pod failure, the cluster will go into failover mode. +Please refer to the "Failover" section for details.
+Storage exhaustion is a well known issue for PostgreSQL clusters. +The PostgreSQL documentation +highlights the possible failure scenarios and the importance of monitoring disk +usage to prevent it from becoming full.
+The same applies to CloudNativePG and Kubernetes as well: the +"Monitoring" section +provides details on checking the disk space used by WAL segments and standard +metrics on disk usage exported to Prometheus.
+Important
+In a production system, it is critical to monitor the database +continuously. Exhausted disk storage can lead to a database server shutdown.
+Note
+The detection of exhausted storage relies on a storage class that
+accurately reports disk size and usage. This may not be the case in simulated
+Kubernetes environments like Kind or with test storage class implementations
+such as csi-driver-host-path.
If the disk containing the WALs becomes full and no more WAL segments can be +stored, PostgreSQL will stop working. CloudNativePG correctly detects this issue +by verifying that there is enough space to store the next WAL segment, +and avoids triggering a failover, which could complicate recovery.
+That allows a human administrator to address the root cause.
+In such a case, if supported by the storage class, the quickest course of action +is currently to:
+Cluster resource to the same valueOnce the issue is resolved and there is sufficient free space for WAL segments, +the Pod will restart and the cluster will become healthy.
+See also the "Volume expansion" section of the +documentation.
+ +CloudNativePG provides a plugin for kubectl to manage a cluster in Kubernetes.
You can install the cnpg plugin using a variety of methods.
Note
+For air-gapped systems, installation via package managers, using previously +downloaded files, may be a good option.
+curl -sSfL \
+ https://github.com/cloudnative-pg/cloudnative-pg/raw/main/hack/install-cnpg-plugin.sh | \
+ sudo sh -s -- -b /usr/local/bin
+
+In the +releases section of the GitHub repository, +you can navigate to any release of interest (pick the same or newer release +than your CloudNativePG operator), and in it you will find an Assets +section. In that section are pre-built packages for a variety of systems. +As a result, you can follow standard practices and instructions to install +them in your systems.
+For example, let's install the 1.28.0-rc1 release of the plugin, for an Intel based
+64 bit server. First, we download the right .deb file.
wget https://github.com/cloudnative-pg/cloudnative-pg/releases/download/v1.28.0-rc1/kubectl-cnpg_1.28.0-rc1_linux_x86_64.deb \
+ --output-document kube-plugin.deb
+
+Then, with superuser privileges, install from the local file using dpkg:
$ sudo dpkg -i kube-plugin.deb
+Selecting previously unselected package cnpg.
+(Reading database ... 6688 files and directories currently installed.)
+Preparing to unpack kube-plugin.deb ...
+Unpacking cnpg (1.28.0-rc1) ...
+Setting up cnpg (1.28.0-rc1) ...
+
+As in the example for .rpm packages, let's install the 1.28.0-rc1 release for an
+Intel 64 bit machine. Note the --output flag to provide a file name.
curl -L https://github.com/cloudnative-pg/cloudnative-pg/releases/download/v1.28.0-rc1/kubectl-cnpg_1.28.0-rc1_linux_x86_64.rpm \
+ --output kube-plugin.rpm
+
+Then, with superuser privileges, install with yum, and you're ready to use:
$ sudo yum --disablerepo=* localinstall kube-plugin.rpm
+Failed to set locale, defaulting to C.UTF-8
+Dependencies resolved.
+====================================================================================================
+ Package Architecture Version Repository Size
+====================================================================================================
+Installing:
+ cnpg x86_64 1.28.0-rc1 @commandline 20 M
+
+Transaction Summary
+====================================================================================================
+Install 1 Package
+
+Total size: 20 M
+Installed size: 78 M
+Is this ok [y/N]: y
+
+To install the plugin from the AUR, follow these steps:
+git clone https://aur.archlinux.org/kubectl-cnpg.git
+cd kubectl-cnpg
+makepkg -si
+
+Or use your favorite AUR-helper, for example paru:
+paru -S kubectl-cnpg
+
+If you already have Krew installed, you can simply +run:
+kubectl krew install cnpg
+
+When a new version of the plugin is released, you can update the existing +installation with:
+kubectl krew update
+kubectl krew upgrade cnpg
+
+Note
+Please note that the Homebrew community manages the availability of the kubectl-cnpg plugin on Homebrew.
+If you already have Homebrew installed, you can simply +run:
+brew install kubectl-cnpg
+
+When a new version of the plugin is released, you can update the existing +installation with:
+brew update
+brew upgrade kubectl-cnpg
+
+Note
+Auto-completion for the kubectl plugin is already managed by Homebrew.
+There's no need to create the kubectl_complete-cnpg script mentioned below.
CloudNativePG Plugin is currently built for the following +operating system and architectures:
+To configure auto-completion for the plugin, a helper shell script needs to be
+installed into your current PATH. Assuming the latter contains /usr/local/bin,
+this can be done with the following commands:
cat > kubectl_complete-cnpg <<EOF
+#!/usr/bin/env sh
+
+# Call the __complete command passing it all arguments
+kubectl cnpg __complete "\$@"
+EOF
+
+chmod +x kubectl_complete-cnpg
+
+# Important: the following command may require superuser permission
+sudo mv kubectl_complete-cnpg /usr/local/bin
+
+Important
+The name of the script needs to be exactly the one provided since it's used by the kubectl auto-complete process
+Once the plugin is installed and deployed, you can start using it like this:
+kubectl cnpg COMMAND [ARGS...]
+
+Note
+The plugin automatically detects if the standard output channel is connected to a terminal.
+In such cases, it may add ANSI colors to the command output. To disable colors, use the
+--color=never option with the command.
The cnpg plugin can be used to generate the YAML manifest for the
+installation of the operator. This option would typically be used if you want
+to override some default configurations such as number of replicas,
+installation namespace, namespaces to watch, and so on.
For details and available options, run:
+kubectl cnpg install generate --help
+
+The main options are:
+-n: specifies the namespace in which to install the operator (default:
+ cnpg-system).--control-plane: if set to true, the operator deployment will include a
+ toleration and affinity for node-role.kubernetes.io/control-plane.--replicas: sets the number of replicas in the deployment.--watch-namespace: specifies a comma-separated list of namespaces to watch
+ (default: all namespaces).--version: defines the minor version of the operator to be installed, such
+ as 1.23. If a minor version is specified, the plugin installs the latest
+ patch version of that minor version. If no version is supplied, the plugin
+ installs the latest MAJOR.MINOR.PATCH version of the operator.An example of the generate command, which will generate a YAML manifest that
+will install the operator, is as follows:
kubectl cnpg install generate \
+ -n king \
+ --version 1.23 \
+ --replicas 3 \
+ --watch-namespace "albert, bb, freddie" \
+ > operator.yaml
+
+The flags in the above command have the following meaning:
+- -n king install the CNPG operator into the king namespace
+- --version 1.23 install the latest patch version for minor version 1.23
+- --replicas 3 install the operator with 3 replicas
+- --watch-namespace "albert, bb, freddie" have the operator watch for
+ changes in the albert, bb and freddie namespaces only
The status command provides an overview of the current status of your
+cluster, including:
pg_stat_archiver view from the primary - or designated primary in the
+ case of a replica clusterpg_stat_replication
+ view on the primary instanceCurrent LSN field corresponds
+ to the latest write-ahead log location that has been replayed during recovery
+ (replay LSN).Important
+The status information above is taken at different times and at different
+locations, resulting in slightly inconsistent returned values. For example,
+the Current Write LSN location in the main header, might be different
+from the Current LSN field in the instances status as it is taken at
+two different time intervals.
kubectl cnpg status sandbox
+
+Cluster Summary
+Name: default/sandbox
+System ID: 7423474350493388827
+PostgreSQL Image: ghcr.io/cloudnative-pg/postgresql:16.4
+Primary instance: sandbox-1
+Primary start time: 2024-10-08 18:31:57 +0000 UTC (uptime 1m14s)
+Status: Cluster in healthy state
+Instances: 3
+Ready instances: 3
+Size: 126M
+Current Write LSN: 0/604DE38 (Timeline: 1 - WAL File: 000000010000000000000006)
+
+Continuous Backup status
+Not configured
+
+Streaming Replication status
+Replication Slots Enabled
+Name Sent LSN Write LSN Flush LSN Replay LSN Write Lag Flush Lag Replay Lag State Sync State Sync Priority Replication Slot
+---- -------- --------- --------- ---------- --------- --------- ---------- ----- ---------- ------------- ----------------
+sandbox-2 0/604DE38 0/604DE38 0/604DE38 0/604DE38 00:00:00 00:00:00 00:00:00 streaming async 0 active
+sandbox-3 0/604DE38 0/604DE38 0/604DE38 0/604DE38 00:00:00 00:00:00 00:00:00 streaming async 0 active
+
+Instances status
+Name Current LSN Replication role Status QoS Manager Version Node
+---- ----------- ---------------- ------ --- --------------- ----
+sandbox-1 0/604DE38 Primary OK BestEffort 1.28.0-rc1 k8s-eu-worker
+sandbox-2 0/604DE38 Standby (async) OK BestEffort 1.28.0-rc1 k8s-eu-worker2
+sandbox-3 0/604DE38 Standby (async) OK BestEffort 1.28.0-rc1 k8s-eu-worker
+
+If you require more detailed status information, use the --verbose option (or
+-v for short). The level of detail increases each time the flag is repeated:
kubectl cnpg status sandbox --verbose
+
+Cluster Summary
+Name: default/sandbox
+System ID: 7423474350493388827
+PostgreSQL Image: ghcr.io/cloudnative-pg/postgresql:16.4
+Primary instance: sandbox-1
+Primary start time: 2024-10-08 18:31:57 +0000 UTC (uptime 2m4s)
+Status: Cluster in healthy state
+Instances: 3
+Ready instances: 3
+Size: 126M
+Current Write LSN: 0/6053720 (Timeline: 1 - WAL File: 000000010000000000000006)
+
+Continuous Backup status
+Not configured
+
+Physical backups
+No running physical backups found
+
+Streaming Replication status
+Replication Slots Enabled
+Name Sent LSN Write LSN Flush LSN Replay LSN Write Lag Flush Lag Replay Lag State Sync State Sync Priority Replication Slot Slot Restart LSN Slot WAL Status Slot Safe WAL Size
+---- -------- --------- --------- ---------- --------- --------- ---------- ----- ---------- ------------- ---------------- ---------------- --------------- ------------------
+sandbox-2 0/6053720 0/6053720 0/6053720 0/6053720 00:00:00 00:00:00 00:00:00 streaming async 0 active 0/6053720 reserved NULL
+sandbox-3 0/6053720 0/6053720 0/6053720 0/6053720 00:00:00 00:00:00 00:00:00 streaming async 0 active 0/6053720 reserved NULL
+
+Unmanaged Replication Slot Status
+No unmanaged replication slots found
+
+Managed roles status
+No roles managed
+
+Tablespaces status
+No managed tablespaces
+
+Pod Disruption Budgets status
+Name Role Expected Pods Current Healthy Minimum Desired Healthy Disruptions Allowed
+---- ---- ------------- --------------- ----------------------- -------------------
+sandbox replica 2 2 1 1
+sandbox-primary primary 1 1 1 0
+
+Instances status
+Name Current LSN Replication role Status QoS Manager Version Node
+---- ----------- ---------------- ------ --- --------------- ----
+sandbox-1 0/6053720 Primary OK BestEffort 1.28.0-rc1 k8s-eu-worker
+sandbox-2 0/6053720 Standby (async) OK BestEffort 1.28.0-rc1 k8s-eu-worker2
+sandbox-3 0/6053720 Standby (async) OK BestEffort 1.28.0-rc1 k8s-eu-worker
+
+With an additional -v (e.g. kubectl cnpg status sandbox -v -v), you can
+also view PostgreSQL configuration, HBA settings, and certificates.
The command also supports output in yaml and json format.
The meaning of this command is to promote a pod in the cluster to primary, so you
+can start with maintenance work or test a switch-over situation in your cluster:
kubectl cnpg promote CLUSTER CLUSTER-INSTANCE
+
+Or you can use the instance node number to promote:
+kubectl cnpg promote CLUSTER INSTANCE
+
+Clusters created using the CloudNativePG operator work with a CA to sign +a TLS authentication certificate.
+To get a certificate, you need to provide a name for the secret to store +the credentials, the cluster name, and a user for this certificate:
+kubectl cnpg certificate cluster-cert --cnpg-cluster CLUSTER --cnpg-user USER
+
+After the secret it's created, you can get it using kubectl:
kubectl get secret cluster-cert
+
+And the content of the same in plain text using the following commands:
+kubectl get secret cluster-cert -o json | jq -r '.data | map(@base64d) | .[]'
+
+The kubectl cnpg restart command can be used in two cases:
requesting the operator to orchestrate a rollout restart
+ for a certain cluster. This is useful to apply
+ configuration changes to cluster dependent objects, such as ConfigMaps
+ containing custom monitoring queries.
request a single instance restart, either in-place if the instance is + the cluster's primary or deleting and recreating the pod if + it is a replica.
+# this command will restart a whole cluster in a rollout fashion
+kubectl cnpg restart CLUSTER
+
+# this command will restart a single instance, according to the policy above
+kubectl cnpg restart CLUSTER INSTANCE
+
+If the in-place restart is requested but the change cannot be applied without +a switchover, the switchover will take precedence over the in-place restart. A +common case for this will be a minor upgrade of PostgreSQL image.
+Note
+If you want ConfigMaps and Secrets to be automatically reloaded
+by instances, you can add a label with key cnpg.io/reload to it.
The kubectl cnpg reload command requests the operator to trigger a reconciliation
+loop for a certain cluster. This is useful to apply configuration changes
+to cluster dependent objects, such as ConfigMaps containing custom monitoring queries.
The following command will reload all configurations for a given cluster:
+kubectl cnpg reload CLUSTER
+
+The kubectl cnpg maintenance command helps to modify one or more clusters
+across namespaces and set the maintenance window values, it will change
+the following fields:
.spec.nodeMaintenanceWindow.inProgress.spec.nodeMaintenanceWindow.reusePVCAccepts as argument set and unset using this to set the
+inProgress to true in case setand to false in case of unset.
By default, reusePVC is always set to false unless the --reusePVC flag is passed.
The plugin will ask for a confirmation with a list of the cluster to modify +and their new values, if this is accepted this action will be applied to +all the cluster in the list.
+If you want to set in maintenance all the PostgreSQL in your Kubernetes cluster, +just need to write the following command:
+kubectl cnpg maintenance set --all-namespaces
+
+And you'll have the list of all the cluster to update
+The following are the new values for the clusters
+Namespace Cluster Name Maintenance reusePVC
+--------- ------------ ----------- --------
+default cluster-example true false
+default pg-backup true false
+test cluster-example true false
+Do you want to proceed? [y/n]: y
+
+The kubectl cnpg report command bundles various pieces
+of information into a ZIP file.
+It aims to provide the needed context to debug problems
+with clusters in production.
It has two sub-commands: operator and cluster.
The operator sub-command requests the operator to provide information
+regarding the operator deployment, configuration and events.
Important
+All confidential information in Secrets and ConfigMaps is REDACTED.
+The Data map will show the keys but the values will be empty.
+The flag -S / --stopRedaction will defeat the redaction and show the
+values. Use only at your own risk, this will share private data.
Note
+By default, operator logs are not collected, but you can enable operator
+log collection with the --logs flag
Least-Privilege Support
+The report operator command works with minimal permissions. Only the operator
+deployment is required - all other resources are optional and collected on a
+best-effort basis. If you lack permissions for certain resources (e.g., webhooks,
+OLM resources), warnings are logged and report generation continues with available
+data.
The report includes:
+DeploymentPod informationSecrets and ConfigMaps in the operator
+ namespaceEvents in the operator namespacePod logs in JSON-lines format (requires
+ --logs flag)Minimal permissions
+Read access (get) to the operator deployment in the operator namespace.
+This allows namespace-scoped users to generate basic troubleshooting reports.
Recommended permissions for full report
+Add list on pods, events; get on secrets, configmaps, services
+(namespace-scoped); and list on webhook configurations (cluster-scoped).
A user with only namespace-scoped permissions can still generate useful +reports:
+# With only deployment read access
+kubectl cnpg report operator -n cnpg-system -f report.zip
+
+Info
+The command will log warnings for inaccessible resources but successfully +generate a report with the deployment manifest, which is often sufficient for +basic troubleshooting.
+The command will generate a ZIP file containing various manifest in YAML format
+(by default, but settable to JSON with the -o flag). Use the -f flag to
+name a result file explicitly. If the -f flag is not used, a default
+time-stamped filename is created for the zip file.
Note
+The report plugin obeys kubectl conventions, and will look for objects
+constrained by namespace. The CNPG Operator will generally not be installed in
+the same namespace as the clusters. E.g. the default installation namespace is
+cnpg-system.
kubectl cnpg report operator -n cnpg-system
+
+results in
+Successfully written report to "report_operator_<TIMESTAMP>.zip" (format: "yaml")
+
+With the -f flag set:
kubectl cnpg report operator -n cnpg-system -f reportRedacted.zip
+
+Unzipping the file will produce a time-stamped top-level folder to keep the +directory tidy:
+unzip reportRedacted.zip
+
+will result in:
+Archive: reportRedacted.zip
+ creating: report_operator_<TIMESTAMP>/
+ creating: report_operator_<TIMESTAMP>/manifests/
+ inflating: report_operator_<TIMESTAMP>/manifests/deployment.yaml
+ inflating: report_operator_<TIMESTAMP>/manifests/operator-pod.yaml
+ inflating: report_operator_<TIMESTAMP>/manifests/events.yaml
+ inflating: report_operator_<TIMESTAMP>/manifests/validating-webhook-configuration.yaml
+ inflating: report_operator_<TIMESTAMP>/manifests/mutating-webhook-configuration.yaml
+ inflating: report_operator_<TIMESTAMP>/manifests/webhook-service.yaml
+ inflating: report_operator_<TIMESTAMP>/manifests/cnpg-ca-secret(secret).yaml
+ inflating: report_operator_<TIMESTAMP>/manifests/cnpg-webhook-cert(secret).yaml
+
+If you activated the --logs option, you'd see an extra subdirectory:
Archive: report_operator_<TIMESTAMP>.zip
+ <snipped …>
+ creating: report_operator_<TIMESTAMP>/operator-logs/
+ inflating: report_operator_<TIMESTAMP>/operator-logs/cnpg-controller-manager-66fb98dbc5-pxkmh-logs.jsonl
+
+Note
+The plugin will try to get the PREVIOUS operator's logs, which is helpful +when investigating restarted operators. +In all cases, it will also try to get the CURRENT operator logs. If current +and previous logs are available, it will show them both.
+====== Beginning of Previous Log =====
+2023-03-28T12:56:41.251711811Z {"level":"info","ts":"2023-03-28T12:56:41Z","logger":"setup","msg":"Starting CloudNativePG Operator","version":"1.28.0-rc1","build":{"Version":"1.28.0-rc1+dev107","Commit":"cc9bab17","Date":"2023-03-28"}}
+2023-03-28T12:56:41.251851909Z {"level":"info","ts":"2023-03-28T12:56:41Z","logger":"setup","msg":"Starting pprof HTTP server","addr":"0.0.0.0:6060"}
+ <snipped …>
+
+====== End of Previous Log =====
+2023-03-28T12:57:09.854306024Z {"level":"info","ts":"2023-03-28T12:57:09Z","logger":"setup","msg":"Starting CloudNativePG Operator","version":"1.28.0-rc1","build":{"Version":"1.28.0-rc1+dev107","Commit":"cc9bab17","Date":"2023-03-28"}}
+2023-03-28T12:57:09.854363943Z {"level":"info","ts":"2023-03-28T12:57:09Z","logger":"setup","msg":"Starting pprof HTTP server","addr":"0.0.0.0:6060"}
+
+If the operator hasn't been restarted, you'll still see the ====== Begin …
+and ====== End … guards, with no content inside.
You can verify that the confidential information is REDACTED by default:
+cd report_operator_<TIMESTAMP>/manifests/
+head cnpg-ca-secret\(secret\).yaml
+
+data:
+ ca.crt: ""
+ ca.key: ""
+metadata:
+ creationTimestamp: "2022-03-22T10:42:28Z"
+ managedFields:
+ - apiVersion: v1
+ fieldsType: FieldsV1
+ fieldsV1:
+
+With the -S (--stopRedaction) option activated, secrets are shown:
kubectl cnpg report operator -n cnpg-system -f reportNonRedacted.zip -S
+
+You'll get a reminder that you're about to view confidential information:
+WARNING: secret Redaction is OFF. Use it with caution
+Successfully written report to "reportNonRedacted.zip" (format: "yaml")
+
+unzip reportNonRedacted.zip
+head cnpg-ca-secret\(secret\).yaml
+
+data:
+ ca.crt: LS0tLS1CRUdJTiBD…
+ ca.key: LS0tLS1CRUdJTiBF…
+metadata:
+ creationTimestamp: "2022-03-22T10:42:28Z"
+ managedFields:
+ - apiVersion: v1
+ fieldsType: FieldsV1
+
+The cluster sub-command gathers the following:
kubectl get cluster -o yamlThe cluster sub-command accepts the -f and -o flags, as the operator does.
+If the -f flag is not used, a default timestamped report name will be used.
+Note that the cluster information does not contain configuration Secrets / ConfigMaps,
+so the -S is disabled.
Note
+By default, cluster logs are not collected, but you can enable cluster
+log collection with the --logs flag
Usage:
+kubectl cnpg report cluster CLUSTER [flags]
+
+Note that, unlike the operator sub-command, for the cluster sub-command you
+need to provide the cluster name, and very likely the namespace, unless the cluster
+is in the default one.
kubectl cnpg report cluster CLUSTER -f report.zip [-n NAMESPACE]
+
+and then:
+unzip report.zip
+
+Archive: report.zip
+ creating: report_cluster_example_<TIMESTAMP>/
+ creating: report_cluster_example_<TIMESTAMP>/manifests/
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/cluster.yaml
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/cluster-pods.yaml
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/cluster-jobs.yaml
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/events.yaml
+
+Remember that you can use the --logs flag to add the pod and job logs to the ZIP.
kubectl cnpg report cluster CLUSTER [-n NAMESPACE] --logs
+
+will result in:
+Successfully written report to "report_cluster_example_<TIMESTAMP>.zip" (format: "yaml")
+
+unzip report_cluster_<TIMESTAMP>.zip
+
+Archive: report_cluster_example_<TIMESTAMP>.zip
+ creating: report_cluster_example_<TIMESTAMP>/
+ creating: report_cluster_example_<TIMESTAMP>/manifests/
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/cluster.yaml
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/cluster-pods.yaml
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/cluster-jobs.yaml
+ inflating: report_cluster_example_<TIMESTAMP>/manifests/events.yaml
+ creating: report_cluster_example_<TIMESTAMP>/logs/
+ inflating: report_cluster_example_<TIMESTAMP>/logs/cluster-example-full-1.jsonl
+ creating: report_cluster_example_<TIMESTAMP>/job-logs/
+ inflating: report_cluster_example_<TIMESTAMP>/job-logs/cluster-example-full-1-initdb-qnnvw.jsonl
+ inflating: report_cluster_example_<TIMESTAMP>/job-logs/cluster-example-full-2-join-tvj8r.jsonl
+
+The kubectl cnpg logs command allows to follow the logs of a collection
+of pods related to CloudNativePG in a single go.
It has at the moment one available sub-command: cluster.
The cluster sub-command gathers all the pod logs for a cluster in a single
+stream or file.
+This means that you can get all the pod logs in a single terminal window, with a
+single invocation of the command.
As in all the cnpg plugin sub-commands, you can get instructions and help with
+the -h flag:
kubectl cnpg logs cluster -h
The logs command will display logs in JSON-lines format, unless the
+--timestamps flag is used, in which case, a human-readable timestamp will be
+prepended to each line. In this case, lines will no longer be valid JSON,
+and tools such as jq may not work as desired.
If the logs cluster sub-command is given the -f flag (aka --follow), it
+will follow the cluster pod logs, and will also watch for any new pods created
+in the cluster after the command has been invoked.
+Any new pods found, including pods that have been restarted or re-created,
+will also have their pods followed.
+The logs will be displayed in the terminal's standard-out.
+This command will only exit when the cluster has no more pods left, or when it
+is interrupted by the user.
If logs is called without the -f option, it will read the logs from all
+cluster pods until the time of invocation and display them in the terminal's
+standard-out, then exit.
+The -o or --output flag can be provided, to specify the name
+of the file where the logs should be saved, instead of displaying over
+standard-out.
+The --tail flag can be used to specify how many log lines will be retrieved
+from each pod in the cluster. By default, the logs cluster sub-command will
+display all the logs from each pod in the cluster. If combined with the "follow"
+flag -f, the number of logs specified by --tail will be retrieved until the
+current time, and from then the new logs will be followed.
NOTE: unlike other cnpg plugin commands, the -f is used to denote "follow"
+rather than specify a file. This keeps with the convention of kubectl logs,
+which takes -f to mean the logs should be followed.
Usage:
+kubectl cnpg logs cluster CLUSTER [flags]
+
+Using the -f option to follow:
kubectl cnpg report cluster CLUSTER -f
+
+Using --tail option to display 3 lines from each pod and the -f option
+to follow:
kubectl cnpg report cluster CLUSTER -f --tail 3
+
+{"level":"info","ts":"2023-06-30T13:37:33Z","logger":"postgres","msg":"2023-06-30 13:37:33.142 UTC [26] LOG: ending log output to stderr","source":"/controller/log/postgres","logging_pod":"cluster-example-3"}
+{"level":"info","ts":"2023-06-30T13:37:33Z","logger":"postgres","msg":"2023-06-30 13:37:33.142 UTC [26] HINT: Future log output will go to log destination \"csvlog\".","source":"/controller/log/postgres","logging_pod":"cluster-example-3"}
+…
+…
+
+With the -o option omitted, and with --output specified:
$ kubectl cnpg logs cluster CLUSTER --output my-cluster.log
+
+Successfully written logs to "my-cluster.log"
+
+The pretty sub-command reads a log stream from standard input, formats it
+into a human-readable output, and attempts to sort the entries by timestamp.
It can be used in combination with kubectl cnpg logs cluster, as
+shown in the following example:
$ kubectl cnpg logs cluster cluster-example | kubectl cnpg logs pretty
+2024-10-15T17:35:00.336 INFO cluster-example-1 instance-manager Starting CloudNativePG Instance Manager
+2024-10-15T17:35:00.336 INFO cluster-example-1 instance-manager Checking for free disk space for WALs before starting PostgreSQL
+2024-10-15T17:35:00.347 INFO cluster-example-1 instance-manager starting tablespace manager
+2024-10-15T17:35:00.347 INFO cluster-example-1 instance-manager starting external server manager
+[...]
+
+Alternatively, it can be used in combination with other commands that produce
+CNPG logs in JSON format, such as stern, or kubectl logs, as in the
+following example:
$ kubectl logs cluster-example-1 | kubectl cnpg logs pretty
+2024-10-15T17:35:00.336 INFO cluster-example-1 instance-manager Starting CloudNativePG Instance Manager
+2024-10-15T17:35:00.336 INFO cluster-example-1 instance-manager Checking for free disk space for WALs before starting PostgreSQL
+2024-10-15T17:35:00.347 INFO cluster-example-1 instance-manager starting tablespace manager
+2024-10-15T17:35:00.347 INFO cluster-example-1 instance-manager starting external server manager
+[...]
+
+The pretty sub-command also supports advanced log filtering, allowing users
+to display logs for specific pods or loggers, or to filter logs by severity
+level.
+Here's an example:
$ kubectl cnpg logs cluster cluster-example | kubectl cnpg logs pretty --pods cluster-example-1 --loggers postgres --log-level info
+2024-10-15T17:35:00.509 INFO cluster-example-1 postgres 2024-10-15 17:35:00.509 UTC [29] LOG: redirecting log output to logging collector process
+2024-10-15T17:35:00.509 INFO cluster-example-1 postgres 2024-10-15 17:35:00.509 UTC [29] HINT: Future log output will appear in directory "/controller/log"...
+2024-10-15T17:35:00.510 INFO cluster-example-1 postgres 2024-10-15 17:35:00.509 UTC [29] LOG: ending log output to stderr
+2024-10-15T17:35:00.510 INFO cluster-example-1 postgres ending log output to stderr
+[...]
+
+The pretty sub-command will try to sort the log stream,
+to make logs easier to reason about. In order to achieve this, it gathers the
+logs into groups, and within groups it sorts by timestamp. This is the only
+way to sort interactively, as pretty may be piped from a command in "follow"
+mode. The sub-command will add a group separator line, ---, at the end of
+each sorted group. The size of the grouping can be configured via the
+--sorting-group-size flag (default: 1000), as illustrated in the following example:
$ kubectl cnpg logs cluster cluster-example | kubectl cnpg logs pretty --sorting-group-size=3
+2024-10-15T17:35:20.426 INFO cluster-example-2 instance-manager Starting CloudNativePG Instance Manager
+2024-10-15T17:35:20.426 INFO cluster-example-2 instance-manager Checking for free disk space for WALs before starting PostgreSQL
+2024-10-15T17:35:20.438 INFO cluster-example-2 instance-manager starting tablespace manager
+---
+2024-10-15T17:35:20.438 INFO cluster-example-2 instance-manager starting external server manager
+2024-10-15T17:35:20.438 INFO cluster-example-2 instance-manager starting controller-runtime manager
+2024-10-15T17:35:20.439 INFO cluster-example-2 instance-manager Starting EventSource
+---
+[...]
+
+To explore all available options, use the -h flag for detailed explanations
+of the supported flags and their usage.
Info
+You can also increase the verbosity of the log by adding more -v options.
The kubectl cnpg destroy command helps remove an instance and all the
+associated PVCs from a Kubernetes cluster.
The optional --keep-pvc flag, if specified, allows you to keep the PVCs,
+while removing all metadata.ownerReferences that were set by the instance.
+Additionally, the cnpg.io/pvcStatus label on the PVCs will change from
+ready to detached to signify that they are no longer in use.
Running again the command without the --keep-pvc flag will remove the
+detached PVCs.
Usage:
+kubectl cnpg destroy CLUSTER INSTANCE
+
+The following example removes the cluster-example-2 pod and the associated
+PVCs:
kubectl cnpg destroy cluster-example 2
+
+There are times when you may need to temporarily suspend a CloudNativePG
+Cluster while preserving its data, allowing you to resume operations later.
+This feature is known as cluster hibernation.
Hibernation is managed declaratively using the cnpg.io/hibernation
+annotation.
Info
+For more details, see the "Declarative Hibernation" +documentation page.
+To simplify the process, the cnpg plugin for kubectl provides a hibernate
+command, which acts as a convenient shortcut for applying the annotation.
To hibernate a cluster, run:
+kubectl cnpg hibernate on CLUSTER
+
+This command applies the cnpg.io/hibernation=on annotation to the cluster,
+suspending its execution.
To resume a hibernated cluster, use:
+kubectl cnpg hibernate off CLUSTER
+
+This will remove the hibernation state by setting cnpg.io/hibernation=off.
You can check the cluster’s status at any time with:
+kubectl cnpg status CLUSTER
+
+This will display the current state of the cluster, including whether it is +hibernated.
+Pgbench can be run against an existing PostgreSQL cluster with following +command:
+kubectl cnpg pgbench CLUSTER -- --time 30 --client 1 --jobs 1
+
+Refer to the Benchmarking pgbench section for more +details.
+fio can be run on an existing storage class with following command:
kubectl cnpg fio FIO_JOB_NAME [-n NAMESPACE]
+
+Refer to the Benchmarking fio section for more details.
+The kubectl cnpg backup command requests a new physical backup for
+an existing Postgres cluster by creating a new Backup resource.
The following example requests an on-demand backup for a given cluster:
+kubectl cnpg backup CLUSTER
+
+or, if using volume snapshots:
+kubectl cnpg backup CLUSTER -m volumeSnapshot
+
+The created backup will be named after the request time:
+$ kubectl cnpg backup cluster-example
+backup/cluster-example-20230121002300 created
+
+By default, a newly created backup will use the backup target policy defined
+in the cluster to choose which instance to run on.
+However, you can override this policy with the --backup-target option.
In the case of volume snapshot backups, you can also use the --online option
+to request an online/hot backup or an offline/cold one: additionally, you can
+also tune online backups by explicitly setting the --immediate-checkpoint and
+--wait-for-archive options.
The "Backup" section contains more information about +the configuration settings.
+The kubectl cnpg psql CLUSTER command starts a new PostgreSQL interactive front-end
+process (psql) connected to an existing Postgres cluster, as if you were running
+it from the actual pod. This means that you will be using the postgres user.
Important
+As you will be connecting as postgres user, in production environments this
+method should be used with extreme care, by authorized personnel only.
$ kubectl cnpg psql cluster-example
+
+psql (18.0 (Debian 18.0-1.pgdg110+1))
+Type "help" for help.
+
+postgres=#
+
+By default, the command will connect to the primary instance. The user can
+select to work against a replica by using the --replica option:
$ kubectl cnpg psql --replica cluster-example
+
+psql (18.0 (Debian 18.0-1.pgdg110+1))
+
+Type "help" for help.
+
+postgres=# select pg_is_in_recovery();
+ pg_is_in_recovery
+-------------------
+ t
+(1 row)
+
+postgres=# \q
+
+This command will start kubectl exec, and the kubectl executable must be
+reachable in your PATH variable to correctly work.
Warning
+The kubectl cnpg snapshot command has been removed.
+Please use the backup command to request
+backups using volume snapshots.
pgAdmin stands as the most popular and feature-rich +open-source administration and development platform for PostgreSQL. +For more information on the project, please refer to the official +documentation.
+Given that the pgAdmin Development Team maintains official Docker container +images, you can install pgAdmin in your environment as a standard +Kubernetes deployment.
+Important
+Deployment of pgAdmin in Kubernetes production environments is beyond the +scope of this document and, more broadly, of the CloudNativePG project.
+However, for the purposes of demonstration and evaluation, CloudNativePG
+offers a suitable solution. The cnpg plugin implements the pgadmin4
+command, providing a straightforward method to connect to a given database
+Cluster and navigate its content in a local environment such as kind.
For example, you can install a demo deployment of pgAdmin4 for the
+cluster-example cluster as follows:
kubectl cnpg pgadmin4 cluster-example
+
+This command will produce:
+ConfigMap/cluster-example-pgadmin4 created
+Deployment/cluster-example-pgadmin4 created
+Service/cluster-example-pgadmin4 created
+Secret/cluster-example-pgadmin4 created
+
+[...]
+
+After deploying pgAdmin, forward the port using kubectl and connect +through your browser by following the on-screen instructions.
+
As usual, you can use the --dry-run option to generate the YAML file:
kubectl cnpg pgadmin4 --dry-run cluster-example
+
+pgAdmin4 can be installed in either desktop or server mode, with the default +being server.
+In server mode, authentication is required using a randomly generated password,
+and users must manually specify the database to connect to.
On the other hand, desktop mode initiates a pgAdmin web interface without
+requiring authentication. It automatically connects to the app database as the
+app user, making it ideal for quick demos, such as on a local deployment using
+kind:
kubectl cnpg pgadmin4 --mode desktop cluster-example
+
+After concluding your demo, ensure the termination of the pgAdmin deployment by +executing:
+kubectl cnpg pgadmin4 --dry-run cluster-example | kubectl delete -f -
+
+Warning
+Never deploy pgAdmin in production using the plugin.
+The cnpg publication command group is designed to streamline the creation and
+removal of PostgreSQL logical replication publications.
+Be aware that these commands are primarily intended for assisting in the
+creation of logical replication publications, particularly on remote PostgreSQL
+databases.
Warning
+It is crucial to have a solid understanding of both the capabilities and +limitations of PostgreSQL's native logical replication system before using +these commands. +In particular, be mindful of the logical replication restrictions.
+To create a logical replication publication, use the cnpg publication create
+command. The basic structure of this command is as follows:
kubectl cnpg publication create \
+ --publication PUBLICATION_NAME \
+ [--external-cluster EXTERNAL_CLUSTER]
+ LOCAL_CLUSTER [options]
+
+There are two primary use cases:
+With --external-cluster: Use this option to create a publication on an
+ external cluster (i.e. defined in the externalClusters stanza). The commands
+ will be issued from the LOCAL_CLUSTER, but the publication will be for the
+ data in EXTERNAL_CLUSTER.
Without --external-cluster: Use this option to create a publication in the
+ LOCAL_CLUSTER PostgreSQL Cluster (by default, the app database).
Warning
+When connecting to an external cluster, ensure that the specified user has
+sufficient permissions to execute the CREATE PUBLICATION command.
You have several options, similar to the CREATE PUBLICATION
+command, to define the group of tables to replicate. Notable options include:
--all-tables option, you create a publication FOR ALL TABLES.--table: Add a specific table (with an expression) to the publication.--schema: Include all tables in the specified database schema (available
+ from PostgreSQL 15).The --dry-run option enables you to preview the SQL commands that the plugin
+will execute.
For additional information and detailed instructions, type the following +command:
+kubectl cnpg publication create --help
+
+Given a source-cluster and a destination-cluster, we would like to create a
+publication for the data on source-cluster.
+The destination-cluster has an entry in the externalClusters stanza pointing
+to source-cluster.
We can run:
+kubectl cnpg publication create destination-cluster \
+ --external-cluster=source-cluster --all-tables
+
+which will create a publication for all tables on source-cluster, running
+the SQL commands on the destination-cluster.
Or instead, we can run:
+kubectl cnpg publication create source-cluster \
+ --publication=app --all-tables
+
+which will create a publication named app for all the tables in the
+source-cluster, running the SQL commands on the source cluster.
Info
+There are two sample files that have been provided for illustration and inspiration: +logical-source and +logical-destination.
+The cnpg publication drop command seamlessly complements the create command
+by offering similar key options, including the publication name, cluster name,
+and an optional external cluster. You can drop a PUBLICATION with the
+following command structure:
kubectl cnpg publication drop \
+ --publication PUBLICATION_NAME \
+ [--external-cluster EXTERNAL_CLUSTER]
+ LOCAL_CLUSTER [options]
+
+To access further details and precise instructions, use the following command:
+kubectl cnpg publication drop --help
+
+The cnpg subscription command group is a dedicated set of commands designed
+to simplify the creation and removal of
+PostgreSQL logical replication subscriptions.
+These commands are specifically crafted to aid in the establishment of logical
+replication subscriptions, especially when dealing with remote PostgreSQL
+databases.
Warning
+Before using these commands, it is essential to have a comprehensive +understanding of both the capabilities and limitations of PostgreSQL's +native logical replication system. +In particular, be mindful of the logical replication restrictions.
+In addition to subscription management, we provide a helpful command for +synchronizing all sequences from the source cluster. While its applicability +may vary, this command can be particularly useful in scenarios involving major +upgrades or data import from remote servers.
+To create a logical replication subscription, use the cnpg subscription create
+command. The basic structure of this command is as follows:
kubectl cnpg subscription create \
+ --subscription SUBSCRIPTION_NAME \
+ --publication PUBLICATION_NAME \
+ --external-cluster EXTERNAL_CLUSTER \
+ LOCAL_CLUSTER [options]
+
+This command configures a subscription directed towards the specified
+publication in the designated external cluster, as defined in the
+externalClusters stanza of the LOCAL_CLUSTER.
For additional information and detailed instructions, type the following +command:
+kubectl cnpg subscription create --help
+
+As in the section on publications, we have a source-cluster and a
+destination-cluster, and we have already created a publication called
+app.
The following command:
+kubectl cnpg subscription create destination-cluster \
+ --external-cluster=source-cluster \
+ --publication=app --subscription=app
+
+will create a subscription for app on the destination cluster.
Warning
+Prioritize testing subscriptions in a non-production environment to ensure +their effectiveness and identify any potential issues before implementing them +in a production setting.
+Info
+There are two sample files that have been provided for illustration and inspiration: +logical-source and +logical-destination.
+The cnpg subscription drop command seamlessly complements the create command.
+You can drop a SUBSCRIPTION with the following command structure:
kubectl cnpg subcription drop \
+ --subscription SUBSCRIPTION_NAME \
+ LOCAL_CLUSTER [options]
+
+To access further details and precise instructions, use the following command:
+kubectl cnpg subscription drop --help
+
+One notable constraint of PostgreSQL logical replication, implemented through +publications and subscriptions, is the lack of sequence synchronization. This +becomes particularly relevant when utilizing logical replication for live +database migration, especially to a higher version of PostgreSQL. A crucial +step in this process involves updating sequences before transitioning +applications to the new database (cutover).
+To address this limitation, the cnpg subscription sync-sequences command
+offers a solution. This command establishes a connection with the source
+database, retrieves all relevant sequences, and subsequently updates local
+sequences with matching identities (based on database schema and sequence
+name).
You can use the command as shown below:
+kubectl cnpg subscription sync-sequences \
+ --subscription SUBSCRIPTION_NAME \
+ LOCAL_CLUSTER
+
+For comprehensive details and specific instructions, utilize the following +command:
+kubectl cnpg subscription sync-sequences --help
+
+As in the previous sections for publication and subscription, we have
+a source-cluster and a destination-cluster. The publication and the
+subscription, both called app, are already present.
The following command will synchronize the sequences involved in the
+app subscription, from the source cluster into the destination cluster.
kubectl cnpg subscription sync-sequences destination-cluster \
+ --subscription=app
+
+Warning
+Prioritize testing subscriptions in a non-production environment to +guarantee their effectiveness and detect any potential issues before deploying +them in a production setting.
+The cnpg plugin can be easily integrated in K9s, a
+popular terminal-based UI to interact with Kubernetes clusters.
See k9s/plugins.yml for details.
The plugin requires a set of Kubernetes permissions that depends on the command
+to execute. These permissions may affect resources and sub-resources like Pods,
+PDBs, PVCs, and enable actions like get, delete, patch. The following
+table contains the full details:
| Command | +Resource Permissions | +
|---|---|
| backup | +clusters: get backups: create |
+
| certificate | +clusters: get secrets: get,create |
+
| destroy | +pods: get,delete jobs: delete,list PVCs: list,delete,update |
+
| fencing | +clusters: get,patch pods: get |
+
| fio | +PVCs: create configmaps: create deployment: create |
+
| hibernate | +clusters: get,patch,delete pods: list,get,delete pods/exec: create jobs: list PVCs: get,list,update,patch,delete |
+
| install | +none | +
| logs | +clusters: get pods: list pods/log: get |
+
| maintenance | +clusters: get,patch,list |
+
| pgadmin4 | +clusters: get configmaps: create deployments: create services: create secrets: create |
+
| pgbench | +clusters: get jobs: create |
+
| promote | +clusters: get clusters/status: patch pods: get |
+
| psql | +pods: get,list pods/exec: create |
+
| publication | +clusters: get pods: get,list pods/exec: create |
+
| reload | +clusters: get,patch | +
| report cluster | +clusters: get pods: list pods/log: get jobs: list events: list PVCs: list |
+
| report operator | +Required: deployments: get Optional (for full report): configmaps: get events: list pods: list pods/log: get secrets: get services: get mutatingwebhookconfigurations: list1 validatingwebhookconfigurations: list1 If OLM is present: clusterserviceversions: list1 installplans: list1 subscriptions: list1 |
+
| restart | +clusters: get,patch pods: get,delete |
+
| status | +clusters: get pods: list pods/exec: create pods/proxy: create PDBs: list objectstores.barmancloud.cnpg.io: get |
+
| subscription | +clusters: get pods: get,list pods/exec: create |
+
| version | +none | +
Additionally, assigning the list permission on the clusters will enable
+autocompletion for multiple commands.
It is possible to create roles with restricted permissions. +The following example creates a role that only has access to the cluster logs:
+---
+apiVersion: rbac.authorization.k8s.io/v1
+kind: Role
+metadata:
+ name: cnpg-log
+rules:
+ - verbs:
+ - get
+ apiGroups:
+ - postgresql.cnpg.io
+ resources:
+ - clusters
+ - verbs:
+ - list
+ apiGroups:
+ - ''
+ resources:
+ - pods
+ - verbs:
+ - get
+ apiGroups:
+ - ''
+ resources:
+ - pods/log
+
+The next example shows a role with the minimal permissions required to get
+the cluster status using the plugin's status command:
apiVersion: rbac.authorization.k8s.io/v1
+kind: Role
+metadata:
+ name: cnpg-status
+rules:
+ - verbs:
+ - get
+ apiGroups:
+ - postgresql.cnpg.io
+ resources:
+ - clusters
+ - verbs:
+ - list
+ apiGroups:
+ - ''
+ resources:
+ - pods
+ - verbs:
+ - create
+ apiGroups:
+ - ''
+ resources:
+ - pods/exec
+ - verbs:
+ - create
+ apiGroups:
+ - ''
+ resources:
+ - pods/proxy
+ - verbs:
+ - list
+ apiGroups:
+ - policy
+ resources:
+ - poddisruptionbudgets
+ - verbs:
+ - get
+ apiGroups:
+ - barmancloud.cnpg.io
+ resources:
+ - objectstores
+
+Important
+Keeping the verbs restricted per resources and per apiGroups helps to
+prevent inadvertently granting more than intended permissions.
Maintaining an up-to-date Kubernetes cluster is crucial for ensuring optimal +performance and security, particularly for self-managed clusters, especially +those running on bare metal infrastructure. Regular updates help address +technical debt and mitigate business risks, despite the controlled downtimes +associated with temporarily removing a node from the cluster for maintenance +purposes. For further insights on embracing risk in operations, refer to the +"Embracing Risk" +chapter from the Site Reliability Engineering book.
+Updating Kubernetes involves planning and executing maintenance tasks, such as +applying security updates to underlying Linux servers, replacing malfunctioning +hardware components, or upgrading the cluster to the latest Kubernetes version. +These activities are essential for maintaining a robust and secure +infrastructure.
+Typically, maintenance operations are carried out on one node at a time, following a structured process:
+drain): workloads are gracefully moved away from
+ the node to be updated, ensuring a smooth transition.uncordon): the updated node is
+ reintegrated into the cluster, ready to resume its responsibilities.This process requires either stopping workloads for the entire upgrade duration +or migrating them to other nodes in the cluster.
+While the standard approach ensures service reliability and leverages +Kubernetes' self-healing capabilities, there are scenarios where operating with +a temporarily degraded cluster may be acceptable. This is particularly relevant +for PostgreSQL clusters relying on node-local storage, where the storage is +local to the Kubernetes worker node running the PostgreSQL database. Node-local +storage, or simply local storage, is employed to enhance performance.
+Note
+If your database files reside on shared storage accessible over the +network, the default self-healing behavior of the operator can efficiently +handle scenarios where volumes are reused by pods on different nodes after a +drain operation. In such cases, you can skip the remaining sections of this +document.
+By default, CloudNativePG safeguards Postgres cluster operations. If a node is +to be drained and contains a cluster's primary instance, a switchover happens +ahead of the drain. Once the instance in the node is downgraded to replica, the +draining can resume. +For single-instance clusters, a switchover is not possible, so CloudNativePG +will prevent draining the node where the instance is housed. +Additionally, in clusters with 3 or more instances, CloudNativePG guarantees that +only one replica at a time is gracefully shut down during a drain operation.
+Each PostgreSQL Cluster is equipped with two associated PodDisruptionBudget
+resources - you can easily confirm it with the kubectl get pdb command.
Our recommendation is to leave pod disruption budgets enabled for every
+production Postgres cluster. This can be effortlessly managed by toggling the
+.spec.enablePDB option, as detailed in the
+API reference.
For PostgreSQL clusters used for development purposes, often consisting of +a single instance, it is essential to disable pod disruption budgets. Failure +to do so will prevent the node hosting that cluster from being drained.
+The following example illustrates how to disable pod disruption budgets for a +1-instance development cluster:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: dev
+spec:
+ instances: 1
+ enablePDB: false
+
+ storage:
+ size: 1Gi
+
+This configuration ensures smoother maintenance procedures without restrictions +on draining the node during development activities.
+Important
+While CloudNativePG will continue supporting the node maintenance window, +it is currently recommended to transition to direct control of pod disruption +budgets, as explained in the previous section. This section is retained +mainly for backward compatibility.
+Prior to release 1.23, CloudNativePG had just one declarative mechanism to manage
+Kubernetes upgrades when dealing with local storage: you had to temporarily put
+the cluster in maintenance mode through the nodeMaintenanceWindow option
+to avoid standard self-healing procedures to kick in, while, for example,
+enlarging the partition on the physical node or updating the node itself.
Warning
+Limit the duration of the maintenance window to the shortest +amount of time possible. In this phase, some of the expected +behaviors of Kubernetes are either disabled or running with +some limitations, including self-healing, rolling updates, +and Pod disruption budget.
+The nodeMaintenanceWindow option of the cluster has two further
+settings:
inProgress:
+Boolean value that states if the maintenance window for the nodes
+is currently in progress or not. By default, it is set to off.
+During the maintenance window, the reusePVC option below is
+evaluated by the operator.
reusePVC:
+Boolean value that defines if an existing PVC is reused or
+not during the maintenance operation. By default, it is set to on.
+When enabled, Kubernetes waits for the node to come up
+again and then reuses the existing PVC; the PodDisruptionBudget
+policy is temporarily removed.
+When disabled, Kubernetes forces the recreation of the
+Pod on a different node with a new PVC by relying on
+PostgreSQL's physical streaming replication, then destroys
+the old PVC together with the Pod. This scenario is generally
+not recommended unless the database's size is small, and re-cloning
+the new PostgreSQL instance takes shorter than waiting. This behavior
+does not apply to clusters with only one instance and
+reusePVC disabled: see section below.
Note
+When performing the kubectl drain command, you will need
+to add the --delete-emptydir-data option.
+Don't be afraid: it refers to another volume internally used
+by the operator - not the PostgreSQL data directory.
Important
+PodDisruptionBudget management can be disabled by setting the
+.spec.enablePDB field to false. In that case, the operator won't
+create PodDisruptionBudgets and will delete them if they were
+previously created.
reusePVC set to falseImportant
+We recommend to always create clusters with more +than one instance in order to guarantee high availability.
+Deleting the only PostgreSQL instance in a single instance cluster with
+reusePVC set to false would imply all data being lost,
+therefore we prevent users from draining nodes such instances might be running
+on, even in maintenance mode.
However, in case maintenance is required for such a node you have two options:
+reusePVC, accepting the downtimeAs long as a database service downtime is acceptable for your environment,
+draining the node is as simple as setting the nodeMaintenanceWindow to
+inProgress: true and reusePVC: true. This will allow the instance to
+be deleted and recreated as soon as the original PVC is available
+(e.g. with node local storage, as soon as the node is back up).
Otherwise you will have to scale up the cluster, creating a new instance +on a different node and promoting the new instance to primary in order to +shut down the original one on the node undergoing maintenance. The only +downtime in this case will be the duration of the switchover.
+A possible approach could be:
+Resources in Kubernetes are organized in a flat structure, with no hierarchical +information or relationship between them. However, such resources and objects +can be linked together and put in relationship through labels and +annotations.
+Info
+For more information, see the Kubernetes documentation on +annotations and +labels.
+In brief:
+You can select one or more labels or annotations to use +in your CloudNativePG deployments. Then you need to configure the operator +so that when you define these labels or annotations in a cluster's metadata, +they're inherited by all resources created by it (including pods).
+Note
+Label and annotation inheritance is the technique adopted by CloudNativePG +instead of alternative approaches such as pod templates.
+CloudNativePG manages the following predefined labels:
+cnpg.io/backupDateYYYYMMDD).
+ This label is available only on VolumeSnapshot resources.cnpg.io/backupNameVolumeSnapshot resources.cnpg.io/backupMonthVolumeSnapshot resources.cnpg.io/backupTimelineVolumeSnapshot resources.cnpg.io/backupYearVolumeSnapshot resources.cnpg.io/clustercnpg.io/immediateBackupBackup resource if the backup is the first one created from
+ a ScheduledBackup object having immediate set to true.cnpg.io/instanceNamepostgresql label).cnpg.io/jobRoleimport, initdb, join, ...)cnpg.io/majorVersion17).
+This label is available only on VolumeSnapshot resources.cnpg.io/onlineBackupVolumeSnapshot resources.cnpg.io/podRolecnpg.io/poolerNamecnpg.io/pvcRolePG_DATA or PG_WAL.cnpg.io/reloadConfigMap and Secret resources. When set to true,
+ a change in the resource is automatically reloaded by the operator.cnpg.io/userTypeSecret, either superuser (Postgres superuser access) or app
+ (application-level user in CloudNativePG terminology), and is limited to the
+ default users created by CloudNativePG (typically postgres and app).role - deprecatedprimary or a replica.
+ This label is deprecated, you should use cnpg.io/instanceRole instead.cnpg.io/scheduled-backupScheduledBackup resource that created a given
+ Backup object.cnpg.io/instanceRoleprimary or a replica.app.kubernetes.io/managed-bycloudnative-pg.
+ Available across all CloudNativePG managed resources.app.kubernetes.io/namepostgresql.
+ Available on pods, jobs, deployments, services, persistentVolumeClaims, volumeSnapshots,
+ podDisruptionBudgets, podMonitors.app.kubernetes.io/componentdatabase, pooler, ...).
+ Available on pods, jobs, deployments, services, persistentVolumeClaims, volumeSnapshots,
+ podDisruptionBudgets, podMonitors.app.kubernetes.io/instanceCluster resource.
+ Available on pods, jobs, deployments, services, volumeSnapshots, podDisruptionBudgets, podMonitors.app.kubernetes.io/versionCloudNativePG manages the following predefined annotations:
+container.apparmor.security.beta.kubernetes.io/*cnpg.io/backupEndTimeVolumeSnapshot resources.cnpg.io/backupEndWALVolumeSnapshot resources.cnpg.io/backupStartTimecnpg.io/backupStartWALVolumeSnapshot resources.cnpg.io/coredumpFilter0x31 to exclude shared memory
+segments from the dump. See PostgreSQL core dumps
+for more information.cnpg.io/clusterManifestCluster owning this resource (such as a PVC). This label
+replaces the old, deprecated cnpg.io/hibernateClusterManifest label.cnpg.io/fencedInstances* element.cnpg.io/forceLegacyBackupCluster resource for testing purposes only, to
+simulate the behavior of barman-cloud-backup prior to version 3.4 (Jan 2023)
+when the --name option wasn't available.cnpg.io/hashcnpg.io/hibernationCluster resource to control the declarative hibernation feature.
+Allowed values are on and off.cnpg.io/managedSecretsServiceAccount resources for each Postgres cluster.cnpg.io/nodeSerialcnpg.io/operatorVersioncnpg.io/pgControldatapg_controldata command. This annotation replaces the old,
+deprecated cnpg.io/hibernatePgControlData annotation.cnpg.io/podEnvHashcnpg.io/podSpec annotation now also contains the pod environment.cnpg.io/podPatchAnnotation can be applied on a Cluster resource.
When set to JSON-patch formatted patch, the patch will be applied on the instance Pods.
+⚠️ WARNING: This feature may introduce discrepancies between the +operator’s expectations and Kubernetes behavior. Use with caution and only as a +last resort.
+IMPORTANT: adding or changing this annotation won't trigger a rolling deployment
+of the generated Pods. The latter can be triggered manually by the user with
+kubectl cnpg restart.
cnpg.io/podSpecspec of the pod generated by the operator. This annotation replaces
+the old, deprecated cnpg.io/podEnvHash annotation.cnpg.io/poolerSpecHashcnpg.io/pvcStatusinitializing, ready, or detached.cnpg.io/reconcilePodSpecAnnotation can be applied to a Cluster or Pooler to prevent restarts.
When set to disabled on a Cluster, the operator prevents instances
+from restarting due to changes in the PodSpec. This includes changes to:
When set to disabled on a Pooler, the operator restricts any modifications
+to the deployment specification, except for changes to spec.instances.
cnpg.io/reconciliationLoopdisabled on a Cluster, the operator prevents the
+reconciliation loop from running.cnpg.io/reloadedAtreload time. reload is triggered by the user through a plugin.cnpg.io/skipEmptyWalArchiveCheckenabled on a Cluster resource, the operator disables the check
+that ensures that the WAL archive is empty before writing data. Use at your own
+risk.cnpg.io/skipWalArchivingenabled on a Cluster resource, the operator disables WAL archiving.
+This will set archive_mode to off and require a restart of all PostgreSQL
+instances. Use at your own risk.cnpg.io/snapshotStartTimecnpg.io/snapshotEndTimecnpg.io/validationWhen set to disabled on a CloudNativePG-managed custom resource, the
+validation webhook allows all changes without restriction.
⚠️ WARNING: Disabling validation may permit unsafe or destructive +operations. Use this setting with caution and at your own risk.
+cnpg.io/volumeSnapshotDeadlineBackup and ScheduledBackup resources, allows you to control
+how long the operator should retry recoverable errors before considering the
+volume snapshot backup failed. In minutes, defaulting to 10.kubectl.kubernetes.io/restartedAtalpha.cnpg.io/unrecoverablePod running a PostgreSQL instance.
+It instructs the operator to delete the Pod and all its associated PVCs.
+The instance will then be recreated according to the configured join
+strategy. This annotation can only be used on instances that are neither the
+current primary nor the designated target primary.By default, no label or annotation defined in the cluster's metadata is +inherited by the associated resources. +To enable label/annotation inheritance, follow the +instructions provided in Operator configuration.
+The following continues from that example and limits it to the following:
+categoriesapp, environment, and workloadNote
+Feel free to select the names that most suit your context for both
+annotations and labels. You can also use wildcards
+in naming and adopt strategies like using mycompany/* for all labels
+or setting annotations starting with mycompany/ to be inherited.
When defining the cluster, before any resource is deployed, you can +set the metadata as follows:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+ annotations:
+ categories: database
+ labels:
+ environment: production
+ workload: database
+ app: sso
+spec:
+ # ... <snip>
+
+Once the cluster is deployed, you can verify, for example, that the labels +were correctly set in the pods:
+kubectl get pods --show-labels
+
+Currently, CloudNativePG doesn't automatically propagate labels or +annotations deletions. Therefore, when an annotation or label is removed from +a cluster that was previously propagated to the underlying pods, the operator +doesn't remove it on the associated resources.
+ +CloudNativePG outputs logs in JSON format directly to standard output, including +PostgreSQL logs, without persisting them to storage for security reasons. This +design facilitates seamless integration with most Kubernetes-compatible log +management tools, including command line ones like +stern.
+Important
+Long-term storage and management of logs are outside the scope of the +operator and should be handled at the Kubernetes infrastructure level. +For more information, see the +Kubernetes Logging Architecture +documentation.
+Each log entry includes the following fields:
+level – The log level (e.g., info, notice).ts – The timestamp.logger – The type of log (e.g., postgres, pg_controldata).msg – The log message, or the keyword record if the message is in JSON
+ format.record – The actual record, with a structure that varies depending on the
+ logger type.logging_pod – The name of the pod where the log was generated.Info
+If your log ingestion system requires custom field names, you can rename
+the level and ts fields using the log-field-level and
+log-field-timestamp flags in the operator controller. This can be configured
+by editing the Deployment definition of the cloudnative-pg operator.
You can configure the log level for the instance pods in the cluster
+specification using the logLevel option. Available log levels are: error,
+warning, info (default), debug, and trace.
Important
+Currently, the log level can only be set at the time the instance starts. +Changes to the log level in the cluster specification after the cluster has +started will only apply to new pods, not existing ones.
+The logs produced by the operator pod can be configured with log
+levels, same as instance pods: error, warning, info (default), debug,
+and trace.
The log level for the operator can be configured by editing the Deployment
+definition of the operator and setting the --log-level command line argument
+to the desired value.
Each PostgreSQL log entry is a JSON object with the logger key set to
+postgres. The structure of the log entries is as follows:
{
+ "level": "info",
+ "ts": 1619781249.7188137,
+ "logger": "postgres",
+ "msg": "record",
+ "record": {
+ "log_time": "2021-04-30 11:14:09.718 UTC",
+ "user_name": "",
+ "database_name": "",
+ "process_id": "25",
+ "connection_from": "",
+ "session_id": "608be681.19",
+ "session_line_num": "1",
+ "command_tag": "",
+ "session_start_time": "2021-04-30 11:14:09 UTC",
+ "virtual_transaction_id": "",
+ "transaction_id": "0",
+ "error_severity": "LOG",
+ "sql_state_code": "00000",
+ "message": "database system was interrupted; last known up at 2021-04-30 11:14:07 UTC",
+ "detail": "",
+ "hint": "",
+ "internal_query": "",
+ "internal_query_pos": "",
+ "context": "",
+ "query": "",
+ "query_pos": "",
+ "location": "",
+ "application_name": "",
+ "backend_type": "startup"
+ },
+ "logging_pod": "cluster-example-1",
+}
+
+Info
+Internally, the operator uses PostgreSQL's CSV log format. For more details, +refer to the PostgreSQL documentation on CSV log format.
+CloudNativePG offers seamless and native support for +PGAudit on PostgreSQL clusters.
+To enable PGAudit, add the necessary pgaudit parameters in the postgresql
+section of the cluster configuration.
Important
+The PGAudit library must be added to shared_preload_libraries.
+CloudNativePG automatically manages this based on the presence of pgaudit.*
+parameters in the PostgreSQL configuration. The operator handles both the
+addition and removal of the library from shared_preload_libraries.
Additionally, the operator manages the creation and removal of the PGAudit +extension across all databases within the cluster.
+Important
+CloudNativePG executes the CREATE EXTENSION and DROP EXTENSION commands
+in all databases within the cluster that accept connections.
The following example demonstrates a PostgreSQL Cluster deployment with
+PGAudit enabled and configured:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+
+ postgresql:
+ parameters:
+ "pgaudit.log": "all, -misc"
+ "pgaudit.log_catalog": "off"
+ "pgaudit.log_parameter": "on"
+ "pgaudit.log_relation": "on"
+
+ storage:
+ size: 1Gi
+
+The audit CSV log entries generated by PGAudit are parsed and routed to +standard output in JSON format, similar to all other logs:
+.logger is set to pgaudit..msg is set to record..record contains the entire parsed record as a JSON object. This structure
+ resembles that of logging_collector logs, with the exception of
+ .record.audit, which contains the PGAudit CSV message formatted as a JSON
+ object.This example shows sample log entries:
+{
+ "level": "info",
+ "ts": 1627394507.8814096,
+ "logger": "pgaudit",
+ "msg": "record",
+ "record": {
+ "log_time": "2021-07-27 14:01:47.881 UTC",
+ "user_name": "postgres",
+ "database_name": "postgres",
+ "process_id": "203",
+ "connection_from": "[local]",
+ "session_id": "610011cb.cb",
+ "session_line_num": "1",
+ "command_tag": "SELECT",
+ "session_start_time": "2021-07-27 14:01:47 UTC",
+ "virtual_transaction_id": "3/336",
+ "transaction_id": "0",
+ "error_severity": "LOG",
+ "sql_state_code": "00000",
+ "backend_type": "client backend",
+ "audit": {
+ "audit_type": "SESSION",
+ "statement_id": "1",
+ "substatement_id": "1",
+ "class": "READ",
+ "command": "SELECT FOR KEY SHARE",
+ "statement": "SELECT pg_current_wal_lsn()",
+ "parameter": "<none>"
+ }
+ },
+ "logging_pod": "cluster-example-1",
+}
+
+See the +PGAudit documentation +for more details about each field in a record.
+All logs generated by the operator and its instances are in JSON format, with
+the logger field indicating the process that produced them. The possible
+logger values are as follows:
barman-cloud-wal-archive: logs from barman-cloud-wal-archivebarman-cloud-wal-restore: logs from barman-cloud-wal-restoreinitdb: logs from running initdbpg_basebackup: logs from running pg_basebackuppg_controldata: logs from running pg_controldatapg_ctl: logs from running any pg_ctl subcommandpg_rewind: logs from running pg_rewindpgaudit: logs from the PGAudit extensionpostgres: logs from the postgres instance (with msg distinct from
+ record)wal-archive: logs from the wal-archive subcommand of the instance managerwal-restore: logs from the wal-restore subcommand of the instance managerinstance-manager: from the PostgreSQL instance managerWith the exception of postgres, which follows a specific structure, all other
+logger values contain the msg field with the escaped message that is
+logged.
PostgreSQL extends its replication capabilities beyond physical replication, +which operates at the level of exact block addresses and byte-by-byte copying, +by offering logical replication. +Logical replication replicates data objects and their changes based on a +defined replication identity, typically the primary key.
+Logical replication uses a publish-and-subscribe model, where subscribers +connect to publications on a publisher node. Subscribers pull data changes from +these publications and can re-publish them, enabling cascading replication and +complex topologies.
+Important
+To protect your logical replication subscribers after a failover of the +publisher cluster in CloudNativePG, ensure that replication slot +synchronization for logical decoding is enabled. Without this, your logical +replication clients may lose data and fail to continue seamlessly after a +failover. For configuration details, see +"Replication: Logical Decoding Slot Synchronization".
+This flexible model is particularly useful for:
+Info
+For more details, examples, and limitations, please refer to the +official PostgreSQL documentation on Logical Replication.
+CloudNativePG enhances this capability by providing declarative support for +key PostgreSQL logical replication objects:
+Publication resourceSubscription resourceIn PostgreSQL's publish-and-subscribe replication model, a +publication +is the source of data changes. It acts as a logical container for the change +sets (also known as replication sets) generated from one or more tables within +a database. Publications can be defined on any PostgreSQL 10+ instance acting +as the publisher, including instances managed by popular DBaaS solutions in the +public cloud. Each publication is tied to a single database and provides +fine-grained control over which tables and changes are replicated.
+For publishers outside Kubernetes, you can create publications using SQL
+or leverage the cnpg publication create plugin command.
When managing Cluster objects with CloudNativePG, PostgreSQL publications
+can be defined declaratively through the Publication resource.
Info
+Please refer to the API reference
+for the full list of attributes you can define for each Publication object.
Suppose you have a cluster named freddie and want to replicate all tables in
+the app database. Here's a Publication manifest:
apiVersion: postgresql.cnpg.io/v1
+kind: Publication
+metadata:
+ name: freddie-publisher
+spec:
+ cluster:
+ name: freddie
+ dbname: app
+ name: publisher
+ target:
+ allTables: true
+
+In the above example:
+freddie-publisher (metadata.name).freddie cluster
+ (spec.cluster.name) with name publisher (spec.name).spec.target.allTables: true) from the app
+ database (spec.dbname).While the allTables option provides a convenient way to replicate all tables
+in a database, PostgreSQL version 15 and later introduce enhanced flexibility
+through the CREATE PUBLICATION
+command. This allows you to precisely define which tables, or even which types
+of data changes, should be included in a publication.
Important
+If you are using PostgreSQL versions earlier than 15, review the syntax and
+options available for CREATE PUBLICATION in your specific release. Some
+parameters and features may not be supported.
For complex or tailored replication setups, refer to the +PostgreSQL logical replication documentation.
+Additionally, refer to the CloudNativePG API reference +for details on declaratively customizing replication targets.
+The following example defines a publication that replicates all tables in the
+portal schema of the app database, along with the users table from the
+access schema:
apiVersion: postgresql.cnpg.io/v1
+kind: Publication
+metadata:
+ name: publisher
+spec:
+ cluster:
+ name: freddie
+ dbname: app
+ name: publisher
+ target:
+ objects:
+ - tablesInSchema: portal
+ - table:
+ name: users
+ schema: access
+
+Publication ManifestThe following fields are required for a Publication object:
metadata.name: Unique name for the Kubernetes Publication object.spec.cluster.name: Name of the PostgreSQL cluster.spec.dbname: Database name where the publication is created.spec.name: Publication name in PostgreSQL.spec.target: Specifies the tables or changes to include in the publication.The Publication object must reference a specific Cluster, determining where
+the publication will be created. It is managed by the cluster's primary instance,
+ensuring the publication is created or updated as needed.
After creating a Publication, CloudNativePG manages it on the primary
+instance of the specified cluster. Following a successful reconciliation cycle,
+the Publication status will reflect the following:
applied: true, indicates the configuration has been successfully applied.observedGeneration matches metadata.generation, confirming the applied
+ configuration corresponds to the most recent changes.If an error occurs during reconciliation, status.applied will be false, and
+an error message will be included in the status.message field.
The publicationReclaimPolicy field controls the behavior when deleting a
+Publication object:
retain (default): Leaves the publication in PostgreSQL for manual
+ management.delete: Automatically removes the publication from PostgreSQL.Consider the following example:
+apiVersion: postgresql.cnpg.io/v1
+kind: Publication
+metadata:
+ name: freddie-publisher
+spec:
+ cluster:
+ name: freddie
+ dbname: app
+ name: publisher
+ target:
+ allTables: true
+ publicationReclaimPolicy: delete
+
+In this case, deleting the Publication object also removes the publisher
+publication from the app database of the freddie cluster.
In PostgreSQL's publish-and-subscribe replication model, a +subscription +represents the downstream component that consumes data changes. +A subscription establishes the connection to a publisher's database and +specifies the set of publications (one or more) it subscribes to. Subscriptions +can be created on any supported PostgreSQL instance acting as the subscriber.
+Important
+Since schema definitions are not replicated, the subscriber must have the +corresponding tables already defined before data replication begins.
+CloudNativePG simplifies subscription management by enabling you to define them
+declaratively using the Subscription resource.
Info
+Please refer to the API reference
+for the full list of attributes you can define for each Subscription object.
Suppose you want to replicate changes from the publisher publication on the
+app database of the freddie cluster (publisher) to the app database of
+the king cluster (subscriber). Here's an example of a Subscription
+manifest:
apiVersion: postgresql.cnpg.io/v1
+kind: Subscription
+metadata:
+ name: freddie-to-king-subscription
+spec:
+ cluster:
+ name: king
+ dbname: app
+ name: subscriber
+ externalClusterName: freddie
+ publicationName: publisher
+
+In the above example:
+freddie-to-king-subscriber (metadata.name).app database (spec.dbname) of the
+ king cluster (spec.cluster.name), with name subscriber (spec.name).publisher publication in the external freddie cluster,
+ referenced by spec.externalClusterName.To facilitate this setup, the freddie external cluster must be defined in the
+king cluster's configuration. Below is an example excerpt showing how to
+define the external cluster in the king manifest:
externalClusters:
+ - name: freddie
+ connectionParameters:
+ host: freddie-rw.default.svc
+ user: postgres
+ dbname: app
+
+Info
+For more details on configuring the externalClusters section, see the
+"Bootstrap" section of the
+documentation.
As you can see, a subscription can connect to any PostgreSQL database +accessible over the network. This flexibility allows you to seamlessly migrate +your data into Kubernetes with nearly zero downtime. It’s an excellent option +for transitioning from various environments, including popular cloud-based +Database-as-a-Service (DBaaS) platforms.
+Subscription ManifestThe following fields are mandatory for defining a Subscription object:
metadata.name: A unique name for the Kubernetes Subscription object
+ within its namespace.spec.cluster.name: The name of the PostgreSQL cluster where the
+ subscription will be created.spec.dbname: The name of the database in which the subscription will be
+ created.spec.name: The name of the subscription as it will appear in PostgreSQL.spec.externalClusterName: The name of the external cluster, as defined in
+ the spec.cluster.name cluster's configuration. This references the
+ publisher database.spec.publicationName: The name of the publication in the publisher database
+ to which the subscription will connect.The Subscription object must reference a specific Cluster, determining
+where the subscription will be managed. CloudNativePG ensures that the
+subscription is created or updated on the primary instance of the specified
+cluster.
After creating a Subscription, CloudNativePG manages it on the primary
+instance of the specified cluster. Following a successful reconciliation cycle,
+the Subscription status will reflect the following:
applied: true, indicates the configuration has been successfully applied.observedGeneration matches metadata.generation, confirming the applied
+ configuration corresponds to the most recent changes.If an error occurs during reconciliation, status.applied will be false, and
+an error message will be included in the status.message field.
The subscriptionReclaimPolicy field controls the behavior when deleting a
+Subscription object:
retain (default): Leaves the subscription in PostgreSQL for manual
+ management.delete: Automatically removes the subscription from PostgreSQL.Consider the following example:
+apiVersion: postgresql.cnpg.io/v1
+kind: Subscription
+metadata:
+ name: freddie-to-king-subscription
+spec:
+ cluster:
+ name: king
+ dbname: app
+ name: subscriber
+ externalClusterName: freddie
+ publicationName: publisher
+ subscriptionReclaimPolicy: delete
+
+In this case, deleting the Subscription object also removes the subscriber
+subscription from the app database of the king cluster.
To ensure that your logical replication subscriptions remain operational after +a failover of the publisher, configure CloudNativePG to synchronize logical +decoding slots across the cluster. For detailed instructions, see +Logical Decoding Slot Synchronization.
+Logical replication in PostgreSQL has some inherent limitations, as outlined in +the official documentation. +Notably, the following objects are not replicated:
+The first limitation, related to schema replication, can be easily addressed
+using CloudNativePG's capabilities. For instance, you can leverage the import
+bootstrap feature to copy the schema of the tables you need to replicate.
+Alternatively, you can manually create the schema as you would for any
+PostgreSQL database.
While sequences are not automatically kept in sync through logical replication,
+CloudNativePG provides a solution to be used in live migrations.
+You can use the cnpg plugin
+to synchronize sequence values, ensuring consistency between the publisher and
+subscriber databases.
To highlight the powerful capabilities of logical replication, this example
+demonstrates how to replicate data from a publisher database (freddie)
+running PostgreSQL 16 to a subscriber database (king) running the latest
+PostgreSQL version. This setup can be deployed in your Kubernetes cluster for
+evaluation and hands-on learning.
This example illustrates how logical replication facilitates live migrations +and upgrades between PostgreSQL versions while ensuring data consistency. By +combining logical replication with CloudNativePG, you can easily set up, +manage, and evaluate such scenarios in a Kubernetes environment.
+freddie)The first step involves creating a freddie PostgreSQL cluster with version 16.
+The cluster contains a single instance and includes an app database
+initialized with a table, n, storing 10,000 numbers. A logical replication
+publication named publisher is also configured to include all tables in the
+database.
Here’s the manifest for setting up the freddie cluster and its publication
+resource:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: freddie
+spec:
+ instances: 1
+
+ imageName: ghcr.io/cloudnative-pg/postgresql:16-standard-trixie
+
+ storage:
+ size: 1Gi
+
+ bootstrap:
+ initdb:
+ postInitApplicationSQL:
+ - CREATE TABLE n (i SERIAL PRIMARY KEY, m INTEGER)
+ - INSERT INTO n (m) (SELECT generate_series(1, 10000))
+ - ALTER TABLE n OWNER TO app
+
+ managed:
+ roles:
+ - name: app
+ login: true
+ replication: true
+---
+apiVersion: postgresql.cnpg.io/v1
+kind: Publication
+metadata:
+ name: freddie-publisher
+spec:
+ cluster:
+ name: freddie
+ dbname: app
+ name: publisher
+ target:
+ allTables: true
+
+king)Next, create the king PostgreSQL cluster, running the latest version of
+PostgreSQL. This cluster initializes by importing the schema from the app
+database on the freddie cluster using the external cluster configuration. A
+Subscription resource, freddie-to-king-subscription, is then configured to
+consume changes published by the publisher on freddie.
Below is the manifest for setting up the king cluster and its subscription:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: king
+spec:
+ instances: 1
+
+ imageName: ghcr.io/cloudnative-pg/postgresql:18-standard-trixie
+
+ storage:
+ size: 1Gi
+
+ bootstrap:
+ initdb:
+ import:
+ type: microservice
+ schemaOnly: true
+ databases:
+ - app
+ source:
+ externalCluster: freddie
+
+ externalClusters:
+ - name: freddie
+ connectionParameters:
+ host: freddie-rw.default.svc
+ user: app
+ dbname: app
+ password:
+ name: freddie-app
+ key: password
+---
+apiVersion: postgresql.cnpg.io/v1
+kind: Subscription
+metadata:
+ name: freddie-to-king-subscription
+spec:
+ cluster:
+ name: king
+ dbname: app
+ name: subscriber
+ externalClusterName: freddie
+ publicationName: publisher
+
+Once the king cluster is running, you can verify that the replication is
+working by connecting to the app database and counting the records in the n
+table. The following example uses the psql command provided by the cnpg
+plugin for simplicity:
kubectl cnpg psql king -- app -qAt -c 'SELECT count(*) FROM n'
+10000
+
+This command should return 10000, confirming that the data from the freddie
+cluster has been successfully replicated to the king cluster.
Using the cnpg plugin, you can also synchronize existing sequences to ensure
+consistency between the publisher and subscriber. The example below
+demonstrates how to synchronize a sequence for the king cluster:
kubectl cnpg subscription sync-sequences king --subscription=subscriber
+SELECT setval('"public"."n_i_seq"', 10000);
+
+10000
+
+This command updates the sequence n_i_seq in the king cluster to match the
+current value, ensuring it is in sync with the source database.
Important
+Installing Prometheus and Grafana is beyond the scope of this project. +We assume they are correctly installed in your system. However, for +experimentation we provide instructions in +Part 4 of the Quickstart.
+For each PostgreSQL instance, the operator provides an exporter of metrics for
+Prometheus via HTTP or HTTPS, on port 9187, named metrics.
+The operator comes with a predefined set of metrics, as well as a highly
+configurable and customizable system to define additional queries via one or
+more ConfigMap or Secret resources (see the
+"User defined metrics" section below for details).
Important
+CloudNativePG, by default, installs a set of predefined metrics
+in a ConfigMap named cnpg-default-monitoring.
Info
+You can inspect the exported metrics by following the instructions in +the "How to inspect the exported metrics" +section below.
+All monitoring queries that are performed on PostgreSQL are:
+pg_monitor roleapplication_name set to cnpg_metrics_exporterpostgresPlease refer to the "Predefined Roles" section in PostgreSQL
+documentation
+for details on the pg_monitor role.
Queries, by default, are run against the main database, as defined by
+the specified bootstrap method of the Cluster resource, according
+to the following logic:
initdb: queries will be run by default against the specified database
+ in initdb.database, or app if not specifiedrecovery: queries will be run by default against the specified database
+ in recovery.database, or postgres if not specifiedpg_basebackup: queries will be run by default against the specified database
+ in pg_basebackup.database, or postgres if not specifiedThe default database can always be overridden for a given user-defined metric,
+by specifying a list of one or more databases in the target_databases option.
Prometheus/Grafana
+If you are interested in evaluating the integration of CloudNativePG +with Prometheus and Grafana, you can find a quick setup guide +in Part 4 of the quickstart
+By default, the outputs of monitoring queries are cached for thirty +seconds. This is done to enhance resource efficiency and to avoid +PostgreSQL to run monitoring queries every time the prometheus +endpoint is scraped.
+The cache itself can be observed by the cache_hits, cache_misses
+and last_update_timestamp metrics.
Setting the cluster.spec.monitoring.metricsQueriesTTL to zero will
+disable the cache, and in that case the metrics will be run on every
+metrics endpoint scrape.
You can monitor a specific PostgreSQL cluster using the
+Prometheus Operator's
+PodMonitor resource.
The recommended approach is to manually create and manage a PodMonitor for
+each CloudNativePG cluster. This method provides you with full control over the
+monitoring configuration and lifecycle.
PodMonitorTo monitor your cluster, define a PodMonitor resource as follows. Be sure to
+deploy it in the same namespace where your Prometheus Operator is configured to
+find PodMonitor resources.
apiVersion: monitoring.coreos.com/v1
+kind: PodMonitor
+metadata:
+ name: cluster-example
+spec:
+ selector:
+ matchLabels:
+ cnpg.io/cluster: cluster-example
+ podMetricsEndpoints:
+ - port: metrics
+
+Important Configuration Details
+metadata.name: Give your PodMonitor a unique name.spec.namespaceSelector: Use this to specify the namespace where
+ your PostgreSQL cluster is running.spec.selector.matchLabels: You must use the cnpg.io/cluster: <cluster-name>
+ label to correctly target the PostgreSQL instances.PodMonitor CreationFeature Deprecation Notice
+The .spec.monitoring.enablePodMonitor field in the Cluster resource is
+now deprecated and will be removed in a future version of the operator.
If you are currently using this feature, we strongly recommend you either
+remove or set .spec.monitoring.enablePodMonitor to false and manually
+create a PodMonitor resource for your cluster as described above.
+This change ensures that you have complete ownership of your monitoring
+configuration, preventing it from being managed or overwritten by the operator.
To enable TLS communication on the metrics port, configure the .spec.monitoring.tls.enabled
+setting to true. This setup ensures that the metrics exporter uses the same
+server certificate used by PostgreSQL to secure communication on port 5432.
Important
+Changing the .spec.monitoring.tls.enabled setting will trigger a rolling restart of the Cluster.
If the PodMonitor is managed by the operator (.spec.monitoring.enablePodMonitor set to true),
+it will automatically contain the necessary configurations to access the metrics via TLS.
To manually deploy a PodMonitor suitable for reading metrics via TLS, define it as follows and
+adjust as needed:
apiVersion: monitoring.coreos.com/v1
+kind: PodMonitor
+metadata:
+ name: cluster-example
+spec:
+ selector:
+ matchLabels:
+ "cnpg.io/cluster": cluster-example
+ podMetricsEndpoints:
+ - port: metrics
+ scheme: https
+ tlsConfig:
+ ca:
+ secret:
+ name: cluster-example-ca
+ key: ca.crt
+ serverName: cluster-example-rw
+
+Important
+Ensure you modify the example above with a unique name, as well as the
+correct Cluster's namespace and labels (e.g., cluster-example).
Important
+The serverName field in the metrics endpoint must match one of the names
+defined in the server certificate. If the default certificate is in use,
+the serverName value should be in the format <cluster-name>-rw.
Every PostgreSQL instance exporter automatically exposes a set of predefined +metrics, which can be classified in two major categories:
+PostgreSQL related metrics, starting with cnpg_collector_*, including:
.ready and .done files in the archive status folderGo runtime related metrics, starting with go_*
Below is a sample of the metrics returned by the localhost:9187/metrics
+endpoint of an instance. As you can see, the Prometheus format is
+self-documenting:
# HELP cnpg_collector_collection_duration_seconds Collection time duration in seconds
+# TYPE cnpg_collector_collection_duration_seconds gauge
+cnpg_collector_collection_duration_seconds{collector="Collect.up"} 0.0031393
+
+# HELP cnpg_collector_collections_total Total number of times PostgreSQL was accessed for metrics.
+# TYPE cnpg_collector_collections_total counter
+cnpg_collector_collections_total 2
+
+# HELP cnpg_collector_fencing_on 1 if the instance is fenced, 0 otherwise
+# TYPE cnpg_collector_fencing_on gauge
+cnpg_collector_fencing_on 0
+
+# HELP cnpg_collector_nodes_used NodesUsed represents the count of distinct nodes accommodating the instances. A value of '-1' suggests that the metric is not available. A value of '1' suggests that all instances are hosted on a single node, implying the absence of High Availability (HA). Ideally this value should match the number of instances in the cluster.
+# TYPE cnpg_collector_nodes_used gauge
+cnpg_collector_nodes_used 3
+
+# HELP cnpg_collector_last_collection_error 1 if the last collection ended with error, 0 otherwise.
+# TYPE cnpg_collector_last_collection_error gauge
+cnpg_collector_last_collection_error 0
+
+# HELP cnpg_collector_manual_switchover_required 1 if a manual switchover is required, 0 otherwise
+# TYPE cnpg_collector_manual_switchover_required gauge
+cnpg_collector_manual_switchover_required 0
+
+# HELP cnpg_collector_pg_wal Total size in bytes of WAL segments in the '/var/lib/postgresql/data/pgdata/pg_wal' directory computed as (wal_segment_size * count)
+# TYPE cnpg_collector_pg_wal gauge
+cnpg_collector_pg_wal{value="count"} 9
+cnpg_collector_pg_wal{value="slots_max"} NaN
+cnpg_collector_pg_wal{value="keep"} 32
+cnpg_collector_pg_wal{value="max"} 64
+cnpg_collector_pg_wal{value="min"} 5
+cnpg_collector_pg_wal{value="size"} 1.50994944e+08
+cnpg_collector_pg_wal{value="volume_max"} 128
+cnpg_collector_pg_wal{value="volume_size"} 2.147483648e+09
+
+# HELP cnpg_collector_pg_wal_archive_status Number of WAL segments in the '/var/lib/postgresql/data/pgdata/pg_wal/archive_status' directory (ready, done)
+# TYPE cnpg_collector_pg_wal_archive_status gauge
+cnpg_collector_pg_wal_archive_status{value="done"} 6
+cnpg_collector_pg_wal_archive_status{value="ready"} 0
+
+# HELP cnpg_collector_replica_mode 1 if the cluster is in replica mode, 0 otherwise
+# TYPE cnpg_collector_replica_mode gauge
+cnpg_collector_replica_mode 0
+
+# HELP cnpg_collector_sync_replicas Number of requested synchronous replicas (synchronous_standby_names)
+# TYPE cnpg_collector_sync_replicas gauge
+cnpg_collector_sync_replicas{value="expected"} 0
+cnpg_collector_sync_replicas{value="max"} 0
+cnpg_collector_sync_replicas{value="min"} 0
+cnpg_collector_sync_replicas{value="observed"} 0
+
+# HELP cnpg_collector_up 1 if PostgreSQL is up, 0 otherwise.
+# TYPE cnpg_collector_up gauge
+cnpg_collector_up{cluster="cluster-example"} 1
+
+# HELP cnpg_collector_postgres_version Postgres version
+# TYPE cnpg_collector_postgres_version gauge
+cnpg_collector_postgres_version{cluster="cluster-example",full="18.0"} 18.0
+
+# HELP cnpg_collector_last_failed_backup_timestamp The last failed backup as a unix timestamp (Deprecated)
+# TYPE cnpg_collector_last_failed_backup_timestamp gauge
+cnpg_collector_last_failed_backup_timestamp 0
+
+# HELP cnpg_collector_last_available_backup_timestamp The last available backup as a unix timestamp (Deprecated)
+# TYPE cnpg_collector_last_available_backup_timestamp gauge
+cnpg_collector_last_available_backup_timestamp 1.63238406e+09
+
+# HELP cnpg_collector_first_recoverability_point The first point of recoverability for the cluster as a unix timestamp (Deprecated)
+# TYPE cnpg_collector_first_recoverability_point gauge
+cnpg_collector_first_recoverability_point 1.63238406e+09
+
+# HELP cnpg_collector_lo_pages Estimated number of pages in the pg_largeobject table
+# TYPE cnpg_collector_lo_pages gauge
+cnpg_collector_lo_pages{datname="app"} 0
+cnpg_collector_lo_pages{datname="postgres"} 78
+
+# HELP cnpg_collector_wal_buffers_full Number of times WAL data was written to disk because WAL buffers became full. Only available on PG 14+
+# TYPE cnpg_collector_wal_buffers_full gauge
+cnpg_collector_wal_buffers_full{stats_reset="2023-06-19T10:51:27.473259Z"} 6472
+
+# HELP cnpg_collector_wal_bytes Total amount of WAL generated in bytes. Only available on PG 14+
+# TYPE cnpg_collector_wal_bytes gauge
+cnpg_collector_wal_bytes{stats_reset="2023-06-19T10:51:27.473259Z"} 1.0035147e+07
+
+# HELP cnpg_collector_wal_fpi Total number of WAL full page images generated. Only available on PG 14+
+# TYPE cnpg_collector_wal_fpi gauge
+cnpg_collector_wal_fpi{stats_reset="2023-06-19T10:51:27.473259Z"} 1474
+
+# HELP cnpg_collector_wal_records Total number of WAL records generated. Only available on PG 14+
+# TYPE cnpg_collector_wal_records gauge
+cnpg_collector_wal_records{stats_reset="2023-06-19T10:51:27.473259Z"} 26178
+
+# HELP cnpg_collector_wal_sync Number of times WAL files were synced to disk via issue_xlog_fsync request (if fsync is on and wal_sync_method is either fdatasync, fsync or fsync_writethrough, otherwise zero). Only available on PG 14+
+# TYPE cnpg_collector_wal_sync gauge
+cnpg_collector_wal_sync{stats_reset="2023-06-19T10:51:27.473259Z"} 37
+
+# HELP cnpg_collector_wal_sync_time Total amount of time spent syncing WAL files to disk via issue_xlog_fsync request, in milliseconds (if track_wal_io_timing is enabled, fsync is on, and wal_sync_method is either fdatasync, fsync or fsync_writethrough, otherwise zero). Only available on PG 14+
+# TYPE cnpg_collector_wal_sync_time gauge
+cnpg_collector_wal_sync_time{stats_reset="2023-06-19T10:51:27.473259Z"} 0
+
+# HELP cnpg_collector_wal_write Number of times WAL buffers were written out to disk via XLogWrite request. Only available on PG 14+
+# TYPE cnpg_collector_wal_write gauge
+cnpg_collector_wal_write{stats_reset="2023-06-19T10:51:27.473259Z"} 7243
+
+# HELP cnpg_collector_wal_write_time Total amount of time spent writing WAL buffers to disk via XLogWrite request, in milliseconds (if track_wal_io_timing is enabled, otherwise zero). This includes the sync time when wal_sync_method is either open_datasync or open_sync. Only available on PG 14+
+# TYPE cnpg_collector_wal_write_time gauge
+cnpg_collector_wal_write_time{stats_reset="2023-06-19T10:51:27.473259Z"} 0
+
+# HELP cnpg_last_error 1 if the last collection ended with error, 0 otherwise.
+# TYPE cnpg_last_error gauge
+cnpg_last_error 0
+
+# HELP go_gc_duration_seconds A summary of the pause duration of garbage collection cycles.
+# TYPE go_gc_duration_seconds summary
+go_gc_duration_seconds{quantile="0"} 5.01e-05
+go_gc_duration_seconds{quantile="0.25"} 7.27e-05
+go_gc_duration_seconds{quantile="0.5"} 0.0001748
+go_gc_duration_seconds{quantile="0.75"} 0.0002959
+go_gc_duration_seconds{quantile="1"} 0.0012776
+go_gc_duration_seconds_sum 0.0035741
+go_gc_duration_seconds_count 13
+
+# HELP go_goroutines Number of goroutines that currently exist.
+# TYPE go_goroutines gauge
+go_goroutines 25
+
+# HELP go_info Information about the Go environment.
+# TYPE go_info gauge
+go_info{version="go1.20.5"} 1
+
+# HELP go_memstats_alloc_bytes Number of bytes allocated and still in use.
+# TYPE go_memstats_alloc_bytes gauge
+go_memstats_alloc_bytes 4.493744e+06
+
+# HELP go_memstats_alloc_bytes_total Total number of bytes allocated, even if freed.
+# TYPE go_memstats_alloc_bytes_total counter
+go_memstats_alloc_bytes_total 2.1698216e+07
+
+# HELP go_memstats_buck_hash_sys_bytes Number of bytes used by the profiling bucket hash table.
+# TYPE go_memstats_buck_hash_sys_bytes gauge
+go_memstats_buck_hash_sys_bytes 1.456234e+06
+
+# HELP go_memstats_frees_total Total number of frees.
+# TYPE go_memstats_frees_total counter
+go_memstats_frees_total 172118
+
+# HELP go_memstats_gc_cpu_fraction The fraction of this program's available CPU time used by the GC since the program started.
+# TYPE go_memstats_gc_cpu_fraction gauge
+go_memstats_gc_cpu_fraction 1.0749468700447189e-05
+
+# HELP go_memstats_gc_sys_bytes Number of bytes used for garbage collection system metadata.
+# TYPE go_memstats_gc_sys_bytes gauge
+go_memstats_gc_sys_bytes 5.530048e+06
+
+# HELP go_memstats_heap_alloc_bytes Number of heap bytes allocated and still in use.
+# TYPE go_memstats_heap_alloc_bytes gauge
+go_memstats_heap_alloc_bytes 4.493744e+06
+
+# HELP go_memstats_heap_idle_bytes Number of heap bytes waiting to be used.
+# TYPE go_memstats_heap_idle_bytes gauge
+go_memstats_heap_idle_bytes 5.8236928e+07
+
+# HELP go_memstats_heap_inuse_bytes Number of heap bytes that are in use.
+# TYPE go_memstats_heap_inuse_bytes gauge
+go_memstats_heap_inuse_bytes 7.528448e+06
+
+# HELP go_memstats_heap_objects Number of allocated objects.
+# TYPE go_memstats_heap_objects gauge
+go_memstats_heap_objects 26306
+
+# HELP go_memstats_heap_released_bytes Number of heap bytes released to OS.
+# TYPE go_memstats_heap_released_bytes gauge
+go_memstats_heap_released_bytes 5.7401344e+07
+
+# HELP go_memstats_heap_sys_bytes Number of heap bytes obtained from system.
+# TYPE go_memstats_heap_sys_bytes gauge
+go_memstats_heap_sys_bytes 6.5765376e+07
+
+# HELP go_memstats_last_gc_time_seconds Number of seconds since 1970 of last garbage collection.
+# TYPE go_memstats_last_gc_time_seconds gauge
+go_memstats_last_gc_time_seconds 1.6311727586032727e+09
+
+# HELP go_memstats_lookups_total Total number of pointer lookups.
+# TYPE go_memstats_lookups_total counter
+go_memstats_lookups_total 0
+
+# HELP go_memstats_mallocs_total Total number of mallocs.
+# TYPE go_memstats_mallocs_total counter
+go_memstats_mallocs_total 198424
+
+# HELP go_memstats_mcache_inuse_bytes Number of bytes in use by mcache structures.
+# TYPE go_memstats_mcache_inuse_bytes gauge
+go_memstats_mcache_inuse_bytes 14400
+
+# HELP go_memstats_mcache_sys_bytes Number of bytes used for mcache structures obtained from system.
+# TYPE go_memstats_mcache_sys_bytes gauge
+go_memstats_mcache_sys_bytes 16384
+
+# HELP go_memstats_mspan_inuse_bytes Number of bytes in use by mspan structures.
+# TYPE go_memstats_mspan_inuse_bytes gauge
+go_memstats_mspan_inuse_bytes 191896
+
+# HELP go_memstats_mspan_sys_bytes Number of bytes used for mspan structures obtained from system.
+# TYPE go_memstats_mspan_sys_bytes gauge
+go_memstats_mspan_sys_bytes 212992
+
+# HELP go_memstats_next_gc_bytes Number of heap bytes when next garbage collection will take place.
+# TYPE go_memstats_next_gc_bytes gauge
+go_memstats_next_gc_bytes 8.689632e+06
+
+# HELP go_memstats_other_sys_bytes Number of bytes used for other system allocations.
+# TYPE go_memstats_other_sys_bytes gauge
+go_memstats_other_sys_bytes 2.566622e+06
+
+# HELP go_memstats_stack_inuse_bytes Number of bytes in use by the stack allocator.
+# TYPE go_memstats_stack_inuse_bytes gauge
+go_memstats_stack_inuse_bytes 1.343488e+06
+
+# HELP go_memstats_stack_sys_bytes Number of bytes obtained from system for stack allocator.
+# TYPE go_memstats_stack_sys_bytes gauge
+go_memstats_stack_sys_bytes 1.343488e+06
+
+# HELP go_memstats_sys_bytes Number of bytes obtained from system.
+# TYPE go_memstats_sys_bytes gauge
+go_memstats_sys_bytes 7.6891144e+07
+
+# HELP go_threads Number of OS threads created.
+# TYPE go_threads gauge
+go_threads 18
+
+Note
+cnpg_collector_postgres_version is a GaugeVec metric containing
+the Major.Minor version of PostgreSQL. The full semantic version
+Major.Minor.Patch can be found inside one of its label field
+named full.
Warning
+The metrics cnpg_collector_last_failed_backup_timestamp,
+cnpg_collector_last_available_backup_timestamp, and
+cnpg_collector_first_recoverability_point have been deprecated starting
+from version 1.26. These metrics will continue to function with native backup
+solutions such as in-core Barman Cloud (deprecated) and volume snapshots. Note
+that for these cases, cnpg_collector_first_recoverability_point and
+cnpg_collector_last_available_backup_timestamp will remain zero until the
+first backup is completed to the object store. This is separate from WAL
+archiving.
This feature is currently in beta state and the format is inspired by the +queries.yaml file (release 0.12) +of the PostgreSQL Prometheus Exporter.
+Custom metrics can be defined by users by referring to the created Configmap/Secret in a Cluster definition
+under the .spec.monitoring.customQueriesConfigMap or customQueriesSecret section as in the following example:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+ namespace: test
+spec:
+ instances: 3
+
+ storage:
+ size: 1Gi
+
+ monitoring:
+ customQueriesConfigMap:
+ - name: example-monitoring
+ key: custom-queries
+
+The customQueriesConfigMap/customQueriesSecret sections contain a list of
+ConfigMap/Secret references specifying the key in which the custom queries are defined.
+Take care that the referred resources have to be created in the same namespace as the Cluster resource.
Note
+If you want ConfigMaps and Secrets to be automatically reloaded by instances, you can
+add a label with key cnpg.io/reload to it, otherwise you will have to reload
+the instances using the kubectl cnpg reload subcommand.
Important
+When a user defined metric overwrites an already existing metric the instance manager prints a json warning log,
+containing the message:Query with the same name already found. Overwriting the existing one.
+and a key queryName containing the overwritten query name.
Here you can see an example of a ConfigMap containing a single custom query,
+referenced by the Cluster example above:
apiVersion: v1
+kind: ConfigMap
+metadata:
+ name: example-monitoring
+ namespace: test
+ labels:
+ cnpg.io/reload: ""
+data:
+ custom-queries: |
+ pg_replication:
+ query: "SELECT CASE WHEN NOT pg_is_in_recovery()
+ THEN 0
+ ELSE GREATEST (0,
+ EXTRACT(EPOCH FROM (now() - pg_last_xact_replay_timestamp())))
+ END AS lag,
+ pg_is_in_recovery() AS in_recovery,
+ EXISTS (TABLE pg_stat_wal_receiver) AS is_wal_receiver_up,
+ (SELECT count(*) FROM pg_stat_replication) AS streaming_replicas"
+
+ metrics:
+ - lag:
+ usage: "GAUGE"
+ description: "Replication lag behind primary in seconds"
+ - in_recovery:
+ usage: "GAUGE"
+ description: "Whether the instance is in recovery"
+ - is_wal_receiver_up:
+ usage: "GAUGE"
+ description: "Whether the instance wal_receiver is up"
+ - streaming_replicas:
+ usage: "GAUGE"
+ description: "Number of streaming replicas connected to the instance"
+
+A list of basic monitoring queries can be found in the
+default-monitoring.yaml file
+that is already installed in your CloudNativePG deployment (see "Default set of metrics").
The predicate_query option allows the user to execute the query to collect the metrics only under the specified conditions.
+To do so the user needs to provide a predicate query that returns at most one row with a single boolean column.
The predicate query is executed in the same transaction as the main query and against the same databases.
+some_query: |
+ predicate_query: |
+ SELECT
+ some_bool as predicate
+ FROM some_table
+ query: |
+ SELECT
+ count(*) as rows
+ FROM some_table
+ metrics:
+ - rows:
+ usage: "GAUGE"
+ description: "number of rows"
+
+If the target_databases option lists more than one database
+the metric is collected from each of them.
Database auto-discovery can be enabled for a specific query by specifying a
+shell-like pattern (i.e., containing *, ? or []) in the list of
+target_databases. If provided, the operator will expand the list of target
+databases by adding all the databases returned by the execution of SELECT
+datname FROM pg_database WHERE datallowconn AND NOT datistemplate and matching
+the pattern according to path.Match() rules.
Note
+The * character has a special meaning in yaml,
+so you need to quote ("*") the target_databases value when it includes such a pattern.
It is recommended that you always include the name of the database
+in the returned labels, for example using the current_database() function
+as in the following example:
some_query: |
+ query: |
+ SELECT
+ current_database() as datname,
+ count(*) as rows
+ FROM some_table
+ metrics:
+ - datname:
+ usage: "LABEL"
+ description: "Name of current database"
+ - rows:
+ usage: "GAUGE"
+ description: "number of rows"
+ target_databases:
+ - albert
+ - bb
+ - freddie
+
+This will produce in the following metric being exposed:
+cnpg_some_query_rows{datname="albert"} 2
+cnpg_some_query_rows{datname="bb"} 5
+cnpg_some_query_rows{datname="freddie"} 10
+
+Here is an example of a query with auto-discovery enabled which also
+runs on the template1 database (otherwise not returned by the
+aforementioned query):
some_query: |
+ query: |
+ SELECT
+ current_database() as datname,
+ count(*) as rows
+ FROM some_table
+ metrics:
+ - datname:
+ usage: "LABEL"
+ description: "Name of current database"
+ - rows:
+ usage: "GAUGE"
+ description: "number of rows"
+ target_databases:
+ - "*"
+ - "template1"
+
+The above example will produce the following metrics (provided the databases exist):
+cnpg_some_query_rows{datname="albert"} 2
+cnpg_some_query_rows{datname="bb"} 5
+cnpg_some_query_rows{datname="freddie"} 10
+cnpg_some_query_rows{datname="template1"} 7
+cnpg_some_query_rows{datname="postgres"} 42
+
+Every custom query has the following basic structure:
+<MetricName>:
+ query: "<SQLQuery>"
+ metrics:
+ - <ColumnName>:
+ usage: "<MetricType>"
+ description: "<MetricDescription>"
+
+Here is a short description of all the available fields:
+<MetricName>: the name of the Prometheus metricname: override <MetricName>, if definedquery: the SQL query to run on the target database to generate the metricsprimary: whether to run the query only on the primary instancemaster: same as primary (for compatibility with the Prometheus PostgreSQL exporter's syntax - deprecated) runonserver: a semantic version range to limit the versions of PostgreSQL the query should run on
+ (e.g. ">=11.0.0" or ">=12.0.0 <=15.0.0")target_databases: a list of databases to run the query against,
+ or a shell-like pattern
+ to enable auto discovery. Overwrites the default database if provided.predicate_query: a SQL query that returns at most one row and one boolean column to run on the target database.
+ The system evaluates the predicate and if true executes the query. metrics: section containing a list of all exported columns, defined as follows:<ColumnName>: the name of the column returned by the queryname: override the ColumnName of the column in the metric, if definedusage: one of the values described belowdescription: the metric's descriptionmetrics_mapping: the optional column mapping when usage is set to MAPPEDMETRICThe possible values for usage are:
| Column Usage Label | +Description | +
|---|---|
DISCARD |
+this column should be ignored | +
LABEL |
+use this column as a label | +
COUNTER |
+use this column as a counter | +
GAUGE |
+use this column as a gauge | +
MAPPEDMETRIC |
+use this column with the supplied mapping of text values | +
DURATION |
+use this column as a text duration (in milliseconds) | +
HISTOGRAM |
+use this column as a histogram | +
Please visit the "Metric Types" page +from the Prometheus documentation for more information.
+Custom defined metrics are returned by the Prometheus exporter endpoint (:9187/metrics)
+with the following format:
cnpg_<MetricName>_<ColumnName>{<LabelColumnName>=<LabelColumnValue> ... } <ColumnValue>
+
+Note
+LabelColumnName are metrics with usage set to LABEL and their Value
Considering the pg_replication example above, the exporter's endpoint would
+return the following output when invoked:
# HELP cnpg_pg_replication_in_recovery Whether the instance is in recovery
+# TYPE cnpg_pg_replication_in_recovery gauge
+cnpg_pg_replication_in_recovery 0
+# HELP cnpg_pg_replication_lag Replication lag behind primary in seconds
+# TYPE cnpg_pg_replication_lag gauge
+cnpg_pg_replication_lag 0
+# HELP cnpg_pg_replication_streaming_replicas Number of streaming replicas connected to the instance
+# TYPE cnpg_pg_replication_streaming_replicas gauge
+cnpg_pg_replication_streaming_replicas 2
+# HELP cnpg_pg_replication_is_wal_receiver_up Whether the instance wal_receiver is up
+# TYPE cnpg_pg_replication_is_wal_receiver_up gauge
+cnpg_pg_replication_is_wal_receiver_up 0
+
+The operator can be configured to automatically inject in a Cluster a set of
+monitoring queries defined in a ConfigMap or a Secret, inside the operator's namespace.
+You have to set the MONITORING_QUERIES_CONFIGMAP or
+MONITORING_QUERIES_SECRET key in the "operator configuration",
+respectively to the name of the ConfigMap or the Secret;
+the operator will then use the content of the queries key.
Any change to the queries content will be immediately reflected on all the
+deployed Clusters using it.
The operator installation manifests come with a predefined ConfigMap,
+called cnpg-default-monitoring, to be used by all Clusters.
+MONITORING_QUERIES_CONFIGMAP is by default set to cnpg-default-monitoring in the operator configuration.
If you want to disable the default set of metrics, you can:
+MONITORING_QUERIES_CONFIGMAP/MONITORING_QUERIES_SECRET key to ""
+ (empty string), in the operator ConfigMap. Changes to operator ConfigMap require an operator restart..spec.monitoring.disableDefaultQueries to true in the Cluster.Important
+The ConfigMap or Secret specified via MONITORING_QUERIES_CONFIGMAP/MONITORING_QUERIES_SECRET
+will always be copied to the Cluster's namespace with a fixed name: cnpg-default-monitoring.
+So that, if you intend to have default metrics, you should not create a ConfigMap with this name in the cluster's namespace.
CloudNativePG is inspired by the PostgreSQL Prometheus Exporter, but
+presents some differences. In particular, the cache_seconds field is not implemented
+in CloudNativePG's exporter.
The operator internally exposes Prometheus metrics
+via HTTP on port 8080, named metrics.
Info
+You can inspect the exported metrics by following the instructions in +the "How to inspect the exported metrics" +section below.
+Currently, the operator exposes default kubebuilder metrics. See
+kubebuilder documentation
+for more details.
The operator can be monitored using the +Prometheus Operator by defining a +PodMonitor +pointing to the operator pod(s), as follows (note it's applied in the same +namespace as the operator):
+kubectl -n cnpg-system apply -f - <<EOF
+---
+apiVersion: monitoring.coreos.com/v1
+kind: PodMonitor
+metadata:
+ name: cnpg-controller-manager
+spec:
+ selector:
+ matchLabels:
+ app.kubernetes.io/name: cloudnative-pg
+ podMetricsEndpoints:
+ - port: metrics
+EOF
+
+By default, the operator exposes its metrics over HTTP on port 8080.
+To secure this endpoint with TLS, follow these steps:
tls.crt) and
+ private key (tls.key).METRICS_CERT_DIR environment variable to point to the directory
+ where the certificates are mounted.Example Secret definition:
apiVersion: v1
+kind: Secret
+metadata:
+ name: cnpg-metrics-cert
+ namespace: cnpg-system
+type: kubernetes.io/tls
+data:
+ tls.crt: <base64-encoded-certificate>
+ tls.key: <base64-encoded-key>
+
+Next, update the operator deployment to mount the secret and configure the +environment variable:
+spec:
+ template:
+ spec:
+ containers:
+ - name: manager
+ env:
+ - name: METRICS_CERT_DIR
+ value: /run/secrets/cnpg.io/metrics
+ volumeMounts:
+ - mountPath: /run/secrets/cnpg.io/metrics
+ name: metrics-certificates
+ readOnly: true
+ volumes:
+ - name: metrics-certificates
+ secret:
+ secretName: cnpg-metrics-cert
+ defaultMode: 420
+
+Note
+When METRICS_CERT_DIR is set, the operator automatically enables TLS for
+the metrics server. You must also update your PodMonitor configuration to
+use the https scheme.
Example PodMonitor configuration with TLS enabled:
apiVersion: monitoring.coreos.com/v1
+kind: PodMonitor
+metadata:
+ name: cnpg-controller-manager
+ namespace: cnpg-system
+spec:
+ selector:
+ matchLabels:
+ app.kubernetes.io/name: cloudnative-pg
+ podMetricsEndpoints:
+ - port: metrics
+ scheme: https
+ tlsConfig:
+ insecureSkipVerify: true # or configure proper CA validation
+
+In this section we provide basic instructions on how to inspect +the metrics exported by a specific PostgreSQL instance manager (primary +or replica) or the operator.
+Note
+In the examples below we assume we are working in the default namespace, and
+with the operator installed in the cnpg-system namespace.
+Please adapt to your use case.
The simplest way to inspect the metrics is to port-forward the metrics ports +of the pods involved.
+For example, to inspect the metrics on the -1 instance of cluster-example,
+we port-forward the 9187 port:
kubectl port-forward cluster-example-1 9187:9187
+
+With port-forwarding active, the metrics can be inspected easily, for
+example on a web browser, using HTTP or HTTPS depending on the configuration,
+with address: localhost:9187/metrics.
The operator pod also exports metrics, on port 8080. Similarly to instances, we +port-forward the operator pod, which is located in the operator namespace:
+kubectl -n cnpg-system port-forward pod/<CONTROLLER-MANAGER-POD> 8080:8080
+
+With port forwarding active, the metrics are easily viewable on a browser at
+localhost:8080/metrics.
Create the curl pod with the following command:
kubectl apply -f - <<EOF
+---
+apiVersion: v1
+kind: Pod
+metadata:
+ name: curl
+spec:
+ containers:
+ - name: curl
+ image: curlimages/curl:8.16.0
+ command: ['sleep', '3600']
+EOF
+
+To inspect the metrics exported by an instance, you need
+to connect to port 9187 of the target pod. You will need to know the pod's
+IP address, which you can find easily by running kubectl get pod -o wide.
+The following generic command will run curl on the desired pod:
kubectl exec -ti curl -- curl -s <pod_ip>:9187/metrics
+
+For example, if your PostgreSQL cluster is called cluster-example and
+you want to retrieve the exported metrics of the first pod in the cluster,
+you can run the following command to programmatically get the IP of
+that pod:
POD_IP=$(kubectl get pod cluster-example-1 --template '{{.status.podIP}}')
+
+And then run:
+kubectl exec -ti curl -- curl -s ${POD_IP}:9187/metrics
+
+If you enabled TLS metrics, run instead:
+kubectl exec -ti curl -- curl -sk https://${POD_IP}:9187/metrics
+
+To access the metrics of the operator, you need to point +to the pod where the operator is running, and use TCP port 8080 as target.
+When you're done inspecting metrics, please remember to delete the curl pod:
kubectl delete -f curl.yaml
+
+Important
+These resources are provided for illustration and experimentation, and do +not represent any kind of recommendation for your production system
+In the doc/src/samples/monitoring/
+directory you will find a series of sample files for observability.
+Please refer to Part 4 of the quickstart
+section for context:
kube-stack-config.yaml: a configuration file for the kube-stack helm chart
+ installation. It ensures that Prometheus listens for all PodMonitor resources.prometheusrule.yaml: a PrometheusRule with alerts for CloudNativePG.
+ NOTE: this does not include inter-operation with notification services. Please refer
+ to the Prometheus documentation.podmonitor.yaml: a PodMonitor for the CloudNativePG Operator deployment.In addition, we provide the "raw" sources for the Prometheus alert rules in the
+alerts.yaml file.
A Grafana dashboard for CloudNativePG clusters and operator, is kept in the
+dedicated repository cloudnative-pg/grafana-dashboards
+as a dashboard JSON configuration:
+grafana-dashboard.json.
+The file can be downloaded, and imported into Grafana
+(menus: Dashboard > New > Import).
For a general reference on the settings available on kube-prometheus-stack,
+you can execute helm show values prometheus-community/kube-prometheus-stack.
+Please refer to the
+kube-prometheus-stack
+page for more detail.
CloudNativePG assumes the underlying Kubernetes cluster has the required +connectivity already set up. +Networking on Kubernetes is an important and extended topic; please refer to +the Kubernetes documentation for further information.
+If you're following the quickstart guide to install CloudNativePG on a local KinD or K3d cluster, you should not encounter any networking issues as neither +platform will add any networking restrictions by default.
+However, when deploying CloudNativePG on existing infrastructure, networking +restrictions might be in place that could impair the communication of the +operator with PostgreSQL clusters. +Specifically, existing Network Policies +might restrict certain types of traffic.
+Or, you might be interested in adding network policies in your environment for +increased security. +As mentioned in the security document, please ensure the operator can reach every cluster pod on ports 8000 and 5432, and that pods can connect to each other.
+Following the quickstart guide or using helm chart for deployment will install the operator in
+a dedicated namespace (cnpg-system by default).
+We recommend that you create clusters in a different namespace.
The operator must be able to connect to cluster pods.
+This might be precluded if there is a NetworkPolicy restricting
+cross-namespace traffic.
For example, the +kubernetes guide on network policies +contains an example policy denying all ingress traffic by default.
+If your local kubernetes setup has this kind of restrictive network policy, you
+will need to create a NetworkPolicy to explicitly allow connection from the
+operator namespace and pod to the cluster namespace and pods. You can find an example in the
+networkpolicy-example.yaml file in this repository.
+Please note, you'll need to adjust the cluster name and cluster namespace to
+match your specific setup, and also the operator namespace if it is not
+the default namespace.
While bootstrapping from another cluster or when using the externalClusters section,
+ensure connectivity among all clusters, object stores, and namespaces involved.
Again, we refer you to the Kubernetes documentation +for setup information.
+ +These capabilities were implemented by CloudNativePG, +classified using the +Operator SDK definition of Capability Levels +framework.
+
Important
+Based on the Operator Capability Levels model, +you can expect a "Level V - Auto Pilot" set of capabilities from the +CloudNativePG operator.
+Each capability level is associated with a certain set of management features the operator offers:
+Note
+We consider this framework as a guide for future work and implementations in the operator.
+Capability level 1 involves installing and configuring the +operator. This category includes usability and user experience +enhancements, such as improvements in how you interact with the +operator and a PostgreSQL cluster configuration.
+Important
+We consider information security part of this level.
+The operator is installed in a declarative way using a Kubernetes manifest
+that defines four major CustomResourceDefinition objects: Cluster, Pooler,
+Backup, and ScheduledBackup.
You define a PostgreSQL cluster (operand) using the Cluster custom resource
+in a fully declarative way. The PostgreSQL version is determined by the
+operand container image defined in the CR, which is automatically fetched
+from the requested registry. When deploying an operand, the operator also
+creates the following resources: Pod, Service, Secret,
+ConfigMap,PersistentVolumeClaim, PodDisruptionBudget, ServiceAccount,
+RoleBinding, and Role.
The operator is designed to support any operand container image with
+PostgreSQL inside.
+By default, the operator uses the latest available minor
+version of the latest stable major version supported by the PostgreSQL
+community and published on ghcr.io.
+You can use any compatible image of PostgreSQL supporting the
+primary/standby architecture directly by setting the imageName
+attribute in the CR. The operator also supports imagePullSecrets
+to access private container registries, and it supports digests and
+tags for finer control of container image immutability.
+If you prefer not to specify an image name, you can leverage
+image catalogs by simply referencing the PostgreSQL
+major version. Moreover, image catalogs enable you to effortlessly create
+custom catalogs, directing to images based on your specific requirements.
You can configure the operator to support inheriting labels and annotations +that are defined in a cluster's metadata. The goal is to improve the organization +of the CloudNativePG deployment in your Kubernetes infrastructure.
+Instead of relying on an external tool to
+coordinate PostgreSQL instances in the Kubernetes cluster pods,
+such as Patroni or Stolon, the operator
+injects the operator executable inside each pod, in a file named
+/controller/manager. The application is used to control the underlying
+PostgreSQL instance and to reconcile the pod status with the instance
+based on the PostgreSQL cluster topology. The instance manager also starts a
+web server that's invoked by the kubelet for probes. Unix signals invoked
+by the kubelet are filtered by the instance manager. Where appropriate, they're
+forwarded to the postgres process for fast and controlled reactions to
+external events. The instance manager is written in Go and has no external
+dependencies.
Storage is a critical component in a database workload. Taking advantage of the
+Kubernetes native capabilities and resources in terms of storage, the
+operator gives you enough flexibility to choose the right storage for your
+workload requirements, based on what the underlying Kubernetes environment
+can offer. This implies choosing a particular storage class in
+a public cloud environment or fine-tuning the generated PVC through a
+PVC template in the CR's storage parameter.
For better performance and finer control, you can also choose to host your
+cluster's write-ahead log (WAL, also known as pg_wal) on a separate volume,
+preferably on different storage.
+The "Benchmarking" section of the documentation provides
+detailed instructions on benchmarking both storage and the database before
+production. It relies on the cnpg plugin to ensure optimal performance and
+reliability.
The operator detects replicas in a cluster
+through a single parameter, called instances. If set to 1, the cluster
+comprises a single primary PostgreSQL instance with no replica. If higher
+than 1, the operator manages instances -1 replicas, including high
+availability (HA) through automated failover and rolling updates through
+switchover operations.
CloudNativePG manages replication slots for all replicas in the
+high-availability cluster. It also supports user-defined physical replication
+slots on the primary and enables logical decoding failover—natively for
+PostgreSQL 17 and later using sync_replication_slots, and through the
+pg_failover_slots extension for earlier versions.
By default, CloudNativePG creates three Kubernetes services +for applications to access the cluster via the network:
+You can disable the read-only and read services via configuration. +Additionally, you can leverage the service template capability +to create custom service resources, including load balancers, to access +PostgreSQL outside Kubernetes. This is particularly useful for DBaaS purposes.
+The operator is designed to bootstrap a PostgreSQL cluster with a single
+database. The operator transparently manages network access to the cluster
+through three Kubernetes services provisioned and managed for read-write,
+read, and read-only workloads.
+Using the convention-over-configuration approach, the operator creates a
+database called app, by default owned by a regular Postgres user with the
+same name. You can specify both the database name and the user name, if
+required, as part of the bootstrap.
Additional databases can be created or managed via
+declarative database management using
+the Database CRD, also supporting extensions, schemas, foreign data wrappers
+(FDW), and foreign servers.
Although no configuration is required to run the cluster, you can customize
+both PostgreSQL runtime configuration and PostgreSQL host-based
+authentication rules in the postgresql section of the CR.
CloudNativePG supports
+management of PostgreSQL roles, users, and groups through declarative configuration
+using the .spec.managed.roles stanza.
For InfoSec requirements, the operator doesn't require privileged mode for +any container. It enforces a read-only root filesystem to guarantee containers +immutability for both the operator and the operand pods. It also explicitly +sets the required security contexts.
+The cluster's affinity section enables fine-tuning of how pods and related
+resources, such as persistent volumes, are scheduled across the nodes of a
+Kubernetes cluster. In particular, the operator supports:
The cluster's topologySpreadConstraints section enables additional control of
+scheduling pods across topologies, enhancing what affinity and
+anti-affinity can offer.
CloudNativePG doesn't have its own command-line interface.
+It relies on the best command-line interface for Kubernetes, kubectl,
+by providing a plugin called cnpg. This plugin enhances and simplifies your PostgreSQL
+cluster management experience.
The operator continuously updates the status section of the CR with the
+observed status of the cluster. The entire PostgreSQL cluster status is
+continuously monitored by the instance manager running in each pod. The
+instance manager is responsible for applying the required changes to the
+controlled PostgreSQL instance to converge to the required status of
+the cluster. (For example, if the cluster status reports that pod -1 is the
+primary, pod -1 needs to promote itself while the other pods need to follow
+pod -1.) The same status is used by the cnpg plugin for kubectl to provide
+details.
The operator creates a certification authority for itself. +It creates and signs with the operator certification authority a leaf certificate +for the webhook server to use. This certificate ensures safe communication between the +Kubernetes API server and the operator.
+The operator creates a certification authority for every PostgreSQL
+cluster. This certification authority is used to issue and renew TLS certificates for clients' authentication,
+including streaming replication standby servers (instead of passwords).
+Support for a custom certification authority for client certificates is
+available through secrets, which also includes integration with cert-manager.
+Certificates can be issued with the cnpg plugin for kubectl.
The operator transparently and natively supports TLS/SSL connections +to encrypt client/server communications for increased security using the +cluster's certification authority. +Support for custom server certificates is available through secrets, which also +includes integration with cert-manager.
+To authorize streaming replication connections from the standby servers, +the operator relies on TLS client certificate authentication. This method is used +instead of relying on a password (and therefore a secret).
+The operator enables you to apply changes to the Cluster resource YAML
+section of the PostgreSQL configuration. Depending on the configuration option,
+it also makes sure that all instances are properly reloaded or restarted.
Note
+Changes with ALTER SYSTEM aren't detected, meaning
+that the cluster state isn't enforced.
The operator provides a declarative way to import existing
+Postgres databases in a new CloudNativePG cluster in Kubernetes, using
+offline migrations.
+The same feature also covers offline major upgrades of PostgreSQL databases.
+Offline means that applications must stop their write operations at the source
+until the database is imported.
+The feature extends the initdb bootstrap method to create a new PostgreSQL
+cluster using a logical snapshot of the data available in another PostgreSQL
+database. This data can be accessed by way of the network through a superuser
+connection. Import is from any supported version of Postgres. It relies on
+pg_dump and pg_restore being executed from the new cluster primary
+for all databases that are part of the operation and, if requested, for roles.
CloudNativePG supports the installation of clusters with the PostGIS +open source extension for geographical databases. This extension is one of the most popular +extensions for PostgreSQL.
+The operator allows you to configure LDAP authentication for your PostgreSQL +clients, using either the simple bind or search+bind mode, as described in +the LDAP authentication section of the PostgreSQL documentation.
+The operator can be installed through a Kubernetes manifest by way of kubectl
+apply, to be used in a traditional Kubernetes installation in public
+and private cloud environments. CloudNativePG also supports
+installation by way of a Helm chart or OLM bundle from OperatorHub.io.
The operator supports the convention-over-configuration paradigm, deciding
+standard default values while allowing you to override them and customize
+them. You can specify a deployment of a PostgreSQL cluster using
+the Cluster CRD in a couple of lines of YAML code.
Capability level 2 is about enabling updates of the operator and the actual +workload, in this case PostgreSQL servers. This includes PostgreSQL minor +release updates (security and bug fixes normally) as well as major online +upgrades.
+Upgrading the operator is seamless and can be done as a new deployment. After +upgrading the controller, a rolling update of all deployed PostgreSQL clusters +is initiated. You can choose to update all clusters simultaneously or +distribute their upgrades over time.
+Thanks to the instance manager's injection, upgrading the operator does not +require changes to the operand, allowing the operator to manage older versions +of it.
+Additionally, CloudNativePG supports in-place updates of the instance manager +following an operator upgrade. In-place updates do not require a rolling update +or a subsequent switchover of the cluster.
+The operand can be upgraded using a declarative configuration approach as
+part of changing the CR and, in particular, the imageName parameter.
+This is normally initiated by security updates or Postgres minor version updates.
+In the presence of standby servers, the operator performs rolling updates
+starting from the replicas. It does this by dropping the existing pod and creating a new
+one with the new requested operand image that reuses the underlying storage.
+Depending on the value of the primaryUpdateStrategy, the operator proceeds
+with a switchover before updating the former primary (unsupervised). Or, it waits
+for the user to manually issue the switchover procedure (supervised) by way of the
+cnpg plugin for kubectl.
+The setting to use depends on the business requirements, as the operation
+might generate some downtime for the applications. This downtime can range from a few seconds to
+minutes, based on the actual database workload.
CloudNativePG supports declarative offline in-place major upgrades when a new
+operand container image with a higher PostgreSQL major version is applied to a
+cluster. The upgrade can be triggered by updating the image tag via the
+.spec.imageName option or by using an image catalog to manage version
+changes. During the upgrade, all cluster pods are shut down to ensure data
+consistency. A new job is then created to validate the upgrade conditions,
+execute pg_upgrade, and create new directories for PGDATA, WAL files, and
+tablespaces if needed. Once the upgrade is complete, replicas are re-created.
+Failed upgrades can be rolled back.
At any time, convey the cluster's high availability status, for example,
+Setting up primary, Creating a new replica, Cluster in healthy state,
+Switchover in progress, Failing over, Upgrading cluster, and Upgrading
+Postgres major version.
Capability level 3 requires the operator to manage aspects of business +continuity and scalability.
+Disaster recovery is a business continuity component that requires +that both backup and recovery of a database work correctly. While as a +starting point, the goal is to achieve RPO < 5 +minutes, the long-term goal is to implement RPO=0 backup solutions. High +availability is the other important component of business continuity. Through +PostgreSQL native physical replication and hot standby replicas, it allows the +operator to perform failover and switchover operations. This area includes +enhancements in:
+The operator supports PostgreSQL continuous archiving of WAL files +to an object store (AWS S3 and S3-compatible, Azure Blob Storage, Google Cloud +Storage, and gateways like MinIO).
+WAL archiving is defined at the cluster level, declaratively, through the
+backup parameter in the cluster definition. This is done by specifying an S3 protocol
+destination URL (for example, to point to a specific folder in an AWS S3
+bucket) and, optionally, a generic endpoint URL.
WAL archiving, a prerequisite for continuous backup, doesn't require any further
+user action. The operator transparently sets
+the archive_command to rely on barman-cloud-wal-archive to ship WAL
+files to the defined endpoint. You can decide the compression algorithm,
+as well as the number of parallel jobs to concurrently upload WAL files
+in the archive. In addition, Instance Manager checks
+the correctness of the archive destination by performing the barman-cloud-check-wal-archive
+command before beginning to ship the first set of WAL files.
CloudNativePG provides a pluggable interface (CNPG-I) for managing +application-level backups using PostgreSQL’s native physical backup +mechanisms—namely base backups and continuous WAL archiving. This +design enables flexibility and extensibility while ensuring consistency and +performance.
+The CloudNativePG Community officially supports the Barman Cloud Plugin, +which enables continuous physical backups to object stores, along with full and +Point-In-Time Recovery (PITR) capabilities.
+In addition to CNPG-I plugins, CloudNativePG also natively supports backups +using Kubernetes volume snapshots, when supported by the underlying storage +class and CSI driver.
+You can initiate base backups in two ways:
+Backup custom resourceScheduledBackup custom resource, with a cron-like
+ schedule formatVolume snapshots leverage the Kubernetes API and are particularly effective for +very large databases (VLDBs) due to their speed and storage efficiency.
+Both volume snapshots and CNPG-I-based backups support:
+The operator supports offloading base backups onto a standby without impacting +the RPO of the database. This allows resources to +be preserved on the primary, in particular I/O, for standard database +operations.
+The operator enables you to bootstrap a new cluster (with its settings) +starting from an existing and accessible backup, either on a volume snapshot, +or in an object store, or via a plugin.
+Once the bootstrap process is completed, the operator initiates the instance in +recovery mode. It replays all available WAL files from the specified archive, +exiting recovery and starting as a primary. +Subsequently, the operator clones the requested number of standby instances +from the primary. +CloudNativePG supports parallel WAL fetching from the archive.
+The operator enables you to create a new PostgreSQL cluster by recovering +an existing backup to a specific point in time, defined with a timestamp, a +label, or a transaction ID. This capability is built on top of the full restore +one and supports all the options available in +PostgreSQL for PITR.
+Achieve zero data loss (RPO=0) in your local high-availability CloudNativePG
+cluster with support for both quorum-based and priority-based synchronous
+replication. The operator offers a flexible way to define the number of
+expected synchronous standby replicas available at any time, and allows
+customization of the synchronous_standby_names option as needed.
Establish a robust cross-Kubernetes cluster topology for PostgreSQL clusters,
+harnessing the power of native streaming and cascading replication. With the
+replica option, you can configure an autonomous cluster to consistently
+replicate data from another PostgreSQL source of the same major version. This
+source can be located anywhere, provided you have access to a WAL archive for
+fetching WAL files or a direct streaming connection via TLS between the two
+endpoints.
Notably, the source PostgreSQL instance can exist outside the Kubernetes +environment, whether in a physical or virtual setting.
+Replica clusters can be instantiated through various methods, including volume
+snapshots, a recovery object store (using the Barman Cloud backup format),
+or streaming using pg_basebackup. Both WAL file shipping and WAL streaming
+are supported. The deployment of replica clusters significantly elevates the
+business continuity posture of PostgreSQL databases within Kubernetes,
+extending across multiple data centers and facilitating hybrid and multi-cloud
+setups. (While anticipating Kubernetes federation native capabilities, manual
+switchover across data centers remains necessary.)
Additionally, the flexibility extends to creating delayed replica clusters
+intentionally lagging behind the primary cluster. This intentional lag aims to
+minimize the Recovery Time Objective (RTO) in the
+event of unintended errors, such as incorrect DELETE or UPDATE SQL operations.
Leverage replica clusters to +define distributed database topologies +for PostgreSQL that span across various Kubernetes clusters, facilitating hybrid +and multi-cloud deployments. With CloudNativePG, you gain powerful capabilities, +including:
+This setup can efficiently operate across two or more regions, can rely entirely +on object stores for replication, and guarantees a maximum RPO (Recovery Point +Objective) of 5 minutes. This advanced feature is uniquely provided by +CloudNativePG, ensuring robust data integrity and continuity across diverse +environments.
+CloudNativePG seamlessly integrates robust support for PostgreSQL tablespaces +by facilitating the declarative definition of individual persistent volumes. +This innovative feature empowers you to efficiently distribute I/O operations +across a diverse array of storage devices. Through the transparent +orchestration of tablespaces, CloudNativePG enhances the performance and +scalability of PostgreSQL databases, ensuring a streamlined and optimized +experience for managing large scale data storage in cloud-native environments. +Support for temporary tablespaces is also included.
+CloudNativePG configures startup, liveness, and readiness probes for PostgreSQL
+containers, which are managed by the Kubernetes kubelet. These probes interact
+with the /startupz, /healthz, and /readyz endpoints exposed by
+the instance manager's web server to monitor the Pod's health and readiness.
All probes are configured with default settings but can be fully customized to +meet specific needs, allowing for fine-tuning to align with your environment +and workloads.
+For detailed configuration options and advanced usage, +refer to the Postgres instance manager documentation.
+The operator supports rolling deployments to minimize the downtime. If a +PostgreSQL cluster is exposed publicly, the service load-balances the +read-only traffic only to available pods during the initialization or the +update.
+The operator allows you to scale up and down the number of instances in a
+PostgreSQL cluster. New replicas are started up from the
+primary server and participate in the cluster's HA infrastructure.
+The CRD declares a "scale" subresource that allows you to use the
+kubectl scale command.
The operator creates a PodDisruptionBudget resource to limit the number of
+concurrent disruptions to one primary instance. This configuration prevents the
+maintenance operation from deleting all the pods in a cluster, allowing the
+specified number of instances to be created. The PodDisruptionBudget is
+applied during the node-draining operation, preventing any disruption of the
+cluster service.
While this strategy is correct for Kubernetes clusters where
+storage is shared among all the worker nodes, it might not be the best solution
+for clusters using local storage or for clusters installed in a private
+cloud. The operator allows you to specify a maintenance window and
+configure the reaction to any underlying node eviction. The ReusePVC option
+in the maintenance window section enables to specify the strategy to use.
+Allocate new storage in a different PVC for the evicted instance, or wait
+for the underlying node to be available again.
Fencing is the process of protecting the data in one, more, or even all +instances of a PostgreSQL cluster when they appear to be malfunctioning. +When an instance is fenced, the PostgreSQL server process is +guaranteed to be shut down, while the pod is kept running. This ensures +that, until the fence is lifted, data on the pod isn't modified by PostgreSQL +and that you can investigate file system for debugging and troubleshooting +purposes.
+CloudNativePG supports hibernation of a running PostgreSQL cluster
+in a declarative manner, through the cnpg.io/hibernation annotation.
+Hibernation enables saving CPU power by removing the database pods while
+keeping the database PVCs. This feature simulates scaling to 0 instances.
When the operator needs to create a pod that was deleted by the user or
+was evicted by a Kubernetes maintenance operation, it reuses the
+PersistentVolumeClaim, if available. This ability avoids the need
+to clone the data from the primary again.
The operator allows administrators to control and manage resource usage by
+the cluster's pods in the resources section of the manifest. In
+particular, you can set requests and limits values for both CPU and RAM.
CloudNativePG provides native support for connection pooling with +PgBouncer, one of the most popular open source +connection poolers for PostgreSQL. From an architectural point of view, the +native implementation of a PgBouncer connection pooler introduces a new layer +to access the database. This optimizes the query flow toward the instances +and makes the use of the underlying PostgreSQL resources more efficient. +Instead of connecting directly to a PostgreSQL service, applications can now +connect to the PgBouncer service and start reusing any existing connection.
+CloudNativePG supports PostgreSQL's logical replication in a declarative manner
+using Publication and Subscription custom resource definitions.
Logical replication is particularly useful together with the import facility +for online data migrations (even from public DBaaS solutions) and major +PostgreSQL upgrades.
+Capability level 4 is about observability: monitoring, +alerting, trending, and log processing. This might involve the use of external tools, +such as Prometheus, Grafana, and Fluent Bit, as well as extensions in the +PostgreSQL engine for the output of error logs directly in JSON format.
+CloudNativePG was designed to provide everything needed +to easily integrate with industry-standard and community-accepted tools for +flexible monitoring and logging.
+The instance manager provides a pluggable framework. By way of its own web server
+listening on the metrics port (9187), it exposes an endpoint to export metrics
+for the Prometheus monitoring and alerting tool.
+The operator supports custom monitoring queries defined as ConfigMap
+or Secret objects using a syntax that's compatible with
+postgres_exporter for Prometheus.
+CloudNativePG provides a set of basic monitoring queries for
+PostgreSQL that can be integrated and adapted to your context.
CloudNativePG comes with a Grafana dashboard that you can use as a base to +monitor all critical aspects of a PostgreSQL cluster, and customize.
+Every log message is delivered to standard output in JSON format. The first level is the
+definition of the timestamp, the log level, and the type of log entry, such as
+postgres for the canonical PostgreSQL error message channel.
+As a result, every pod managed by CloudNativePG can be easily and directly
+integrated with any downstream log processing stack that supports JSON as source
+data type.
CloudNativePG transparently and natively supports:
+pg_stat_statements extension,
+ which enables tracking of planning and execution statistics of all SQL
+ statements executed by a PostgreSQL serverauto_explain extension,
+ which provides a means for logging execution plans of slow statements
+ automatically, without having to manually run EXPLAIN (helpful for tracking
+ down un-optimized queries)pg_failover_slots extension,
+ which makes logical replication slots usable across a physical failover,
+ ensuring resilience in change data capture (CDC) contexts based on PostgreSQL's
+ native logical replicationCloudNativePG allows database and security administrators, auditors, +and operators to track and analyze database activities using PGAudit for +PostgreSQL. +Such activities flow directly in the JSON log and can be properly routed to the +correct downstream target using common log brokers like Fluentd.
+Record major events as expected by the Kubernetes API, such as creating resources,
+removing nodes, and upgrading. Events can be displayed by using
+the kubectl describe and kubectl get events commands.
Capability level 5 is focused on automated scaling, healing, and +tuning through the discovery of anomalies and insights that emerged +from the observability layer.
+In case of detected failure on the primary, the operator changes the
+status of the cluster by setting the most aligned replica as the new target
+primary. As a consequence, the instance manager in each alive pod
+initiates the required procedures to align itself with the requested status of
+the cluster. It does this by either becoming the new primary or by following it.
+In case the former primary comes back up, the same mechanism avoids a
+split-brain by preventing applications from reaching it, running pg_rewind on
+the server and restarting it as a standby.
If the pod hosting a standby is removed, the operator initiates +the procedure to re-create a standby server.
+ +The operator for CloudNativePG is installed from a standard +deployment manifest and follows the convention over configuration paradigm. +While this is fine in most cases, there are some scenarios where you want +to change the default behavior, such as:
+By default, the operator is installed in the cnpg-system
+namespace as a Kubernetes Deployment called cnpg-controller-manager.
Note
+In the examples below we assume the default name and namespace for the operator deployment.
+The behavior of the operator can be customized through a ConfigMap/Secret that
+is located in the same namespace of the operator deployment and with
+cnpg-controller-manager-config as the name.
Important
+Any change to the config's ConfigMap/Secret will not be automatically
+detected by the operator, - and as such, it needs to be reloaded (see below).
+Moreover, changes only apply to the resources created after the configuration
+is reloaded.
Important
+The operator first processes the ConfigMap values and then the Secret’s, in this order. +As a result, if a parameter is defined in both places, the one in the Secret will be used.
+The operator looks for the following environment variables to be defined in the ConfigMap/Secret:
| Name | +Description | +
|---|---|
CERTIFICATE_DURATION |
+Determines the lifetime of the generated certificates in days. Default is 90. | +
CLUSTERS_ROLLOUT_DELAY |
+The duration (in seconds) to wait between the roll-outs of different clusters during an operator upgrade. This setting controls the timing of upgrades across clusters, spreading them out to reduce system impact. The default value is 0 which means no delay between PostgreSQL cluster upgrades. |
+
CREATE_ANY_SERVICE |
+When set to true, will create -any service for the cluster. Default is false |
+
ENABLE_INSTANCE_MANAGER_INPLACE_UPDATES |
+When set to true, enables in-place updates of the instance manager after an update of the operator, avoiding rolling updates of the cluster (default false) |
+
EXPIRING_CHECK_THRESHOLD |
+Determines the threshold, in days, for identifying a certificate as expiring. Default is 7. | +
INCLUDE_PLUGINS |
+A comma-separated list of plugins to be always included in the Cluster's reconciliation. | +
INHERITED_ANNOTATIONS |
+List of annotation names that, when defined in a Cluster metadata, will be inherited by all the generated resources, including pods |
+
INHERITED_LABELS |
+List of label names that, when defined in a Cluster metadata, will be inherited by all the generated resources, including pods |
+
INSTANCES_ROLLOUT_DELAY |
+The duration (in seconds) to wait between roll-outs of individual PostgreSQL instances within the same cluster during an operator upgrade. The default value is 0, meaning no delay between upgrades of instances in the same PostgreSQL cluster. |
+
KUBERNETES_CLUSTER_DOMAIN |
+Defines the domain suffix for service FQDNs within the Kubernetes cluster. If left unset, it defaults to "cluster.local". | +
METRICS_CERT_DIR |
+The directory where TLS certificates for the operator metrics server are stored. When set, enables TLS for the metrics endpoint on port 8080. The directory must contain tls.crt and tls.key files following standard Kubernetes TLS secret conventions. If not set, the metrics server operates without TLS (default behavior). |
+
MONITORING_QUERIES_CONFIGMAP |
+The name of a ConfigMap in the operator's namespace with a set of default queries (to be specified under the key queries) to be applied to all created Clusters |
+
MONITORING_QUERIES_SECRET |
+The name of a Secret in the operator's namespace with a set of default queries (to be specified under the key queries) to be applied to all created Clusters |
+
OPERATOR_IMAGE_NAME |
+The name of the operator image used to bootstrap Pods. Defaults to the image specified during installation. | +
POSTGRES_IMAGE_NAME |
+The name of the PostgreSQL image used by default for new clusters. Defaults to the version specified in the operator. | +
PULL_SECRET_NAME |
+Name of an additional pull secret to be defined in the operator's namespace and to be used to download images | +
STANDBY_TCP_USER_TIMEOUT |
+Defines the TCP_USER_TIMEOUT socket option for replication connections from standby instances to the primary. Default is 0 (system's default). |
+
DRAIN_TAINTS |
+Specifies the taint keys that should be interpreted as indicators of node drain. By default, it includes the taints commonly applied by kubectl, Cluster Autoscaler, and Karpenter: node.kubernetes.io/unschedulable, ToBeDeletedByClusterAutoscaler, karpenter.sh/disrupted, karpenter.sh/disruption. |
+
Values in INHERITED_ANNOTATIONS and INHERITED_LABELS support path-like wildcards. For example, the value example.com/* will match
+both the value example.com/one and example.com/two.
When you specify an additional pull secret name using the PULL_SECRET_NAME parameter,
+the operator will use that secret to create a pull secret for every created PostgreSQL
+cluster. That secret will be named <cluster-name>-pull.
The namespace where the operator looks for the PULL_SECRET_NAME secret is where
+you installed the operator. If the operator is not able to find that secret, it
+will ignore the configuration parameter.
The example below customizes the behavior of the operator, by defining
+the label/annotation names to be inherited by the resources created by
+any Cluster object that is deployed at a later time, by enabling
+in-place updates for the instance
+manager,
+and by spreading upgrades.
apiVersion: v1
+kind: ConfigMap
+metadata:
+ name: cnpg-controller-manager-config
+ namespace: cnpg-system
+data:
+ CLUSTERS_ROLLOUT_DELAY: '60'
+ ENABLE_INSTANCE_MANAGER_INPLACE_UPDATES: 'true'
+ INHERITED_ANNOTATIONS: categories
+ INHERITED_LABELS: environment, workload, app
+ INSTANCES_ROLLOUT_DELAY: '10'
+
+The example below customizes the behavior of the operator, by defining
+the label/annotation names to be inherited by the resources created by
+any Cluster object that is deployed at a later time, and by enabling
+in-place updates for the instance
+manager,
+and by spreading upgrades.
apiVersion: v1
+kind: Secret
+metadata:
+ name: cnpg-controller-manager-config
+ namespace: cnpg-system
+type: Opaque
+stringData:
+ CLUSTERS_ROLLOUT_DELAY: '60'
+ ENABLE_INSTANCE_MANAGER_INPLACE_UPDATES: 'true'
+ INHERITED_ANNOTATIONS: categories
+ INHERITED_LABELS: environment, workload, app
+ INSTANCES_ROLLOUT_DELAY: '10'
+
+For the change to be effective, you need to recreate the operator pods to +reload the config map. If you have installed the operator on Kubernetes +using the manifest you can do that by issuing:
+kubectl rollout restart deployment \
+ -n cnpg-system \
+ cnpg-controller-manager
+
+In general, given a specific namespace, you can delete the operator pods with +the following command:
+kubectl delete pods -n [NAMESPACE_NAME_HERE] \
+ -l app.kubernetes.io/name=cloudnative-pg
+
+Warning
+Customizations will be applied only to Cluster resources created
+after the reload of the operator deployment.
Following the above example, if the Cluster definition contains a categories
+annotation and any of the environment, workload, or app labels, these will
+be inherited by all the resources generated by the deployment.
The operator can expose a pprof HTTP server on localhost:6060.
+To enable it, edit the operator deployment and add the flag
+--pprof-server=true to the container args:
kubectl edit deployment -n cnpg-system cnpg-controller-manager
+
+Add --pprof-server=true to the args list, for example:
containers:
+ - args:
+ - controller
+ - --enable-leader-election
+ - --config-map-name=cnpg-controller-manager-config
+ - --secret-name=cnpg-controller-manager-config
+ - --log-level=info
+ - --pprof-server=true # relevant line
+ command:
+ - /manager
+
+After saving, the deployment will roll out and the new pod will +have the pprof server enabled.
+Important
+The pprof server only serves plain HTTP on port 6060.
To access the pprof endpoints from your local machine, use +port-forwarding:
+kubectl port-forward -n cnpg-system deploy/cnpg-controller-manager 6060
+curl -sS http://localhost:6060/debug/pprof/
+go tool pprof http://localhost:6060/debug/pprof/profile?seconds=30
+
+You can also access pprof using the browser at http://localhost:6060/debug/pprof/.
+Warning
+The example above uses kubectl port-forward for local testing only.
+This is not the intended way to expose the feature in production.
+Treat pprof as a sensitive debugging interface and never expose it publicly.
+If you must access it remotely, secure it with proper network policies and access controls.
PostGIS is a very popular open source extension +for PostgreSQL that introduces support for storing GIS (Geographic Information +Systems) objects in the database and be queried via SQL.
+Important
+This section assumes you are familiar with PostGIS and provides some basic +information about how to create a new PostgreSQL cluster with a PostGIS database +in Kubernetes via CloudNativePG.
+The CloudNativePG Community maintains container images that are built on top +of the maintained PostgreSQL Container images. +For more information, please visit:
+ +Conceptually, a PostGIS-based PostgreSQL cluster (or simply a PostGIS cluster) +is like any other PostgreSQL cluster. The only differences are:
+Since CloudNativePG is based on Immutable Application Containers, the only way +to provision PostGIS is to add it to the container image that you use for the +operand. The "Container Image Requirements" section provides +detailed instructions on how this is achieved. More simply, you can just use +the PostGIS container images from the Community, as in the examples below.
+The second step is to install the extension in the PostgreSQL database. You can +do this in two ways:
+template1 database to make it available for all the
+ databases you end up creating in the cluster, in case you adopt the monolith
+ architecture where the instance is shared by multiple databasesInfo
+For more information on the microservice vs monolith architecture in the database, +please refer to the "How many databases should be hosted in a single PostgreSQL instance?" FAQ +or the "Database import" section.
+Let's suppose you want to create a new PostgreSQL 18 cluster with PostGIS 3.6.
+The first step is to ensure you use the right PostGIS container image for the
+operand, and properly set the .spec.imageName option in the Cluster
+resource.
The postgis-example.yaml manifest below
+provides some guidance on how the creation of a PostGIS cluster can be done.
Warning
+Please consider that, although convention over configuration applies in
+CloudNativePG, you should spend time configuring and tuning your system for
+production. Also, the imageName in the example below deliberately points
+to the latest available image for PostgreSQL 18 - you should use a specific
+image name or, preferably, the SHA256 digest for true immutability.
+Alternatively, use the provided image catalogs.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: postgis-example
+spec:
+ instances: 1
+ imageName: ghcr.io/cloudnative-pg/postgis:18-3.6-system-trixie
+ storage:
+ size: 1Gi
+ postgresql:
+ parameters:
+ log_statement: ddl
+---
+apiVersion: postgresql.cnpg.io/v1
+kind: Database
+metadata:
+ name: postgis-example-app
+spec:
+ name: app
+ owner: app
+ cluster:
+ name: postgis-example
+ extensions:
+ - name: postgis
+ - name: postgis_topology
+ - name: fuzzystrmatch
+ - name: postgis_tiger_geocoder
+
+The example leverages the Database resource's declarative extension
+management to add the specified extensions to the app database.
Info
+For more details, see the +"Managing Extensions in a Database" section.
+You can easily verify the available version of PostGIS that is in the
+container, by connecting to the app database (you might obtain different
+values from the ones in this document):
$ kubectl cnpg psql postgis-example -- app
+psql (18.0 (Debian 18.0-1.pgdg13+3))
+Type "help" for help.
+
+app=# SELECT * FROM pg_available_extensions WHERE name ~ '^postgis' ORDER BY 1;
+ name | default_version | installed_version | comment
+--------------------------+-----------------+-------------------+------------------------------------------------------------
+ postgis | 3.6.0 | 3.6.0 | PostGIS geometry and geography spatial types and functions
+ postgis-3 | 3.6.0 | | PostGIS geometry and geography spatial types and functions
+ postgis_raster | 3.6.0 | | PostGIS raster types and functions
+ postgis_raster-3 | 3.6.0 | | PostGIS raster types and functions
+ postgis_sfcgal | 3.6.0 | | PostGIS SFCGAL functions
+ postgis_sfcgal-3 | 3.6.0 | | PostGIS SFCGAL functions
+ postgis_tiger_geocoder | 3.6.0 | 3.6.0 | PostGIS tiger geocoder and reverse geocoder
+ postgis_tiger_geocoder-3 | 3.6.0 | | PostGIS tiger geocoder and reverse geocoder
+ postgis_topology | 3.6.0 | 3.6.0 | PostGIS topology spatial types and functions
+ postgis_topology-3 | 3.6.0 | | PostGIS topology spatial types and functions
+(10 rows)
+
+The next step is to verify that the extensions listed in the Database
+resource have been correctly installed in the app database.
app=# \dx
+ List of installed extensions
+ Name | Version | Default version | Schema | Description
+------------------------+---------+-----------------+------------+------------------------------------------------------------
+ fuzzystrmatch | 1.2 | 1.2 | public | determine similarities and distance between strings
+ plpgsql | 1.0 | 1.0 | pg_catalog | PL/pgSQL procedural language
+ postgis | 3.6.0 | 3.6.0 | public | PostGIS geometry and geography spatial types and functions
+ postgis_tiger_geocoder | 3.6.0 | 3.6.0 | tiger | PostGIS tiger geocoder and reverse geocoder
+ postgis_topology | 3.6.0 | 3.6.0 | topology | PostGIS topology spatial types and functions
+
+Finally:
+app=# SELECT postgis_full_version();
+ postgis_full_version
+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------
+ POSTGIS="3.6.0 4c1967d" [EXTENSION] PGSQL="180" GEOS="3.13.1-CAPI-1.19.2" PROJ="9.6.0 NETWORK_ENABLED=OFF URL_ENDPOINT=https://cdn.proj.org USER_WRITABLE_DIRECTORY=/tmp/proj DATABASE_PATH=/usr/share/proj/proj.
+db" (compiled against PROJ 9.6.0) LIBXML="2.9.14" LIBJSON="0.18" LIBPROTOBUF="1.5.1" WAGYU="0.5.0 (Internal)" TOPOLOGY
+(1 row)
+
+
+ PostgreSQL upgrades fall into two categories:
+PostgreSQL version numbers follow a major.minor format. For instance, in
+version 17.1:
17 is the major version1 is the minor versionMinor releases are fully compatible with earlier and later minor releases of +the same major version. They include bug fixes and security updates but do not +introduce changes to the internal storage format.
+To upgrade to a newer minor version, simply update the PostgreSQL container +image reference in your cluster definition, either directly or via image catalogs. +CloudNativePG will trigger a rolling update of the cluster, +replacing each instance one by one, starting with the replicas. Once all +replicas have been updated, it will perform either a switchover or a restart of +the primary to complete the process.
+Major PostgreSQL releases introduce changes to the internal data storage +format, requiring a more structured upgrade process.
+CloudNativePG supports three methods for performing major upgrades:
+pg_upgrade – In-place upgrade, offline (covered in the
+ "Offline In-Place Major Upgrades" section below).Each method has trade-offs in terms of downtime, complexity, and data volume +handling. The best approach depends on your upgrade strategy and operational +constraints.
+Important
+We strongly recommend testing all methods in a controlled environment +before proceeding with a production upgrade.
+CloudNativePG performs an offline in-place major upgrade when a new operand +container image with a higher PostgreSQL major version is declaratively +requested for a cluster.
+Important
+Major upgrades are only supported between images based on the same
+operating system distribution. For example, if your previous version uses a
+bullseye image, you cannot upgrade to a bookworm image.
Warning
+There is a bug in PostgreSQL 17.0 through 17.5 that prevents successful upgrades
+if the max_slot_wal_keep_size parameter is set to any value other than -1.
+The upgrade process will fail with an error related to replication slot configuration.
+This issue has been fixed in PostgreSQL 17.6 and 18beta2 or later versions.
+If you are using PostgreSQL 17.0 through 17.5, ensure that you upgrade to at least
+PostgreSQL 17.6 before attempting a major upgrade, or make sure to temporarily set
+the max_slot_wal_keep_size parameter to -1 in your cluster configuration.
You can trigger the upgrade in one of two ways:
+.spec.imageName
+ option.For details on supported image tags, see +"Image Tag Requirements".
+Warning
+CloudNativePG is not responsible for PostgreSQL extensions. You must ensure +that extensions in the source PostgreSQL image are compatible with those in the +target image and that upgrade paths are supported. Thoroughly test the upgrade +process in advance to avoid unexpected issues. +The extensions management feature +can help manage extension upgrades declaratively.
+.status.pgDataImageInfo.PGDATA, and where applicable, WAL files and
+ tablespaces.pg_upgrade with the --link option.Warning
+During the upgrade process, the entire PostgreSQL cluster, including +replicas, is unavailable to applications. Ensure that your system can +tolerate this downtime before proceeding.
+Warning
+Performing an in-place upgrade is an exceptional operation that carries inherent +risks. It is strongly recommended to take a full backup of the cluster before +initiating the upgrade process.
+Info
+For detailed guidance on pg_upgrade, refer to the official
+PostgreSQL documentation.
If the upgrade is successful, CloudNativePG:
+Warning
+Re-cloning replicas can be time-consuming, especially for very large +databases. Plan accordingly to accommodate potential delays. After completing +the upgrade, it is strongly recommended to take a full backup. Existing backup +data (namely base backups and WAL files) is only available for the previous +minor PostgreSQL release.
+Warning
+pg_upgrade doesn't transfer optimizer statistics. After the upgrade, you
+may want to run ANALYZE on your databases to update them.
If the upgrade fails, you must manually revert the major version change in the +cluster's configuration and delete the upgrade job, as CloudNativePG cannot +automatically decide the rollback.
+Important
+This process protects your existing database from data loss, as no data +is modified during the upgrade. If the upgrade fails, a rollback is +usually possible, without having to perform a full recovery from a backup. +Ensure you monitor the process closely and take corrective action if needed.
+Consider the following PostgreSQL cluster running version 16:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ imageName: ghcr.io/cloudnative-pg/postgresql:16-minimal-trixie
+ instances: 3
+ storage:
+ size: 1Gi
+
+You can check the current PostgreSQL version using the following command:
+kubectl cnpg psql cluster-example -- -qAt -c 'SELECT version()'
+
+This will return output similar to:
+PostgreSQL 16.x ...
+
+To upgrade the cluster to version 17, update the imageName field by changing
+the major version tag from 16 to 17:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ imageName: ghcr.io/cloudnative-pg/postgresql:17-minimal-trixie
+ instances: 3
+ storage:
+ size: 1Gi
+
+-major-upgrade. This job runs pg_upgrade
+ on the primary’s persistent volume group.cluster-example-2 and
+ cluster-example-3) are removed.cluster-example-4 and cluster-example-5) are
+ re-cloned from the upgraded primary.Once the upgrade is complete, you can verify the new major version by running +the same command:
+kubectl cnpg psql cluster-example -- -qAt -c 'SELECT version()'
+
+This should now return output similar to:
+PostgreSQL 17.x ...
+
+You can now update the statistics by running ANALYZE on the app database:
kubectl cnpg psql cluster-example -- app -c 'ANALYZE'
+
+
+ Users that are familiar with PostgreSQL are aware of the existence of the +following three files to configure an instance:
+postgresql.conf: main run-time configuration file of PostgreSQLpg_hba.conf: clients authentication filepg_ident.conf: map external users to internal usersDue to the concepts of declarative configuration and immutability of the PostgreSQL
+containers, users are not allowed to directly touch those files. Configuration
+is possible through the postgresql section of the Cluster resource definition
+by defining custom postgresql.conf, pg_hba.conf, and pg_ident.conf settings
+via the parameters, the pg_hba, and the pg_ident keys.
These settings are the same across all instances.
+Warning
+Please don't use the ALTER SYSTEM query to change the configuration of
+the PostgreSQL instances in an imperative way. Changing some of the options
+that are normally controlled by the operator might indeed lead to an
+unpredictable/unrecoverable state of the cluster.
+Moreover, ALTER SYSTEM changes are not replicated across the cluster.
+See "Enabling ALTER SYSTEM" below for details.
A reference for custom settings usage is included in the samples, see
+cluster-example-custom.yaml.
postgresql sectionThe PostgreSQL instance in the pod starts with a default postgresql.conf file,
+to which these settings are automatically added:
listen_addresses = '*'
+include custom.conf
+
+The custom.conf file will contain the user-defined settings in the
+postgresql section, as in the following example:
# ...
+ postgresql:
+ parameters:
+ shared_buffers: "1GB"
+ # ...
+
+PostgreSQL GUCs: Grand Unified Configuration
+Refer to the PostgreSQL documentation for +more information on the available parameters, +also known as GUC (Grand Unified Configuration). +Please note that CloudNativePG accepts only strings for the PostgreSQL parameters.
+The content of custom.conf is automatically generated and maintained by the
+operator by applying the following sections in this order:
The global default parameters are:
+archive_timeout = '5min'
+dynamic_shared_memory_type = 'posix'
+full_page_writes = 'on'
+logging_collector = 'on'
+log_destination = 'csvlog'
+log_directory = '/controller/log'
+log_filename = 'postgres'
+log_rotation_age = '0'
+log_rotation_size = '0'
+log_truncate_on_rotation = 'false'
+max_parallel_workers = '32'
+max_replication_slots = '32'
+max_worker_processes = '32'
+shared_memory_type = 'mmap'
+shared_preload_libraries = ''
+ssl_max_protocol_version = 'TLSv1.3'
+ssl_min_protocol_version = 'TLSv1.3'
+wal_keep_size = '512MB'
+wal_level = 'logical'
+wal_log_hints = 'on'
+wal_sender_timeout = '5s'
+wal_receiver_timeout = '5s'
+
+Warning
+It is your duty to plan for WAL segments retention in your PostgreSQL
+cluster and properly configure either wal_keep_size or wal_keep_segments,
+depending on the server version, based on the expected and observed workloads.
Alternatively, if the only streaming replication clients are the replica instances
+running in the High Availability cluster, you can take advantage of the
+replication slots feature, which adds support for replication slots at the
+cluster level. You can enable the feature with the
+replicationSlots.highAvailability option (for more information, please refer to the
+"Replication" section.)
Without replication slots nor continuous backups in place, configuring
+wal_keep_size or wal_keep_segments is the only way to
+protect standbys from falling out of sync.
+If a standby did fall out of sync it would produce error
+messages like:
+"could not receive data from WAL stream: ERROR: requested WAL segment ************************ has already been removed".
+This will require you to dedicate a part of your PGDATA, or the volume
+dedicated to storing WAL files, to keep older WAL segments for streaming
+replication purposes.
The following parameters are fixed and exclusively controlled by the operator:
+archive_command = '/controller/manager wal-archive %p'
+hot_standby = 'true'
+listen_addresses = '*'
+port = '5432'
+restart_after_crash = 'false'
+ssl = 'on'
+ssl_ca_file = '/controller/certificates/client-ca.crt'
+ssl_cert_file = '/controller/certificates/server.crt'
+ssl_key_file = '/controller/certificates/server.key'
+unix_socket_directories = '/controller/run'
+
+Since the fixed parameters are added at the end, they can't be overridden by the +user via the YAML configuration. Those parameters are required for correct WAL +archiving and replication.
+The wal_level
+parameter in PostgreSQL determines the amount of information written to the
+Write-Ahead Log (WAL). It accepts the following values:
minimal: Writes only the information required for crash recovery.replica: Adds sufficient information to support WAL archiving and streaming
+ replication, including the ability to run read-only queries on standby
+ instances.logical: Includes all information from replica, plus additional information
+ required for logical decoding and replication.By default, upstream PostgreSQL sets wal_level to replica. CloudNativePG,
+instead, sets wal_level to logical by default to enable logical replication
+out of the box. This makes it easier to support use cases such as migrations
+from external PostgreSQL servers.
If your cluster does not require logical replication, it is recommended to set
+wal_level to replica to reduce WAL volume and overhead.
Finally, CloudNativePG allows wal_level to be set to minimal only for
+single-instance clusters with WAL archiving disabled.
The primary_conninfo, restore_command, and recovery_target_timeline
+parameters are automatically managed by the operator based on the instance's
+role within the cluster. These parameters are effectively applied only when the
+instance is operating as a replica.
primary_conninfo = 'host=<PRIMARY> user=postgres dbname=postgres'
+recovery_target_timeline = 'latest'
+
+The STANDBY_TCP_USER_TIMEOUT operator configuration setting,
+if specified, sets the tcp_user_timeout parameter on all standby instances
+managed by the operator.
The tcp_user_timeout parameter determines how long transmitted data can
+remain unacknowledged before the TCP connection is forcibly closed. Adjusting
+this value allows you to fine-tune the responsiveness of standby instances to
+network disruptions. For more details, refer to the
+PostgreSQL documentation.
The operator requires PostgreSQL to output its log in CSV format, and the +instance manager automatically parses it and outputs it in JSON format. +As a result, certain PostgreSQL log settings, listed in this section, +are fixed and cannot be modified.
+For further information, please refer to the "Logging" section.
+The shared_preload_libraries option in PostgreSQL exists to specify one or
+more shared libraries to be pre-loaded at server start, in the form of a
+comma-separated list. Typically, it is used in PostgreSQL to load those
+extensions that need to be available to most database sessions in the whole system
+(e.g. pg_stat_statements).
In CloudNativePG the shared_preload_libraries option is empty by
+default. Although you can override the content of shared_preload_libraries,
+we recommend that only expert Postgres users take advantage of this option.
Important
+In case a specified library is not found, the server fails to start,
+preventing CloudNativePG from any self-healing attempt and requiring
+manual intervention. Please make sure you always test both the extensions and
+the settings of shared_preload_libraries if you plan to directly manage its
+content.
CloudNativePG is able to automatically manage the content of the
+shared_preload_libraries option for some of the most used PostgreSQL
+extensions (see the "Managed extensions" section below
+for details).
Specifically, as soon as the operator notices that a configuration parameter +requires one of the managed libraries, it will automatically add the needed +library. The operator will also remove the library as soon as no actual parameter +requires it.
+Important
+Please always keep in mind that removing libraries from
+shared_preload_libraries requires a restart of all instances in the cluster
+in order to be effective.
You can provide additional shared_preload_libraries via
+.spec.postgresql.shared_preload_libraries as a list of strings: the operator
+will merge them with the ones that it automatically manages.
As anticipated in the previous section, CloudNativePG automatically
+manages the content in shared_preload_libraries for some well-known and
+supported extensions. The current list includes:
auto_explainpg_stat_statementspgauditpg_failover_slotsSome of these libraries also require additional objects in a database before
+using them, normally views and/or functions managed via the CREATE EXTENSION
+command to be run in a database (the DROP EXTENSION command typically removes
+those objects).
For such libraries, CloudNativePG automatically handles the creation +and removal of the extension in all databases that accept a connection in the +cluster, identified by the following query:
+SELECT datname FROM pg_database WHERE datallowconn
+
+Note
+The above query also includes template databases like template1.
Important
+With the introduction of declarative extensions
+in the Database CRD, you can now manage extensions directly. As a result,
+the managed extensions feature may undergo significant changes in future
+versions of CloudNativePG, and some functionalities might be deprecated.
auto_explainThe auto_explain
+extension provides a means for logging execution plans of slow statements
+automatically, without having to manually run EXPLAIN (helpful for tracking
+down un-optimized queries).
You can enable auto_explain by adding to the configuration a parameter
+that starts with auto_explain. as in the following example excerpt (which
+automatically logs execution plans of queries that take longer than 10 seconds
+to complete):
# ...
+ postgresql:
+ parameters:
+ auto_explain.log_min_duration: "10s"
+ # ...
+
+Note
+Enabling auto_explain can lead to performance issues. Please refer to the auto explain documentation
pg_stat_statementsThe pg_stat_statements
+extension is one of the most important capabilities available in PostgreSQL for
+real-time monitoring of queries.
You can enable pg_stat_statements by adding to the configuration a parameter
+that starts with pg_stat_statements. as in the following example excerpt:
# ...
+ postgresql:
+ parameters:
+ pg_stat_statements.max: "10000"
+ pg_stat_statements.track: all
+ # ...
+
+As explained previously, the operator will automatically add
+pg_stat_statements to shared_preload_libraries and run CREATE EXTENSION IF
+NOT EXISTS pg_stat_statements on each database, enabling you to run queries
+against the pg_stat_statements view.
pgauditThe pgaudit extension provides detailed session and/or object audit logging via the standard PostgreSQL logging facility.
CloudNativePG has transparent and native support for +PGAudit on PostgreSQL clusters. For further information, please refer to the "PGAudit" logs section.
+You can enable pgaudit by adding to the configuration a parameter
+that starts with pgaudit. as in the following example excerpt:
#
+postgresql:
+ parameters:
+ pgaudit.log: "all, -misc"
+ pgaudit.log_catalog: "off"
+ pgaudit.log_parameter: "on"
+ pgaudit.log_relation: "on"
+#
+
+pg_failover_slotsThe pg_failover_slots
+extension by EDB ensures that logical replication slots can survive a
+failover scenario. Failovers are normally implemented using physical
+streaming replication, like in the case of CloudNativePG.
You can enable pg_failover_slots by adding to the configuration a parameter
+that starts with pg_failover_slots.: as explained above, the operator will
+transparently manage the pg_failover_slots entry in the
+shared_preload_libraries option depending on this.
Please refer to thepg_failover_slotsdocumentation
+for details on this extension.
Additionally, for each database that you intend to you use with pg_failover_slots
+you need to add an entry in the pg_hba section that enables each replica to
+connect to the primary.
+For example, suppose that you want to use the app database with pg_failover_slots,
+you need to add this entry in the pg_hba section:
postgresql:
+ pg_hba:
+ - hostssl app streaming_replica all cert
+
+pg_hba sectionpg_hba is a list of PostgreSQL Host Based Authentication rules
+used to create the pg_hba.conf used by the pods.
Important
+See the PostgreSQL documentation for
+more information on pg_hba.conf.
Since the first matching rule is used for authentication, the pg_hba.conf file
+generated by the operator can be seen as composed of four sections:
Fixed rules:
+local all all peer
+
+hostssl postgres streaming_replica all cert map=cnpg_streaming_replica
+hostssl replication streaming_replica all cert map=cnpg_streaming_replica
+hostssl all cnpg_pooler_pgbouncer all cert map=cnpg_pooler_pgbouncer
+
+Default rules:
+host all all all <default-authentication-method>
+
+From PostgreSQL 14 the default value of the password_encryption
+database parameter is set to scram-sha-256. Because of that,
+the default authentication method is scram-sha-256 from this
+PostgreSQL version.
PostgreSQL 13 and older will use md5 as the default authentication
+method.
The resulting pg_hba.conf will look like this:
local all all peer
+
+hostssl postgres streaming_replica all cert map=cnpg_streaming_replica
+hostssl replication streaming_replica all cert map=cnpg_streaming_replica
+hostssl all cnpg_pooler_pgbouncer all cert map=cnpg_pooler_pgbouncer
+
+<user defined rules>
+<user defined LDAP>
+
+host all all all scram-sha-256 # (or md5 for PostgreSQL version <= 13)
+
+Inside the cluster manifest, pg_hba lines are added as list items
+in .spec.postgresql.pg_hba, as in the following excerpt:
postgresql:
+ pg_hba:
+ - hostssl app app 10.244.0.0/16 md5
+
+In the above example we are enabling access for the app user to the app
+database using MD5 password authentication (you can use scram-sha-256
+if you prefer) via a secure channel (hostssl).
Under the postgres section of the cluster spec there is an optional ldap section available to define an LDAP
+configuration to be converted into a rule added into the pg_hba.conf file.
This will support two modes: simple bind mode which requires specifying a server, prefix and suffix in the LDAP
+section and the search+bind mode which requires specifying server, baseDN, binDN, and a bindPassword which is
+a secret containing the ldap password. Additionally, in search+bind mode you have the option to specify a
+searchFilter or searchAttribute. If no searchAttribute is specified the default one of uid will be used.
Additionally, both modes allow the specification of a scheme for ldapscheme and a port. Neither scheme nor port are
+required, however.
This section filled out for search+bind could look as follows:
+postgresql:
+ ldap:
+ server: 'openldap.default.svc.cluster.local'
+ bindSearchAuth:
+ baseDN: 'ou=org,dc=example,dc=com'
+ bindDN: 'cn=admin,dc=example,dc=com'
+ bindPassword:
+ name: 'ldapBindPassword'
+ key: 'data'
+ searchAttribute: 'uid'
+
+pg_ident sectionpg_ident is a list of PostgreSQL User Name Maps that CloudNativePG uses to
+generate and maintain the ident map file (known as pg_ident.conf) inside the
+data directory.
Important
+See the PostgreSQL documentation for
+more information on pg_ident.conf.
The pg_ident.conf file written by the operator is made up of the following
+two sections:
Currently the only fixed rule, automatically generated by the operator, is:
+local <postgres system user> postgres
+
+The instance manager detects the user running the PostgreSQL instance and
+automatically adds a rule to map it to the postgres user in the database.
If the postgres user is not properly configured inside the container, the
+instance manager will allow any local user to connect and then log a warning
+message like the following:
Unable to identify the current user. Falling back to insecure mapping.
+
+The resulting pg_ident.conf will look like this:
local <postgres system user> postgres
+
+<user defined lines>
+
+Inside the cluster manifest, pg_ident lines are added as list items
+in .spec.postgresql.pg_ident.
+For example:
postgresql:
+ pg_ident:
+ - "mymap /^(.*)@mydomain\\.com$ \\1"
+
+You can apply configuration changes by editing the postgresql section of
+the Cluster resource.
After the change, the cluster instances will immediately reload the +configuration to apply the changes. +If the change involves a parameter requiring a restart, the operator will +perform a rolling upgrade.
+ALTER SYSTEMCloudNativePG strongly advocates employing the Cluster manifest as the +exclusive method for altering the configuration of a PostgreSQL cluster. This +approach guarantees coherence across the entire high-availability cluster and +aligns with best practices for Infrastructure-as-Code.
+In CloudNativePG the default configuration disables the use of ALTER SYSTEM
+on new Postgres clusters. This decision is rooted in the recognition of
+potential risks associated with this command. To enable the use of ALTER SYSTEM,
+you can explicitly set .spec.postgresql.enableAlterSystem to true.
Warning
+Proceed with caution when utilizing ALTER SYSTEM. This command operates
+directly on the connected instance and does not undergo replication.
+CloudNativePG assumes responsibility for certain fixed parameters and complete
+control over others, emphasizing the need for careful consideration.
Starting from PostgreSQL 17, the .spec.postgresql.enableAlterSystem setting
+directly controls the allow_alter_system GUC in PostgreSQL
+— a feature directly contributed by CloudNativePG to PostgreSQL.
Prior to PostgreSQL 17, when .spec.postgresql.enableAlterSystem is set to
+false, the postgresql.auto.conf file is made read-only. Consequently, any
+attempt to execute the ALTER SYSTEM command will result in an error. The
+error message might look like this:
ERROR: could not open file "postgresql.auto.conf": Permission denied
+
+PostgreSQL supports a few implementations for dynamic shared memory
+management through the
+dynamic_shared_memory_type
+configuration option. In CloudNativePG we recommend to limit ourselves to
+any of the following two values:
posix: which relies on POSIX shared memory allocated using shm_open (default setting)sysv: which is based on System V shared memory allocated via shmgetIn PostgreSQL, this setting is particularly important for memory allocation in parallel queries.
+For details, please refer to this
+thread from the pgsql-general mailing list.
The default setting of posix should be enough in most cases, considering that
+the operator automatically mounts a memory-bound EmptyDir volume called shm
+under /dev/shm. You can verify the size of such volume inside the running
+Postgres container with:
mount | grep shm
+
+You should get something similar to the following output:
+shm on /dev/shm type tmpfs (rw,nosuid,nodev,noexec,relatime,size=******)
+
+If you would like to set a maximum size for the shm volume, you can do so by
+setting the .spec.ephemeralVolumesSizeLimit.shm field in the Cluster resource.
+For example:
spec:
+ ephemeralVolumesSizeLimit:
+ shm: 1Gi
+
+In case your Kubernetes cluster has a high enough value for the SHMMAX
+and SHMALL parameters, you can also set:
dynamic_shared_memory_type: "sysv"
+
+You can check the SHMMAX/SHMALL from inside a PostgreSQL container, by running:
ipcs -lm
+
+For example:
+------ Shared Memory Limits --------
+max number of segments = 4096
+max seg size (kbytes) = 18014398509465599
+max total shared memory (kbytes) = 18014398509481980
+min seg size (bytes) = 1
+
+As you can see, the very high number of max total shared memory recommends
+setting dynamic_shared_memory_type to sysv.
An alternate method is to run:
+cat /proc/sys/kernel/shmall
+cat /proc/sys/kernel/shmmax
+
+Some PostgreSQL configuration parameters should be managed exclusively by the +operator. The operator prevents the user from setting them using a webhook.
+Users are not allowed to set the following configuration parameters in the
+postgresql section:
allow_alter_systemallow_system_table_modsarchive_cleanup_commandarchive_commandarchive_modebonjourbonjour_namecluster_nameconfig_filedata_directorydata_sync_retryevent_sourceexternal_pid_filehba_filehot_standbyident_filejit_providerlisten_addresseslog_destinationlog_directorylog_file_modelog_filenamelog_rotation_agelog_rotation_sizelog_truncate_on_rotationlogging_collectorportprimary_conninfoprimary_slot_namepromote_trigger_filerecovery_end_commandrecovery_min_apply_delayrecovery_targetrecovery_target_actionrecovery_target_inclusiverecovery_target_lsnrecovery_target_namerecovery_target_timerecovery_target_timelinerecovery_target_xidrestart_after_crashrestore_commandshared_preload_librariessslssl_ca_filessl_cert_filessl_crl_filessl_dh_params_filessl_ecdh_curvessl_key_filessl_passphrase_commandssl_passphrase_command_supports_reloadssl_prefer_server_ciphersstats_temp_directorysynchronous_standby_namessyslog_facilitysyslog_identsyslog_sequence_numberssyslog_split_messagesunix_socket_directoriesunix_socket_groupunix_socket_permissionsCloudNativePG candidate releases are pre-release versions made available for +testing before the community issues a new generally available (GA) release. +These versions are feature-frozen, meaning no new features are added, and are +intended for public testing prior to the final release.
+Important
+CloudNativePG release candidates are not intended for use in production +systems.
+Release candidates are provided to the community for extensive testing before +the official release. While a release candidate aims to be identical to the +initial release of a new minor version of CloudNativePG, additional changes may +be implemented before the GA release.
+The stability of each CloudNativePG minor release significantly depends on the +community's efforts to test the upcoming version with their workloads and +tools. Identifying bugs and regressions through user testing is crucial in +determining when we can finalize the release.
+The CloudNativePG Community strongly advises against using preview versions of +CloudNativePG in production environments or active development projects. Although +CloudNativePG undergoes extensive automated and manual testing, beta releases +may contain serious bugs. Features in preview versions may change in ways that +are not backwards compatible and could be removed entirely.
+There are currently no preview versions available.
+The current preview version is 1.28.0-rc1.
+For more information on the current preview version and how to test, please view the links below:
+ + +This section guides you through testing a PostgreSQL cluster on your local machine by +deploying CloudNativePG on a local Kubernetes cluster +using either Kind or +Minikube.
+Warning
+The instructions contained in this section are for demonstration, +testing, and practice purposes only and must not be used in production.
+Like any other Kubernetes application, CloudNativePG is deployed using +regular manifests written in YAML.
+By following the instructions on this page you should be able to start a PostgreSQL +cluster on your local Kubernetes installation and experiment with it.
+Important
+Make sure that you have kubectl installed on your machine in order
+to connect to the Kubernetes cluster. Please follow the Kubernetes documentation
+on how to install kubectl.
The first part is about installing Minikube or Kind. Please spend some time +reading about the systems and decide which one to proceed with. +After setting up one of them, please proceed with part 2.
+We also provide instructions for setting up monitoring with Prometheus and +Grafana for local testing/evaluation, in part 4
+Minikube is a tool that makes it easy to run Kubernetes locally. Minikube runs a +single-node Kubernetes cluster inside a Virtual Machine (VM) on your laptop for +users looking to try out Kubernetes or develop with it day-to-day. Normally, it +is used in conjunction with VirtualBox.
+You can find more information in the official Kubernetes documentation on how to +install Minikube in your local personal environment. +When you installed it, run the following command to create a minikube cluster:
+minikube start
+
+This will create the Kubernetes cluster, and you will be ready to use it. +Verify that it works with the following command:
+kubectl get nodes
+
+You will see one node called minikube.
If you do not want to use a virtual machine hypervisor, then Kind is a tool for running +local Kubernetes clusters using Docker container "nodes" (Kind stands for "Kubernetes IN Docker" indeed).
+Install kind on your environment following the instructions in the Quickstart,
+then create a Kubernetes cluster with:
kind create cluster --name pg
+
+Now that you have a Kubernetes installation up and running +on your laptop, you can proceed with CloudNativePG installation.
+Please refer to the "Installation" section and then proceed +with the deployment of a PostgreSQL cluster.
+As with any other deployment in Kubernetes, to deploy a PostgreSQL cluster
+you need to apply a configuration file that defines your desired Cluster.
The cluster-example.yaml sample file
+defines a simple Cluster using the default storage class to allocate
+disk space:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+
+ storage:
+ size: 1Gi
+
+There's more
+For more detailed information about the available options, please refer +to the "API Reference" section.
+In order to create the 3-node PostgreSQL cluster, you need to run the following command:
+kubectl apply -f cluster-example.yaml
+
+You can check that the pods are being created with the get pods command:
kubectl get pods
+
+That will look for pods in the default namespace. To separate your cluster
+from other workloads on your Kubernetes installation, you could always create
+a new namespace to deploy clusters on.
+Alternatively, you can use labels. The operator will apply the cnpg.io/cluster
+label on all objects relevant to a particular cluster. For example:
kubectl get pods -l cnpg.io/cluster=<CLUSTER>
+
+Important
+Note that we are using cnpg.io/cluster as the label. In the past you may
+have seen or used postgresql. This label is being deprecated, and
+will be dropped in the future. Please use cnpg.io/cluster.
By default, the operator will install the latest available minor version
+of the latest major version of PostgreSQL when the operator was released.
+You can override this by setting the imageName key in the spec section of
+the Cluster definition. For example, to install PostgreSQL 13.6:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ # [...]
+spec:
+ # [...]
+ imageName: ghcr.io/cloudnative-pg/postgresql:13.6
+ #[...]
+
+Important
+The immutable infrastructure paradigm requires that you always
+point to a specific version of the container image.
+Never use tags like latest or 13 in a production environment
+as it might lead to unpredictable scenarios in terms of update
+policies and version consistency in the cluster.
+For strict deterministic and repeatable deployments, you can add the digests
+to the image name, through the <image>:<tag>@sha256:<digestValue> format.
There's more
+There are some examples cluster configurations bundled with the operator. +Please refer to the "Examples" section.
+Important
+Installing Prometheus and Grafana is beyond the scope of this project. +The instructions in this section are provided for experimentation and +illustration only.
+In this section we show how to deploy Prometheus and Grafana for observability, +and how to create a Grafana Dashboard to monitor CloudNativePG clusters, and a +set of Prometheus Rules defining alert conditions.
+We leverage the Kube-Prometheus stack +Helm chart, which is maintained by the Prometheus Community. +Please refer to the project website for additional documentation and background.
+The Kube-Prometheus-stack Helm chart installs the Prometheus Operator, +including the Alert Manager, +and a Grafana deployment.
+We include a configuration file for the deployment of this Helm chart that will +provide useful initial settings for observability of CloudNativePG clusters.
+If you don't have Helm installed yet, please follow the +instructions to install it in your +system.
+We need to add the prometheus-community helm chart repository, and then
+install the Kube Prometheus stack with our sample configuration
+kube-stack-config.yaml.
We can accomplish this with the following commands:
+helm repo add prometheus-community \
+ https://prometheus-community.github.io/helm-charts
+
+helm upgrade --install \
+ -f https://raw.githubusercontent.com/cloudnative-pg/cloudnative-pg/main/docs/src/samples/monitoring/kube-stack-config.yaml \
+ prometheus-community \
+ prometheus-community/kube-prometheus-stack
+
+After completion, you will have Prometheus, Grafana, and Alert Manager,
+configured with the kube-stack-config.yaml file:
PodMonitor (see monitoring).Seealso
+For further information about the above helm commands, refer to the helm +install +documentation.
+You can see several Custom Resources have been created:
+% kubectl get crds
+NAME CREATED AT
+…
+alertmanagers.monitoring.coreos.com <timestamp>
+…
+prometheuses.monitoring.coreos.com <timestamp>
+prometheusrules.monitoring.coreos.com <timestamp>
+…
+
+as well as a series of Services:
+% kubectl get svc
+NAME TYPE PORT(S)
+… … …
+prometheus-community-grafana ClusterIP 80/TCP
+prometheus-community-kube-alertmanager ClusterIP 9093/TCP
+prometheus-community-kube-operator ClusterIP 443/TCP
+prometheus-community-kube-prometheus ClusterIP 9090/TCP
+
+At this point, a CloudNativePG cluster deployed with monitoring activated +would be observable via Prometheus.
+For example, you could deploy a simple cluster with PodMonitor enabled:
kubectl apply -f - <<EOF
+---
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-with-metrics
+spec:
+ instances: 3
+
+ storage:
+ size: 1Gi
+---
+apiVersion: monitoring.coreos.com/v1
+kind: PodMonitor
+metadata:
+ name: cluster-with-metrics
+spec:
+ selector:
+ matchLabels:
+ cnpg.io/cluster: cluster-with-metrics
+ podMetricsEndpoints:
+ - port: metrics
+EOF
+
+To access Prometheus, port-forward the Prometheus service:
+kubectl port-forward svc/prometheus-community-kube-prometheus 9090
+
+Then access the Prometheus console locally at: http://localhost:9090/
You should find a series of metrics relating to CloudNativePG clusters. +Please refer to the monitoring section for more information.
+
You can also monitor the CloudNativePG operator by creating a PodMonitor to +target it. See the relevant section in the monitoring page.
+You can define some alerts by creating a prometheusRule:
kubectl apply -f \
+ https://raw.githubusercontent.com/cloudnative-pg/cloudnative-pg/main/docs/src/samples/monitoring/prometheusrule.yaml
+
+You should see the default alerts now:
+% kubectl get prometheusrules
+NAME AGE
+cnpg-default-alerts 3m27s
+
+In the Prometheus console, you can click on the Alerts menu to see the alerts +we just installed.
+In our installation so far, Grafana is deployed with no predefined dashboards.
+To open Grafana, you can port-forward the grafana service:
+kubectl port-forward svc/prometheus-community-grafana 3000:80
+
+and access Grafana locally at http://localhost:3000/
+providing the credentials admin as username, prom-operator as password
+(defined in kube-stack-config.yaml).
CloudNativePG provides a default dashboard for Grafana in the dedicated
+grafana-dashboards repository.
+You can download the file
+grafana-dashboard.json
+and manually import it via the GUI (menu: Dashboards > New > Import).
+You can now click on the CloudNativePG dashboard just created:

Warning
+Some graphs in the previous dashboard make use of metrics that are in alpha stage by the time
+this was created, like kubelet_volume_stats_available_bytes and kubelet_volume_stats_capacity_bytes
+producing some graphs to show No data.
Note that in our local setup, Prometheus and Grafana are configured to +automatically discover and monitor any CloudNativePG clusters deployed with the +Monitoring feature enabled.
+ +In PostgreSQL, recovery refers to the process of starting an instance from +an existing physical backup. PostgreSQL's recovery system is robust and +feature-rich, supporting Point-In-Time Recovery (PITR)—the ability to +restore a cluster to any specific moment, from the earliest available backup to +the latest archived WAL file.
+Important
+A valid WAL archive is required to perform PITR.
+In CloudNativePG, recovery is not performed in-place on an existing +cluster. Instead, it is used to bootstrap a new cluster from a physical +backup.
+Note
+For more details on configuring the bootstrap stanza, refer to
+Bootstrap.
The recovery bootstrap mode allows you to initialize a cluster from a
+physical base backup and replay the associated WAL files to bring the system to
+a consistent and optionally point-in-time state.
CloudNativePG supports recovery via:
+With the deprecation of native Barman Cloud support in version 1.26, this +section now focuses on two supported recovery methods: using the Barman Cloud +Plugin for recovery from object stores, and the native interface for +recovery from volume snapshots.
+Important
+For legacy documentation, see +Appendix B – Recovery from an Object Store.
+This section outlines how to recover a PostgreSQL cluster from an object store +using the recommended Barman Cloud Plugin.
+Important
+The object store must contain backup data produced by a CloudNativePG
+Cluster—either using the deprecated native Barman Cloud integration or
+the Barman Cloud Plugin.
Info
+For full details, refer to the +“Recovery of a Postgres Cluster” section in the Barman Cloud Plugin documentation.
+Begin by defining the object store that holds both your base backups and WAL
+files. The Barman Cloud Plugin uses a custom ObjectStore resource for this
+purpose. The following example shows how to configure one for Azure Blob
+Storage:
apiVersion: barmancloud.cnpg.io/v1
+kind: ObjectStore
+metadata:
+ name: cluster-example-backup
+spec:
+ configuration:
+ destinationPath: https://STORAGEACCOUNTNAME.blob.core.windows.net/CONTAINERNAME/
+ azureCredentials:
+ storageAccount:
+ name: recovery-object-store-secret
+ key: storage_account_name
+ storageKey:
+ name: recovery-object-store-secret
+ key: storage_account_key
+ wal:
+ maxParallel: 8
+
+Next, configure the Cluster resource to use the ObjectStore you defined. In
+the bootstrap section, specify the recovery source, and define an
+externalCluster entry that references the plugin:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-restore
+spec:
+ [...]
+
+ superuserSecret:
+ name: superuser-secret
+
+ bootstrap:
+ recovery:
+ source: origin
+
+ externalClusters:
+ - name: origin
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: cluster-example-backup
+ serverName: cluster-example
+
+VolumeSnapshot ObjectsWarning
+When creating replicas after recovering a primary instance from a
+VolumeSnapshot, the operator may fall back to using pg_basebackup to
+synchronize them. This process can be significantly slower—especially for large
+databases—because it involves a full base backup. This limitation will be
+addressed in the future with support for online backups and PVC cloning in
+the scale-up process.
CloudNativePG allows you to create a new cluster from a VolumeSnapshot of a
+PersistentVolumeClaim (PVC) that belongs to an existing Cluster.
+These snapshots are created using the declarative API for
+volume snapshot backups.
To complete the recovery process, the new cluster must also reference an +external cluster that provides access to the WAL archive needed to reapply +changes and finalize the recovery.
+The following example shows a cluster being recovered using both a
+VolumeSnapshot for the base backup and a WAL archive accessed through the
+Barman Cloud Plugin:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-restore
+spec:
+ [...]
+
+ bootstrap:
+ recovery:
+ source: origin
+ volumeSnapshots:
+ storage:
+ name: <snapshot name>
+ kind: VolumeSnapshot
+ apiGroup: snapshot.storage.k8s.io
+
+ externalClusters:
+ - name: origin
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: cluster-example-backup
+ serverName: cluster-example
+
+In case the backed-up cluster was using a separate PVC to store the WAL files, +the recovery must include that too:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-restore
+spec:
+ [...]
+
+ bootstrap:
+ recovery:
+ volumeSnapshots:
+ storage:
+ name: <snapshot name>
+ kind: VolumeSnapshot
+ apiGroup: snapshot.storage.k8s.io
+
+ walStorage:
+ name: <snapshot name>
+ kind: VolumeSnapshot
+ apiGroup: snapshot.storage.k8s.io
+
+The previous example assumes that the application database and its owning user
+are named app by default. If the PostgreSQL cluster being restored uses
+different names, you must specify these names before exiting the recovery phase,
+as documented in "Configure the application database".
Warning
+If bootstrapping a replica-mode cluster from snapshots, to leverage +snapshots for the standby instances and not just the primary, +we recommend that you:
+Backup objectIf a Backup resource is already available in the namespace in which you need
+to create the cluster, you can specify the name using
+.spec.bootstrap.recovery.backup.name, as in the following example:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-initdb
+spec:
+ instances: 3
+
+ bootstrap:
+ recovery:
+ backup:
+ name: backup-example
+
+ storage:
+ size: 1Gi
+
+This bootstrap method allows you to specify just a reference to the +backup that needs to be restored.
+The previous example assumes that the application database and its owning user
+are named app by default. If the PostgreSQL cluster being restored uses
+different names, you must specify these names before exiting the recovery phase,
+as documented in "Configure the application database".
Whether you recover from an object store, a volume snapshot, or an existing
+Backup resource, no changes to the database, including the catalog, are
+permitted until the Cluster is fully promoted to primary and accepts write
+operations. This restriction includes any role overrides, which are deferred
+until the Cluster transitions to primary.
+As a result, the following considerations apply:
enableSuperuserAccess and supply a superuserSecret.By default, recovery continues up to the latest available WAL on the default
+target timeline (latest). You can optionally specify a recoveryTarget to
+perform a point-in-time recovery (see Point in Time Recovery (PITR)).
Important
+Consider using the barmanObjectStore.wal.maxParallel option to speed
+up WAL fetching from the archive by concurrently downloading the transaction
+logs from the recovery object store.
Instead of replaying all the WALs up to the latest one, after extracting a base +backup, you can ask PostgreSQL to stop replaying WALs at any given point in +time. PostgreSQL uses this technique to achieve PITR. The presence of a WAL +archive is mandatory.
+Important
+PITR requires you to specify a recovery target by using the options +described in Recovery targets.
+The operator generates the configuration parameters required for this +feature to work if you specify a recovery target.
+This example uses the same recovery object store in Azure defined earlier for +the Barman Cloud plugin, containing both the base backups and the WAL archive. +The recovery target is based on a requested timestamp.
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-restore-pitr
+spec:
+ instances: 3
+
+ storage:
+ size: 5Gi
+
+ bootstrap:
+ recovery:
+ # Recovery object store containing WAL archive and base backups
+ source: origin
+ recoveryTarget:
+ # Time base target for the recovery
+ targetTime: "2023-08-11 11:14:21.00000+02"
+
+ externalClusters:
+ - name: origin
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: cluster-example-backup
+ serverName: cluster-example
+
+In this example, you had to specify only the targetTime in the form of a
+timestamp. You didn't have to specify the base backup from which to start the
+recovery.
The backupID option is the one that allows you to specify the base backup
+from which to initiate the recovery process. By default, this value is
+empty.
If you assign a value to it (in the form of a Barman backup ID), the operator +uses that backup as the base for the recovery.
+Important
+You need to make sure that such a backup exists and is accessible.
+If you don't specify the backup ID, the operator detects the base backup for +the recovery as follows:
+targetTime or targetLSN, the operator selects the closest
+ backup that was completed before that target.VolumeSnapshot ObjectsThe following example demonstrates how to perform a Point-in-Time Recovery (PITR) using:
+VolumeSnapshot of the PGDATA directory, which provides the
+ base backup. This snapshot is specified in the recovery.volumeSnapshots
+ section and is named test-snapshot-1.ObjectStore
+ resource (not shown here), and referenced using the recovery.source field,
+ which points to an external cluster configuration.The cluster will be restored to a specific point in time using the
+recoveryTarget.targetTime option.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example-snapshot
+spec:
+ # ...
+ bootstrap:
+ recovery:
+ source: origin
+ volumeSnapshots:
+ storage:
+ name: test-snapshot-1
+ kind: VolumeSnapshot
+ apiGroup: snapshot.storage.k8s.io
+ recoveryTarget:
+ targetTime: "2023-07-06T08:00:39"
+ externalClusters:
+ - name: origin
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: minio-backup
+ serverName: cluster-example
+
+This setup enables CloudNativePG to restore the base data from a volume +snapshot and apply WAL segments from the object store to reach the desired +recovery target.
+Note
+If the backed-up cluster had walStorage enabled, you also must specify
+the volume snapshot containing the PGWAL directory, as mentioned in
+Recovery from VolumeSnapshot objects.
Warning
+It's your responsibility to ensure that the end time of the base backup in +the volume snapshot is before the recovery target timestamp.
+Warning
+If you added or removed a tablespace in your cluster +since the last base backup, replaying the WAL will fail. You need a base +backup between the time of the tablespace change and the recovery target +timestamp.
+Here are the recovery target criteria you can use:
+exclusive option.)Warning
+PostgreSQL recovery will stop when it encounters the first transaction that +occurs after the specified time. If no such transaction exists after the +target time, the recovery process will fail.
+exclusive option.)
+ Keep in mind that while transaction IDs are assigned sequentially at
+ transaction start, transactions can complete in a different numeric order.
+ The transactions that are recovered are those that committed before
+ (and optionally including) the specified one.pg_create_restore_point()) to which
+ recovery proceeds.exclusive option.)Important
+The operator can retrieve the closest backup when you specify either
+targetTime or targetLSN. However, this isn't possible for the remaining
+targets: targetName, targetXID, and targetImmediate. In such cases, it's
+mandatory to specify backupID.
This example uses a targetName-based recovery target:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+ bootstrap:
+ recovery:
+ source: origin
+ recoveryTarget:
+ backupID: 20220616T142236
+ targetName: 'restore_point_1'
+[...]
+
+You can choose only a single one among the targets in each recoveryTarget
+configuration.
Additionally, you can specify targetTLI to force recovery to a specific
+timeline.
By default, the previous parameters are considered to be inclusive, stopping +just after the recovery target, matching +the behavior in PostgreSQL.
+You can request exclusive behavior, stopping right before the recovery target,
+by setting the exclusive parameter to true. The following example shows
+this behavior, relying on a blob container in Azure for both base backups and
+the WAL archive:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-restore-pitr
+spec:
+ instances: 3
+
+ storage:
+ size: 5Gi
+
+ bootstrap:
+ recovery:
+ source: origin
+ recoveryTarget:
+ backupID: 20220616T142236
+ targetName: "maintenance-activity"
+ exclusive: true
+
+ externalClusters:
+ - name: origin
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: cluster-example-backup
+ serverName: cluster-example
+
+For the recovered cluster, you can configure the application database name and +credentials with additional configuration. To update application database +credentials, you can generate your own passwords, store them as secrets, and +update the database to use the secrets. Or you can also let the operator +generate a secret with a randomly secure password for use. +See Bootstrap an empty cluster +for more information about secrets.
+Important
+While the Cluster is in recovery mode, no changes to the database,
+including the catalog, are permitted. This restriction includes any role
+overrides, which are deferred until the Cluster transitions to primary.
+During this phase, users remain as defined in the source cluster.
The following example configures the app database with the owner app and
+the password stored in the provided secret app-secret, following the
+bootstrap from a live cluster.
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+[...]
+spec:
+ bootstrap:
+ recovery:
+ database: app
+ owner: app
+ secret:
+ name: app-secret
+ [...]
+
+With the above configuration, the following will happen only after recovery is +completed:
+app database does not exist, it will be created.app user does not exist, it will be created.app user is not the owner of the app database, ownership will be
+ granted to the app user.username value matches the owner value in the secret, the
+ password for the application user (the app user in this case) will be
+ updated to the password value in the secret.You can use the data uploaded to the object storage to bootstrap a new
+cluster from an existing backup. The operator orchestrates the recovery process
+using the barman-cloud-restore tool (for the base backup) and the
+barman-cloud-wal-restore tool (for WAL files, including parallel support, if
+requested).
For details and instructions on the recovery bootstrap method, see
+Bootstrap from a backup.
Important
+If you're not familiar with how
+PostgreSQL PITR
+works, we suggest that you configure the recovery cluster as the original
+one when it comes to .spec.postgresql.parameters. Once the new cluster is
+restored, you can then change the settings as desired.
The way it works is that the operator injects an init container in the first +instance of the new cluster, and the init container starts recovering the +backup from the object storage.
+Important
+The duration of the base backup copy in the new PVC depends on +the size of the backup, as well as the speed of both the network and the +storage.
+When the base backup recovery process is complete, the operator starts the
+Postgres instance in recovery mode. In this phase, PostgreSQL is up, though not
+able to accept connections, and the pod is healthy according to the
+liveness probe. By way of the restore_command, PostgreSQL starts fetching WAL
+files from the archive. You can speed up this phase by setting the
+maxParallel option and enabling the parallel WAL restore capability.
This phase terminates when PostgreSQL reaches the target, either the end of the
+WAL or the required target in case of PITR. You can optionally specify a
+recoveryTarget to perform a PITR. If left unspecified, the recovery continues
+up to the latest available WAL on the default target timeline (latest).
Once the recovery is complete, the operator sets the required superuser +password into the instance. The new primary instance starts as usual, and the +remaining instances join the cluster as replicas.
+The process is transparent for the user and is managed by the instance manager +running in the pods.
+When restoring a cluster, the manifest may include a plugins section with
+Barman Cloud plugin pointing to a backup object store resource. This enables
+the newly created cluster to begin archiving WAL files and taking backups
+immediately after recovery—provided backup policies are configured.
Avoid reusing the same ObjectStore configuration for both backup and
+recovery in the same cluster. If you must, ensure that each cluster uses a
+unique serverName to prevent accidental overwrites of backup or WAL archive
+data.
Warning
+CloudNativePG includes a safety check to prevent a cluster from overwriting
+existing data in a shared storage bucket. If a conflict is detected, the
+cluster remains in the Setting up primary state, and the associated pods will
+fail with an error. The pod logs will display:
+ERROR: WAL archive check failed for server recoveredCluster: Expected empty archive.
Important
+You can bypass this safety check by setting the
+cnpg.io/skipEmptyWalArchiveCheck annotation to enabled on the recovered
+cluster. However, this is strongly discouraged unless you are highly
+familiar with PostgreSQL's recovery process. Skipping the check incorrectly can
+lead to severe data loss. Use with caution and only in expert scenarios.
The first public release of CloudNativePG is version 1.15.0. Before that, +the product was entirely owned by EDB and distributed under the name of +"Cloud Native PostgreSQL".
+The list of changes in this page is only for informative purposes, to +demonstrate the history of the product on top of commits. +None of the versions listed here exists for CloudNativePG.
+Release date: 25 March 2022
+Features:
+LastBackupSucceeded condition for the Cluster objectmax_connections,
+ max_prepared_transactions, max_locks_per_transaction, max_wal_senders,
+ max_worker_processesOnline upgrade in progress phase in the Cluster object to show
+ when an online upgrade of the operator is in progresskubectl cnp plugin named maintenance to set
+ maintenance window to cluster(s) in one or all namespaces across the Kubernetes
+ clusterContainer Images:
+Security Enhancements:
+Fixes:
+pg_rewind by preserving a
+ copy of the original pg_control filepg_rewindprimary_conninfo to avoid random restarts with
+ PostgreSQL versions prior to 13ServiceAccount changes (e.g., labels, annotations) upon
+ reconciliationtargetLSN option when recovering a cluster with the LSN
+ specifiedRelease date: 17 February 2022
+Features:
+status.conditions. The condition ContinuousArchiving
+ indicates that the Cluster has started to archive WAL filescnp plugin for kubectl with additional information:
+ add a Cluster Summary section showing the status of the Cluster and a Certificates Status
+ section including the status of the certificates used in the Cluster along
+ with the time left to expirebarman-cloud-check-wal-archive command to detect a non-empty backup destination
+ when creating a new clusterSecret to add default monitoring queries through
+ MONITORING_QUERIES_SECRET configuration variable.cnp plugin for kubectl, now the plugin is available
+ on Windows x86 and ARMContainer Images:
+Security Fix:
+Fixes:
+.spec.postgresUID and .spec.postgresGID fields
+ in validation webhook. Changing these fields after Cluster creation makes PostgreSQL unable to startHealthy when not initialized yetRelease date: 11 January 2022
+Features:
+.spec.inheritedMetadata field to the Cluster allowing the user
+ to specify labels and annotations that will apply to all objects generated
+ by the ClusterFixes:
+wal_sender_timeout
+ and wal_receiver_timeout to 5 seconds to make sure standby nodes will
+ quickly notice if the primary has network issuesenablePodMonitor is falseRelease date: 15 December 2021
+Features:
+backupObjectStore.maxParallel option to set the maximum number of parallel
+ jobs to be executed during both WAL archiving (by PostgreSQL’s
+ archive_command) and WAL restore (by restore_command). Using parallel
+ restore option can allow newly promoted Standbys to get to a ready state faster
+ by fetching needed WAL files to replay in parallel rather than sequentiallyConfigMap called
+ default-monitoring is automatically deployed in the same namespace of the
+ operator and, by default, added to any existing Postgres cluster. Such behavior
+ can be changed globally by setting the MONITORING_QUERIES_CONFIGMAP parameter
+ in the operator’s configuration, or at cluster level through the
+ .spec.monitoring.disableDefaultQueries option (by default set to false)enablePodMonitor option in the monitoring section of a
+ cluster to automatically manage a PodMonitor resource and seamlessly
+ integrate with PrometheusstopDelay time, and a fast
+ shutdown for the remaining half, before the pod is killed by KubernetesswitchoverDelay option to control the time given to the former
+ primary to shut down gracefully and archive all the WAL files before
+ promoting the new primary (by default, CloudNativePG waits
+ indefinitely to privilege data durability)Cluster by
+ issuing a rolling updatestatus command of the cnp plugin for kubectl with
+ additional information: streaming replication status, total size of the
+ database, role of an instance in the clusterdynamic_shared_memory_type and shared_memory_type
+ parameters for PostgreSQL’s management of shared memoryENABLE_AZURE_PVC_UPDATES operator configuration option, by issuing a
+ rolling update of the cluster if needed (disabled by default)cnpg.io/reconciliationLoop annotation that, when
+ set to disabled on a given Postgres cluster, prevents the reconciliation
+ loop from runningpostInitApplicationSQL option as part of the initdb
+ bootstrap method to specify a list of SQL queries to be executed on the main
+ application database as a superuser immediately after the cluster has been
+ createdFixes:
+CrashLoopBackOff status.pg_rewind to re-align with the current primary after a timeline
+ diversion. This fixes the pod of the new standby from repeatedly being killed
+ by Kubernetes.Waited for
+ 1.182388649s due to client-side throttling, not priority and fairness,
+ request: GET)Pooler resource inherit the imagePullSecret defined in the
+ operator, if existsRelease date: 11 November 2021
+Features:
+Pooler resource and
+ controller to automatically manage a PgBouncer deployment to be used as a
+ connection pooler for a local PostgreSQL Cluster. The feature includes TLS
+ client/server connections, password authentication, High Availability, pod
+ templates support, configuration of key PgBouncer parameters, PAUSE/RESUME,
+ logging in JSON format, Prometheus exporter for stats, pools, and listsinitdb bootstrap
+ method to dataChecksums, encoding, localeCollate, localeCType,
+ walSegmentSize. This makes the options array obsolete and planned to be
+ removed in the v2 APIpostInitTemplateSQL option as part of the initdb bootstrap
+ method to specify a list of SQL queries to be executed on the template1
+ database as a superuser immediately after the cluster has been created. This
+ feature allows you to include default objects in all application databases
+ created in the clusterstatus command of the cnp plugin with information
+ about the backupk8s.enterprisedb.io/cluster label for every
+ object generated in a Cluster, including Backup objectspostgresql.cnpg.io/v1alpha1 on the Cluster, Backup, and
+ ScheduledBackup kindsSecurity:
+false for the operator containers
+ securityContextFixes:
+Release date: 15 October 2021
+Features:
+wal-archive for PostgreSQL's
+ archive_command, and wal-restore for restore_command in a standbyFixes:
+.history files
+ from being archived)--cloud-provider option to Barman Cloud tools only when using
+ Barman 2.13 or higher to avoid errors with older operandsRelease date: 30 September 2021
+This release is to celebrate the launch of
+PostgreSQL 14
+by making it the default major version when a new Cluster is created without
+defining a specific image name.
Fixes:
+Error while getting barman endpoint CA secret message to
+ appear in the logs of the primary pod, which prevented the backup to work
+ correctlyRelease date: 28 September 2021
+Version 1.9.0 is not available on OpenShift due to delays with the +release process and the subsequent release of version 1.9.1.
+Features:
+--restore-target-wal option in pg_rewind, in
+ order to fetch WAL files from the backup archive, if necessary (available
+ only with PostgreSQL 13+)pg_catalog.pg_largeobject table in each databaseFixes:
+postgres user when invoking pg_isready - required by
+ restricted SCC in OpenShiftFirstRecoverabilityPoint in the statusarchive_mode = always on the designated primary if backup is requestedRelease date: 13 September 2021
+Features:
+immediate option in scheduled backups to request a backup
+ immediately after the first Postgres instance running, adding the capability
+ to rewind to the very beginning of a cluster when Point-In-Time Recovery is
+ configuredfirstRecoverabilityPoint in the cluster status to report the oldest
+ consistent point in time to request a recovery based on the backup object
+ store’s contentEnhance the default Prometheus exporter for a PostgreSQL instance by exposing + the following new metrics:
+.ready and .done files in the archive status folderAdd support for the runonserver option when defining custom metrics in the
+ Prometheus exporter to limit the collection of a metric to a range of
+ PostgreSQL versions
pg_isready for the liveness probe.spec.imagePullPolicy to control the pull policy of image
+ containers for all pods and jobs created for a clusterlogLevel option to the cluster's spec to specify one of the
+ following levels: error, info, debug or traceSecurity Enhancements:
+.spec.enableSuperuserAccess to enable/disable network access with the
+ postgres user through password authenticationFixes:
+Release date: 11 August 2021
+Features:
+minSyncReplicas and maxSyncReplicas settings in case the cluster is not
+ able to meet the requirements during self-healing operationspostInitSQL option as part of the initdb bootstrap method
+ to specify a list of SQL queries to be executed as a superuser immediately
+ after the cluster has been createdFixes:
+LOCAL synchronous commit levelRelease date: 28 July 2021
+Features:
+logger called pgaudit
+ directly available in the JSON outputEnhance monitoring and observability capabilities through:
+pg_stat_statements and auto_explain extensionstarget_databases option in the Prometheus exporter to run a
+ user-defined metric query on one or more databases (including
+ auto-discovery of databases through shell-like pattern matching)manual_switchover_required metric to promptly report
+ whether a cluster with primaryUpdateStrategy set to supervised
+ requires a manual switchoverTransparently handle shared_preload_libraries for pg_audit,
+ auto_explain and pg_stat_statements
shared_preload_libraries for PostgreSQL when
+ pg_stat_statements, pgaudit or auto_explain options are added to
+ the postgresql parameters sectionSupport the cnpg.io/reload label to finely control the
+ automated reload of config maps and secrets, including those used for custom
+ monitoring/alerting metrics in the Prometheus exporter or to store certificates
reload command to the cnp plugin for kubectl to trigger a
+ reconciliation loop on the instancesadditionalPodAffinity and additionalPodAntiAffinityPodDisruptionBudget for primary instances, by
+ requiring at least a primary instance to run at any timeSecurity Enhancements:
+.spec.certificates.clientCASecret and
+ .spec.certificates.replicationTLSSecret options to define custom client
+ Certification Authority and certificate for the PostgreSQL server, to be used
+ to authenticate client certificates and secure communication between PostgreSQL
+ nodes.spec.backup.barmanObjectStore.endpointCA option to define the
+ custom Certification Authority bundle of the endpoint of Barman’s backup
+ object storeFixes:
+Release date: 12 July 2021
+Features:
+.spec.postgresql.promotionTimeout parameter to specify the maximum amount of
+ seconds to wait when promoting an instance to primary, defaulting to 40000000 seconds..spec.affinity.podAntiAffinityType parameter. It can be set to
+ preferred (default), resulting in
+ preferredDuringSchedulingIgnoredDuringExecution being used, or to
+ required, resulting in requiredDuringSchedulingIgnoredDuringExecution.Changes:
+kubectl-cnp on clusters without annotations.debug.Release date: 17 June 2021
+Change:
+Release date: 11 June 2021
+Features:
+pg_basebackup bootstrap method to create a new PostgreSQL
+ cluster as a copy of an existing PostgreSQL instance of the same major
+ version, even outside KubernetesAffinity section of the
+ Cluster resource, allowing users to distribute PostgreSQL instances on
+ Kubernetes nodes with the required taint<image>:<tag>@sha256:<digestValue> format, for more deterministic and
+ repeatable deploymentsSecurity Enhancements:
+sslmode for the WAL receiver process of internal and
+ automatically managed streaming replicas from require to verify-caChanges:
+promote subcommand of the cnp plugin for kubectl to accept
+ just the node number rather than the whole name of the podpg_basebackup methodBackup resource with beginWal, endWal, beginLSN,
+ endLSN, startedAt and stoppedAt regarding the physical base backupcnp-bench helm charts and guidelines for benchmarking the
+ storage and PostgreSQL for database workloadsimmediateCheckpoint and jobs parameter in
+ barmanObjectStore backupsRelease date: 18 May 2021
+Features:
+restart command to the cnp plugin for kubectl to restart
+ the pods of a given PostgreSQL cluster in a rollout fashionSecurity Enhancements:
+readOnlyRootFilesystem security context for podsChanges:
+control-plane: controller-manager to
+ app.kubernetes.io/name: cloudnative-pgmetrics name to the TCP port for the Prometheus exportercnp_metrics_exporter as the application_name to the metrics exporter
+ connection in PostgreSQLpostgres database-R when calling pg_basebackupRelease date: 23 Apr 2021
+Features:
+Security Enhancements:
+Changes:
+Release date: 6 Apr 2021
+Release date: 31 Mar 2021
+postgres_exporter for PrometheusRelease date: 3 Mar 2021
+kubectl cnp status to pretty-print the status of a cluster, including
+ JSON and YAML outputkubectl cnp certificate to enable TLS authentication for client applications-ro service to route connections to the available hot
+ standby replicas only, enabling offload of read-only queries from
+ the cluster's primary instancemaxSyncReplicasSIGINT signal (fast shutdown) to PostgreSQL process on SIGTERMRelease date: 4 Feb 2021
+The first major stable release of CloudNativePG implements Cluster,
+Backup and ScheduledBackup in the API group postgresql.cnpg.io/v1.
+It uses these resources to create and manage PostgreSQL clusters inside
+Kubernetes with the following main capabilities:
nodeSelector propertyHistory of user-visible changes for CloudNativePG, classified for each minor release.
+ +For information on the community support policy for CloudNativePG, please +refer to "Supported releases".
+Older releases:
+We also keep record of all the +release notes from 1.0.0 to 1.14.0 of Cloud Native PostgreSQL by EDB, +the predecessor of CloudNativePG.
+ +History of user-visible changes in the 1.15 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Warning
+This is expected to be the last release in the 1.15.X series. +Users are encouraged to update to a newer minor version soon.
+Release date: Oct 6, 2022
+Enhancements:
+leaseDuration and renewDeadline parameters in the controller
+ manager to enhance configuration of the leader election in operator
+ deployments (#759).check-empty-wal-archive is placed in the PGDATA immediately after
+ the cluster is bootstrapped and it is then removed after the first WAL
+ file is successfully archivedSecurity:
+distroless/static:nonroot container image, by using the
+ nonroot:nonroot user (#754)Fixes:
+metav1.Conditions struct (#720)MAPPEDMETRIC and DURATION metric types conversion in the native
+ Prometheus exporter (#765)Release date: Sep 6, 2022
+Enhancements:
+cnpg.io/instanceName and the cnpg.io/podRole labels
+ are always present on pods and PVCs (#632 and #680)role label of an instance to the underlying PVC (#634)Fixes:
+GetAllAccessibleDatabases
+ (#641)Release date: Aug 12, 2022
+Enhancements:
+huge_pages option for PostgreSQL (#456)instanceName and clusterName labels on jobs, pods, and PVCs to
+ improve interaction with these resources (#534)Security:
+securityContext to the Pooler deployment (#485)Fixes:
+stopDelay) and for switchover
+ (switchoverDelay), as they were erroneously swapped before. This is an
+ important fix, as it might block indefinitely restarts if switchoverDelay is
+ not set and uses the default value of 40000000 seconds (#531)Minor changes:
+Release date: Jul 7, 2022 (patch release)
+Enhancements:
+Changes:
+Fixes:
+WithActiveInstance function that
+ starts the CSV log pipe for the PostgreSQL server, ensuring proper logging in
+ the cluster initialization phase - this is especially useful in bootstrap
+ operations like recovery from a backup are failing (before this patch, such
+ logs were not sent to the standard output channel and were permanently lost)ALTER ROLE statementsbackupID in the recoveryTarget sectionbarman-cloud-wal-archive and
+ barman-cloud-backup commandsbarmanObjectStore.serverName option when recovering from
+ a backup object store using a server name that doesn’t match the current
+ cluster namecnpg plug-in:-n parameter when
+ specified by the userstatus command to sort results and remove variability in the
+ outputRelease date: May 27, 2022 (patch release)
+Minor changes:
+archive_timeout setting for PostgreSQL, which
+ was previously a fixed parameter (by default set to 5 minutes)backupOwnerReference in the scheduledBackup
+ resource to set the ownership reference on the created backup resources, with
+ possible values being none (default), self (objects owned by the scheduled
+ backup object), and cluster (owned by the Postgres cluster object)pg_stat_wal metrics for PostgreSQL 14 or
+ higher in the native Prometheus exporterFixes:
+postgresWithActiveInstance function to collect
+ logs from recovery bootstrap jobs and help in the troubleshooting phaserecovery bootstrap method, make sure that the recovery object
+ store and the backup object store are different to avoid overwriting existing
+ backupscnpg report operator command to correctly handle the case of a
+ deployment of the operator in high availability, with more than one replicainheritedMetadata set of labels
+ and annotations to the related resources of the cluster without requiring a
+ restartcnpg plugin to correctly parse any custom configmap and secret name
+ defined in the operator deployment, instead of relying just on the default
+ valuesminidocks/mkdocs image
+ for mkdocsRelease date: 21 April 2022
+Features:
+cnpg.io/fencedInstances
+ annotation, which, if not empty, disables switchover/failovers in the cluster;
+ fenced instances are shut down and the pod is kept running (while considered
+ not ready) for inspection and emergenciespg_hba.conf to be defined in the Postgres cluster spec
+ declaratively, enabling the optional use of Kubernetes secrets for sensitive
+ options such as ldapbindpasswdprimaryUpdateMethod option, accepting the values of
+ switchover (default) and restart, to be used in case of unsupervised
+ primaryUpdateStrategy; this method controls what happens to the primary
+ instance during the rolling update procedurereport command in the kubectl cnp plugin for better diagnosis and
+ more effective troubleshooting of both the operator and a specific Postgres
+ clusterBackup objects that are no longer in the backup object storeLSN in Point-In-Time Recovery
+ bootstrap methodAWS_SESSION_TOKEN authentication token in AWS S3 through
+ the sessionToken optionPooler pods set to
+ quay.io/enterprisedb/pgbouncer:1.17.0Fixes:
+targetTime correctly
+ works now, as previously a target prior to the latest available backup was
+ not possible (the detection algorithm was always wrong by selecting the last
+ backup as a starting point)pg_controldataVersion 1.15.0 is the first release of CloudNativePG. Previously, this software +was called EDB Cloud Native PostgreSQL (now EDB Postgres for Kubernetes). If you +are looking for information about a previous release, please refer to the +EDB documentation.
+ +History of user-visible changes in the 1.16 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Dec 21, 2022
+Warning
+This is expected to be the last release in the 1.16.X series. +Users are encouraged to update to a newer minor version soon.
+Important announcements:
+Enhancements:
+Fixes:
+Technical enhancements:
+ciclops for test summary (#1064): rely on the ciclops GitHub action to
+ provide summaries of the E2E suite, inheriting improvements from that projectRelease date: Nov 10, 2022
+Security:
+SeccomProfile to Pods and Containers (#888)Enhancements:
+status command for the cnpg plugin:Fixes:
+plpgsql functions (#974)ScheduledBackup format does not follow Kubernetes CronJob
+ format (#883)barmanObjectStore in documentation (#877)krew release (#866)Release date: Oct 6, 2022
+Enhancements:
+leaseDuration and renewDeadline parameters in the controller
+ manager to enhance configuration of the leader election in operator
+ deployments (#759).check-empty-wal-archive is placed in the PGDATA immediately after
+ the cluster is bootstrapped and it is then removed after the first WAL
+ file is successfully archivedSecurity:
+distroless/static:nonroot container image, by using the
+ nonroot:nonroot user (#754)Fixes:
+Release date: Sep 6, 2022
+Enhancements:
+cnpg.io/instanceName and the cnpg.io/podRole labels
+ are always present on pods and PVCs (#632 and #680)role label of an instance to the underlying PVC (#634)Fixes:
+shared_preload_libraries when bootstrapping the new cluster's primary
+ (#642)GetAllAccessibleDatabases
+ (#641)Release date: Aug 12, 2022
+Enhancements:
+huge_pages option for PostgreSQL (#456)instanceName and clusterName labels on jobs, pods, and PVCs to
+ improve interaction with these resources (#534)Security:
+securityContext to the Pooler deployment (#485)Fixes:
+stopDelay) and for switchover
+ (switchoverDelay), as they were erroneously swapped before. This is an
+ important fix, as it might block indefinitely restarts if switchoverDelay is
+ not set and uses the default value of 40000000 seconds (#531)postgres user while running pg_restore in database import (#411)sslmode in the monolith import
+ case to control SSL connections with the origin external server (#572)dbname was specified in
+ connectionParameters (#569)Minor changes:
+Release date: Jul 7, 2022 (minor release)
+Features:
+bootstrap.initdb.import section to provide a way to import objects via the
+ network from an existing PostgreSQL instance (even outside Kubernetes) inside a
+ brand new CloudNativePG cluster using the PostgreSQL logical backup concept
+ (pg_dump/pg_restore). The same method can be used to perform major
+ PostgreSQL upgrades on a new cluster. The feature introduces two types of
+ import: microservice (import one database only in the new cluster) and
+ monolith (import the selected databases and roles from the existing
+ instance).Enhancements:
+DROP EXTENSION IF EXISTS
+ unnecessarilypg_basebackup (this was only possible in the initdb bootstrap
+ so far)inheritFromAzureAD optionbarmanObjectStore.s3Credentials.region to define the region
+ in AWS (AWS_DEFAULT_REGION) for both backup and recovery object storesChanges:
+Fixes:
+WithActiveInstance function that
+ starts the CSV log pipe for the PostgreSQL server, ensuring proper logging in
+ the cluster initialization phase - this is especially useful in bootstrap
+ operations like recovery from a backup are failing (before this patch, such
+ logs were not sent to the standard output channel and were permanently lost)ALTER ROLE statementsbackupID in the recoveryTarget sectionbarman-cloud-wal-archive and
+ barman-cloud-backup commandsbarmanObjectStore.serverName option when recovering from
+ a backup object store using a server name that doesn’t match the current
+ cluster namecnpg plug-in:-n parameter when
+ specified by the userstatus command to sort results and remove variability in the
+ outputHistory of user-visible changes in the 1.17 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: March 20, 2023
+Warning
+This is expected to be the last release in the 1.17.X series. +Users are encouraged to update to a newer minor version soon.
+Enhancements:
+debug cluster's log level to the initdb job (#1503)pg_hba for the PgBouncer pooler (#1395)psql command to the cnpg plugin for kubectl (#1668) allowing
+ the user to start a psql session with a pod (the primary by default)Technical enhancements:
+Fixes:
+status command of the cnpg plugin (#1666)Release date: Feb 14, 2023
+Features:
+Enhancements:
+backup command in the cnpg plugin for kubectl to
+ issue a new base backup of the cluster (#1348)streaming_replica user (#1349)recovery (#1211)Technical enhancements:
+Fixes:
+walStorage resize when using pvcTemplate (#1315)PodMonitor reconcile if Prometheus is not installed (#1238)PodMonitor when not needed (#1213)Release date: Dec 21, 2022
+Important announcements:
+Enhancements:
+Fixes:
+Technical enhancements:
+ciclops for test summary (#1064): rely on the ciclops GitHub action to
+ provide summaries of the E2E suite, inheriting improvements from that projectRelease date: Nov 10, 2022
+Security:
+SeccomProfile to Pods and Containers (#888)Enhancements:
+status command for the cnpg plugin:Fixes:
+plpgsql functions (#974)ScheduledBackup format does not follow Kubernetes CronJob
+ format (#883)barmanObjectStore in documentation (#877)krew release (#866)Release date: Oct 6, 2022
+Enhancements:
+leaseDuration and renewDeadline parameters in the controller
+ manager to enhance configuration of the leader election in operator
+ deployments (#759).check-empty-wal-archive is placed in the PGDATA immediately after
+ the cluster is bootstrapped and it is then removed after the first WAL
+ file is successfully archivedSecurity:
+distroless/static:nonroot container image, by using the
+ nonroot:nonroot user (#754)Fixes:
+MAPPEDMETRIC and DURATION metric types conversion in the native
+ Prometheus exporter (#765)Release date: Sep 6, 2022 (minor release)
+Features:
+walStorage section to separate WAL files (pg_wal) in a dedicated volume,
+ separate from the PGDATA defined in the main and mandatory storage section
+ (#513). Current limitations: walStorage can only be set at cluster creation
+ and cannot be added or removed when the cluster is up and running.Enhancements:
+cnpg.io/instanceName and the cnpg.io/podRole labels
+ are always present on pods and PVCs (#632 and #680)role label of an instance to the underlying PVC (#634)kubectl cnpg destroy command to help remove an instance and
+ all the associated PVCs (#643)Fixes:
+shared_preload_libraries when bootstrapping the new cluster's primary
+ (#642)GetAllAccessibleDatabases
+ (#641)History of user-visible changes in the 1.18 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: June 12, 2023
+Warning
+This is expected to be the last release in the 1.18.X series. +Users are encouraged to update to a newer minor version soon.
+Enhancements:
+snapshot command to the cnpg plugin to create a consistent cold
+ backup of the cluster from a standby using the Kubernetes VolumeSnapshot
+ standard resource (#1960).spec.bootstrap.recovery.volumeSnapshot stanza (#1960)pg_failover_slots to managed extensions (#2057)schemaOnly option in the import stanza, to avoid exporting
+ and importing data when you bootstrap a new Postgres Cluster from one or more
+ existing databases (#2234)PodMonitor support to the Poolerfor PgBouncer (#2034)Pooler resource (#1983)Technical enhancements:
+Fixes:
+Release date: April 27, 2023
+Important
+CloudNativePG is dropping support for PostgreSQL 10, as PostgreSQL 10 +reached End-of-Life (EOL) in November 2022. Versions 11 and newer are +supported. Please plan your migration to PostgreSQL 15 as soon as possible. +Refer to "Importing Postgres databases" +for more information on PostgreSQL major offline upgrades.
+Enhancements:
+--logs option of the report command of the cnpg plugin for
+ kubectl to also include the previous logs where available (#1811)-any service is now disabled by default (#1755)Security:
+SeccompProfile through override via a local file (#1827)Fixes:
+initdb
+ bootstrap phase, before the server is started the first time (#1858)Release date: March 20, 2023
+Enhancements:
+debug cluster's log level to the initdb job (#1503)pg_hba for the PgBouncer pooler (#1395)psql command to the cnpg plugin for kubectl (#1668) allowing
+ the user to start a psql session with a pod (the primary by default)Technical enhancements:
+--name option introduced with Barman 3.3 to make the
+ association between backups and the object store more robust.Fixes:
+status command of the cnpg plugin (#1666)nodeAffinity is applied even if AdditionalPodAffinity and
+ AdditionalPodAntiAffinity are not set (#1663)Release date: Feb 14, 2023
+Enhancements:
+backup command in the cnpg plugin for kubectl to
+ issue a new base backup of the cluster (#1348)min_wal_size, max_wal_size,
+ keep_wal_size, wal_keep_segments, as well as the maximum number of WALs
+ that can be stored in the dedicated volume (#1382)streaming_replica user (#1349)recovery (#1211)Technical enhancements:
+Fixes:
+CHECKPOINT as the streaming_replica user (#1408)waitForWalArchiveWorking resilient to connection errors (#1399)walStorage resize when using pvcTemplate (#1315)ExecCommand obeys timeout (#1242)PodMonitor reconcile if Prometheus is not installed (#1238)PodMonitor when not needed (#1213)Release date: Dec 21, 2022
+Important announcements:
+postgresql as a label to identify the
+ CNPG cluster. In the remote case you have used this label, please start using
+ the cnpg.io/cluster label instead (#1130)Enhancements:
+nodeAffinity support (#1182) - allows for richer scheduling optionslevel
+ and ts fields might interfere with the existing logging (#843)fio command to the kubectl-cnpg plugin (#1097)Fixes:
+--dry-run
+(#1088)Technical enhancements:
+ciclops for test summary (#1064): rely on the ciclops GitHub action to
+ provide summaries of the E2E suite, inheriting improvements from that projectRelease date: Nov 10, 2022
+Features:
+kubectl cnpg hibernate on/off/status <cluster-name>.
+ Hibernation destroys all the resources generated by the cluster, except the
+ PVCs that belong to the PostgreSQL primary instance (#782)Security:
+SeccomProfile to Pods and Containers (#888)Enhancements:
+pvcTemplate (#914)status command for the cnpg plugin:cnpg plugin:pgbench generates a job definition executing pgbench against a cluster
+(#958)install generates an installation manifest for the operator (#944)Fixes:
+plpgsql functions (#974)ScheduledBackup format does not follow Kubernetes CronJob
+ format (#883)barmanObjectStore in documentation (#877)krew release (#866)History of user-visible changes in the 1.19 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Nov 3, 2023
+Warning
+This is expected to be the last release in the 1.19.X series. +Users are encouraged to update to a newer minor version soon.
+Enhancements:
+status command of the cnpg plugin for kubectl with progress
+ information on active streaming base backups (#3101)max_prepared_statements with the pgBouncer
+ Pooler resource (#3174)Fixes:
+synchronous_commit = local
+ when managing the PostgreSQL cluster (#3143).spec.certificates.replicationTLSSecret are now working (#3209)cnpg.io/cluster label to the Pooler pods (#3153)Changes:
+postgresql.auto.conf file inside PGDATA to control Postgres
+ replication settings, and replace it with a file named override.conf(#2812)Technical enhancements:
+Release date: Oct 11, 2023
+Warning
+Version 1.19 will reach its End-of-Life (EOL) on November 9, 2023. +If you haven't done it yet, please start planning an upgrade +as soon as possible.
+Important Changes:
+stopDelay to 1800 seconds instead of 30 seconds
+ (#2848)smartShutdownTimeout, to control the
+ window of time reserved for the smart shutdown of Postgres to complete; the
+ general formula to compute the overall timeout to stop Postgres is
+ max(stopDelay - smartShutdownTimeout, 30) (#2848)startDelay to 3600, instead of 30 seconds
+ (#2847)switchoverDelay to 3600 seconds instead of
+ 40000000 seconds (#2846)postgresql label - replaced by cnpg.io/cluster in
+ 1.18 (#2744)Security:
+seccompProfile to the operator deployment (#2926)Enhancements:
+cnpg.io/coredumpFilter annotation to control the content of a
+ core dump generated in the unlikely event of a PostgreSQL crash, by default
+ set to exclude shared memory segments from the dump (#2733)shared_buffers are coherent with the pods'
+ allocated memory resources (#2840)uri and jdbc-uri fields in the credential secrets to facilitate
+ developers when connecting their applications to the database (#2186)Waiting for the instances to become active for finer
+ control of a cluster's state waiting for the replicas to be ready (#2612)podSpec annotation
+ (#2243)status command
+ (#2953)Fixes:
+cnpg.io/skipEmptyWalArchiveCheck annotation during recovery to
+ bypass the check for an empty WAL archive (#2731)app both the owner and database during recovery bootstrap
+ (#2957)ResourceName sections
+ of the generated roles (#2875)ContinuousArchiving condition is properly set to 'failed'
+ in case of errors (#2625)Backup resource reconciliation cycle more resilient on
+ interruptions by stopping only if the backup is completed or failed (#2591)labels and annotations (#2583)resourceNames on the Role object
+ (#2956)Observability:
+pg_stat_replication metric (#2961)pg_wal_stat default metric for Prometheus (#2569)pg_replication default metric for Prometheus (#2744 and
+ #2750)alertInstanceLabelFilter instead of alertName in the provided
+ Grafana dashboardstandard_conforming_strings in metric collection (#2888)Changes:
+cnpg hibernate plugin command use the
+ ClusterManifestAnnotationName and PgControldataAnnotationName annotations
+ on PVCs (#2657)cnpg.io/instanceRole label while deprecating the existing role
+ label (#2915)Technical enhancements:
+k8s-api-docgen with gen-crd-api-reference-docs to automatically
+ build the API reference documentation (#2606)Release date: July 27, 2023
+Enhancements:
+logs command in the kubectl plugin, to retrieve or follow the logs
+ of all pods in a cluster (#2375)Technical enhancements:
+Fixes:
+Security:
+Release date: June 12, 2023
+Enhancements:
+snapshot command to the cnpg plugin to create a consistent cold
+ backup of the cluster from a standby using the Kubernetes VolumeSnapshot
+ standard resource (#1960).spec.bootstrap.recovery.volumeSnapshot stanza (#1960)pg_failover_slots to managed extensions (#2057)schemaOnly option in the import stanza, to avoid exporting
+ and importing data when you bootstrap a new Postgres Cluster from one or more
+ existing databases (#2234)PodMonitor support to the Poolerfor PgBouncer (#2034)Pooler resource (#1983)Technical enhancements:
+Fixes:
+Release date: April 27, 2023
+Enhancements:
+--logs option of the report command of the cnpg plugin for
+ kubectl to also include the previous logs where available (#1811)-any service is now disabled by default (#1755)Security:
+SeccompProfile through override via a local file (#1827)Fixes:
+initdb
+ bootstrap phase, before the server is started the first time (#1858)Release date: March 20, 2023
+Enhancements:
+debug cluster's log level to the initdb job (#1503)pg_hba for the PgBouncer pooler (#1395)psql command to the cnpg plugin for kubectl (#1668) allowing
+ the user to start a psql session with a pod (the primary by default)Technical enhancements:
+--name option introduced with Barman 3.3 to make the
+ association between backups and the object store more robust.Fixes:
+status command of the cnpg plugin (#1666)nodeAffinity is applied even if AdditionalPodAffinity and
+ AdditionalPodAntiAffinity are not set (#1663)Release date: Feb 14, 2023
+Important announcements:
+Features:
+.spec.backup.target option accepting
+ that when set to prefer-standby will run take the physical base backup from
+ the most aligned replica (#1162)failoverDelay parameter to delay the
+ failover process once the primary has been detected unhealthy (#1366)Enhancements:
+backup command in the cnpg plugin for kubectl to
+ issue a new base backup of the cluster (#1348)min_wal_size, max_wal_size,
+ keep_wal_size, wal_keep_segments, as well as the maximum number of WALs
+ that can be stored in the dedicated volume (#1382)streaming_replica user (#1349)recovery (#1211)Technical enhancements:
+Fixes:
+CHECKPOINT as the streaming_replica user (#1408)waitForWalArchiveWorking resilient to connection errors (#1399)walStorage resize when using pvcTemplate (#1315)ExecCommand obeys timeout (#1242)PodMonitor reconcile if Prometheus is not installed (#1238)PodMonitor when not needed (#1213)History of user-visible changes in the 1.20 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Feb 2, 2024
+Warning
+This is expected to be the last release in the 1.20.X series. +Users are encouraged to update to a newer minor version soon.
+Enhancements:
+ephemeralVolumeSource option (#3678)pgadmin4 command in the cnpg plugin for kubectl,
+ providing a straightforward method to demonstrate connecting to a given
+ database cluster and navigate its content in a local environment such as kind -
+ for evaluation purposes only (#3701).spec.postgresql.pg_ident stanza, through a list of user name maps (#3534)Fixes:
+pg_rewind failing due to
+ postgresql.auto.conf being read-only on clusters where the ALTER SYSTEM
+ SQL command is disabled - the default (#3728)initdb bootstrap method, by disabling the durability settings in the PostgreSQL
+ instance for the duration of the import process (#3743)pgpass connection fields (#3713).spec.projectedVolumeTemplate.sources (#3647)Release date: Dec 21, 2023
+Security:
+ssl_ciphers,
+ ssl_min_protocol_version, and ssl_max_protocol_version GUCs (#3408).Enhancements:
+ALTER SYSTEM command (#3535).podMonitorMetricRelabelings and podMonitorRelabelings options in the
+ .spec.monitoring stanza of the Cluster and Pooler resources (#3075).PGPASSFILE environment variable when
+ establishing a network connection to PostgreSQL (#3522).cnpg report plugin command by collecting a cluster's PVCs (#3357).cnpg status plugin command, providing information about
+ managed roles, including alerts (#3310).Pooler resource to 0 is now possible (#3517).cnpg.io/podRole label with a value of 'pooler' to every
+ pooler deployment, differentiating them from instance pods (#3396).Fixes:
+PodDisruptionBudget
+ resources (#3312 and #3434).cnpg backup plugin command when
+ either immediate-checkpoint or wait-for-archive options are requested (#3449).Pooler resource (#3428).RoleBinding for Pooler resources (#3391).imagePullSecret for Pooler resources (#3389).Pooler and addition of the required labels (#3349).Pooler labels to the deployment as well, not just the pods (#3350).Changes:
+Release date: Nov 3, 2023
+Enhancements:
+status command of the cnpg plugin for kubectl with progress
+ information on active streaming base backups (#3101)max_prepared_statements with the pgBouncer
+ Pooler resource (#3174)Fixes:
+synchronous_commit = local
+ when managing the PostgreSQL cluster (#3143).spec.certificates.replicationTLSSecret are now working (#3209)cnpg.io/cluster label to the Pooler pods (#3153)Changes:
+postgresql.auto.conf file inside PGDATA to control Postgres
+ replication settings, and replace it with a file named override.conf (#2812)Technical enhancements:
+Release date: Oct 11, 2023
+Important Changes:
+stopDelay to 1800 seconds instead of 30 seconds
+ (#2848)smartShutdownTimeout, to control the
+ window of time reserved for the smart shutdown of Postgres to complete; the
+ general formula to compute the overall timeout to stop Postgres is
+ max(stopDelay - smartShutdownTimeout, 30) (#2848)startDelay to 3600, instead of 30 seconds
+ (#2847)switchoverDelay to 3600 seconds instead of
+ 40000000 seconds (#2846)postgresql label - replaced by cnpg.io/cluster in
+ 1.18 (#2744)Security:
+seccompProfile to the operator deployment (#2926)Enhancements:
+cnpg.io/coredumpFilter annotation to control the content of a
+ core dump generated in the unlikely event of a PostgreSQL crash, by default
+ set to exclude shared memory segments from the dump (#2733)shared_buffers are coherent with the pods'
+ allocated memory resources (#2840)uri and jdbc-uri fields in the credential secrets to facilitate
+ developers when connecting their applications to the database (#2186)Waiting for the instances to become active for finer
+ control of a cluster's state waiting for the replicas to be ready (#2612)podSpec annotation
+ (#2243)status command
+ (#2953)Fixes:
+cnpg.io/skipEmptyWalArchiveCheck annotation during recovery to
+ bypass the check for an empty WAL archive (#2731)app both the owner and database during recovery bootstrap
+ (#2957)ResourceName sections
+ of the generated roles (#2875)ContinuousArchiving condition is properly set to 'failed'
+ in case of errors (#2625)Backup resource reconciliation cycle more resilient on
+ interruptions by stopping only if the backup is completed or failed (#2591)labels and annotations (#2583)resourceNames on the Role object
+ (#2956)Observability:
+pg_stat_replication metric (#2961)pg_wal_stat default metric for Prometheus (#2569)pg_replication default metric for Prometheus (#2744 and
+ #2750)alertInstanceLabelFilter instead of alertName in the provided
+ Grafana dashboardstandard_conforming_strings in metric collection (#2888)Changes:
+cnpg snapshot command with cnpg backup -m volumeSnapshot for
+ the kubectl plugincnpg hibernate plugin command use the
+ ClusterManifestAnnotationName and PgControldataAnnotationName annotations
+ on PVCs (#2657)cnpg.io/instanceRole label while deprecating the existing role
+ label (#2915)Technical enhancements:
+k8s-api-docgen with gen-crd-api-reference-docs to automatically
+ build the API reference documentation (#2606)Release date: July 27, 2023
+Enhancements:
+logs command in the kubectl plugin, to retrieve or follow the logs
+ of all pods in a cluster (#2375)Technical enhancements:
+Fixes:
+Security:
+Release date: June 12, 2023
+Enhancements:
+snapshot command to the cnpg plugin to create a consistent cold
+ backup of the cluster from a standby using the Kubernetes VolumeSnapshot
+ standard resource (#1960).spec.bootstrap.recovery.volumeSnapshot stanza (#1960)pg_failover_slots to managed extensions (#2057)schemaOnly option in the import stanza, to avoid exporting
+ and importing data when you bootstrap a new Postgres Cluster from one or more
+ existing databases (#2234)PodMonitor support to the Poolerfor PgBouncer (#2034)Pooler resource (#1983)Technical enhancements:
+Fixes:
+Release date: April 27, 2023
+Important changes from previous versions
+CloudNativePG 1.20 introduces some changes to the default behavior of a
+few features for newly created Cluster resources, compared to previous
+versions of the operator. The goal of these changes is to improve the
+resilience of a Postgres cluster out of the box through convention over
+configuration. For clusters with one or more replicas:
target is
+ explicitly set to primaryprimaryUpdateMethod
+ defaults to restart, unless explicitly set to switchover)For further information, please refer to the "Installation and upgrades" section.
+Features:
+managed.roles stanza in the
+ Cluster spec to provide full lifecycle management of database roles, by
+ providing an abstraction to the related DDL commands in PostgreSQL, such as
+ CREATE ROLE and ALTER ROLE (#1524, #1793 and #1815)cnpg.io/hibernation to declaratively hibernate a
+ PostgreSQL cluster by deleting all pods and keeping the PVCs only; the feature
+ also implements the inverse procedure (#1657)Enhancements:
+--logs option of the report command of the cnpg plugin for
+ kubectl to also include the previous logs where available (#1811)-any service is now disabled by default (#1755)Security:
+SeccompProfile through override via a local file (#1827)Fixes:
+initdb
+ bootstrap phase, before the server is started the first time (#1858)History of user-visible changes in the 1.21 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Jun 12, 2024
+Warning
+This is expected to be the last release in the 1.21.X series. +Users are encouraged to update to a newer minor version soon.
+Enabled configuration of standby-sensitive parameters during recovery using a + physical backup (#4564)
+Enabled the configuration of the liveness probe timeout via the
+ .spec.livenessProbeTimeout option (#4719)
cnpg plugin for kubectl:
--color
+ option, which accepts always, never, and auto (default) as values
+ (#4775)Prevented fenced instances from entering an unnecessary loop and consuming + all available CPU (#4625)
+Resolved an issue where the instance manager on the primary would + indefinitely wait for the instance to start after encountering a failure + following a stop operation (#4434)
+Fixed an issue where the interaction between hot_standby_feedback and
+ managed cluster-level replication slots was preventing the autovacuum from
+ operating correctly; this issue was causing disk space to remain occupied by
+ dead tuples (#4811)
Fixed a panic in the backup controller that occurred when pod container + statuses were missing (#4765)
+Prevented unnecessary shutdown of the instance manager (#4670)
+Prevented unnecessary reloads of PostgreSQL configuration when unchanged (#4531)
+Prevented unnecessary reloads of the ident map by ensuring a consistent and + unique method of writing its content (#4648)
+Avoided conflicts during phase registration by patching the status of the + resource instead of updating it (#4637)
+Implemented a timeout when restarting PostgreSQL and lifting fencing (#4504)
+Ensured that a replica cluster is restarted after promotion to properly set + the archive mode (#4399)
+Removed an unneeded concurrent keep-alive routine that was causing random + failures in volume snapshot backups (#4768)
+Ensured correct parsing of the additional rows field returned when the
+ pgaudit.log_rows option was enabled, preventing audit logs from being
+ incorrectly routed to the normal log stream (#4394)
cnpg plugin for kubectl:
cnpg status command (#4530)Release date: Apr 24, 2024
+wal_log_hints PostgreSQL parameter (#4218)
+ (#4218)barman-cloud-wal-restore in recovery
+ bootstrap (#4101)PGDATA in case of failure of the restore job (#4151)cnpg plugin for kubectl:destroy, fencing, and promote
+ plugin commands (#4280)cnpg status
+ (#4131)sync-sequences command
+ (#4346)Release date: Mar 14, 2024
+wal_level GUC in PostgreSQL (#4020)cnpg.io/skipWalArchiving annotation to disable WAL archiving when
+ set to enabled (#4055)cnpg plugin for kubectl with the publication and
+ subscription command groups to imperatively set up PostgreSQL native
+ logical replication (#4052)CERTIFICATE_DURATION and EXPIRING_CHECK_THRESHOLD
+ for automated management of TLS certificates handled by the operator (#3686)cnpg plugin for
+ kubectl (#3875)wal_sender_timeout when cloning a replica to prevent timeout errors
+ due to slow connections (#4080)Role instead of ClusterRole for operator permissions in OLM,
+ requiring fewer privileges when installed on a per-namespace basis (#3855,
+ #3990)--server-side option by default
+ when installing the operator (#3729).Release date: Feb 2, 2024
+ephemeralVolumeSource option (#3678)pgadmin4 command in the cnpg plugin for kubectl,
+ providing a straightforward method to demonstrate connecting to a given
+ database cluster and navigate its content in a local environment such as kind -
+ for evaluation purposes only (#3701).spec.postgresql.pg_ident stanza, through a list of user name maps (#3534)pg_rewind failing due to
+ postgresql.auto.conf being read-only on clusters where the ALTER SYSTEM
+ SQL command is disabled - the default (#3728)initdb bootstrap method, by disabling the durability settings in the PostgreSQL
+ instance for the duration of the import process (#3743)pgpass connection fields (#3713).spec.projectedVolumeTemplate.sources (#3647)postgresql.auto.conf any
+ options now incorporated into override.conf (#3773)Release date: Dec 21, 2023
+ssl_ciphers,
+ ssl_min_protocol_version, and ssl_max_protocol_version GUCs (#3408).ALTER SYSTEM command (#3535).podMonitorMetricRelabelings and podMonitorRelabelings options in the
+ .spec.monitoring stanza of the Cluster and Pooler resources (#3075).PGPASSFILE environment variable when
+ establishing a network connection to PostgreSQL (#3522).cnpg report plugin command by collecting a cluster's PVCs (#3357).cnpg status plugin command, providing information about
+ managed roles, including alerts (#3310).Pooler resource to 0 is now possible (#3517).cnpg.io/podRole label with a value of 'pooler' to every
+ pooler deployment, differentiating them from instance pods (#3396).PodDisruptionBudget
+ resources (#3312 and #3434).cnpg backup plugin command when
+ either immediate-checkpoint or wait-for-archive options are requested (#3449).Pooler resource (#3428).RoleBinding for Pooler resources (#3391).imagePullSecret for Pooler resources (#3389).Pooler and addition of the required labels (#3349).Pooler labels to the deployment as well, not just the pods (#3350).Release date: Nov 3, 2023
+online option and the onlineConfiguration stanza in
+ .spec.backup.volumeSnapshot. Unless explicitly set, backups on volume
+ snapshots are now taken online by default (#3102)ScheduledBackup and Backup resources (#3208, #3226)backup command of the cnpg plugin for kubectl improves the volume
+ snapshot backup experience through the --online, --immediate-checkpoint,
+ and --wait-for-archive runtime optionsstatus command of the cnpg plugin for kubectl with progress
+ information on active streaming base backups (#3101)max_prepared_statements with the pgBouncer
+ Pooler resource (#3174)synchronous_commit = local
+ when managing the PostgreSQL cluster (#3143).spec.certificates.replicationTLSSecret are now working (#3209)cnpg.io/cluster label to the Pooler pods (#3153)VolumeSnapshots resources and render them
+ into more appropriate annotations (#3151).spec.backup.volumeSnapshot to
+ falsepostgresql.auto.conf file inside PGDATA to control Postgres
+ replication settings, and replace it with a file named override.conf (#2812)Release date: Oct 12, 2023
+Important changes from previous versions
+This release contains a few changes to the default settings of +CloudNativePG with the goal to improve general stability and security through +predefined values. If you are upgrading from a previous version, please +carefully read the "Important Changes" section below, as well as the +upgrade documentation.
+Volume Snapshot support for backup and recovery: leverage the standard + Kubernetes API on Volume Snapshots to take advantage of capabilities like + incremental and differential copy for both backup and recovery operations. This + first step, covering cold backups from a standby, will continue in 1.22 with + support for hot backups using the PostgreSQL API and tablespaces.
+OLM installation method: introduce support for Operator Lifecycle Manager + via OperatorHub.io for the latest patch version of the latest minor release + through the stable channel. Many thanks to EDB for donating the bundle of + their "EDB Postgres for Kubernetes" operator and adapting it for CloudNativePG.
+stopDelay to 1800 seconds instead of 30 seconds
+ (#2848)smartShutdownTimeout, to control the
+ window of time reserved for the smart shutdown of Postgres to complete; the
+ general formula to compute the overall timeout to stop Postgres is
+ max(stopDelay - smartShutdownTimeout, 30) (#2848)startDelay to 3600, instead of 30 seconds
+ (#2847)switchoverDelay to 3600 seconds instead of
+ 40000000 seconds (#2846)postgresql label - replaced by cnpg.io/cluster in
+ 1.18 (#2744)seccompProfile to the operator deployment (#2926)cnpg.io/coredumpFilter annotation to control the content of a
+ core dump generated in the unlikely event of a PostgreSQL crash, by default
+ set to exclude shared memory segments from the dump (#2733)shared_buffers are coherent with the pods'
+ allocated memory resources (#2840)uri and jdbc-uri fields in the credential secrets to facilitate
+ developers when connecting their applications to the database (#2186)Waiting for the instances to become active for finer
+ control of a cluster's state waiting for the replicas to be ready (#2612)podSpec annotation
+ (#2243)status command
+ (#2953)cnpg.io/skipEmptyWalArchiveCheck annotation during recovery to
+ bypass the check for an empty WAL archive (#2731)app both the owner and database during recovery bootstrap
+ (#2957)ResourceName sections
+ of the generated roles (#2875)ContinuousArchiving condition is properly set to 'failed'
+ in case of errors (#2625)Backup resource reconciliation cycle more resilient on
+ interruptions by stopping only if the backup is completed or failed (#2591)labels and annotations (#2583)resourceNames on the Role object
+ (#2956)Observability:
+pg_stat_replication metric (#2961)pg_wal_stat default metric for Prometheus (#2569)pg_replication default metric for Prometheus (#2744 and
+ #2750)alertInstanceLabelFilter instead of alertName in the provided
+ Grafana dashboardstandard_conforming_strings in metric collection (#2888)cnpg snapshot command with cnpg backup -m volumeSnapshot for
+ the kubectl plugincnpg hibernate plugin command use the
+ ClusterManifestAnnotationName and PgControldataAnnotationName annotations
+ on PVCs (#2657)cnpg.io/instanceRole label while deprecating the existing role
+ label (#2915)k8s-api-docgen with gen-crd-api-reference-docs to automatically
+ build the API reference documentation (#2606)History of user-visible changes in the 1.22 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Jul 29, 2024
+Warning
+This is expected to be the last release in the 1.22.X series. +Users are encouraged to update to a newer minor version soon.
+allow_alter_system parameter,
+ enabling or disabling the ALTER SYSTEM command through the
+ .spec.postgresql.enableAlterSystem option (#4921).reconcilePodSpec annotation on the Cluster and Pooler
+ resources to control the restart of pods following a change in the Pod
+ specification (#5069).Pooler metrics
+ collector (#5044).reusePVC flag is
+ set to false (#2056).pg_rewind execution for newly created replicas that employ
+ VolumeSnapshot during bootstrapping to avoid introducing a new shutdown
+ checkpoint entry in the WAL files. This ensures that replicas can reconnect to
+ the primary without issues, which would otherwise be hindered by the additional
+ checkpoint entry (#5081).postmaster to shut down before starting it again (#4938).pg_database metric monitoring and alerting
+ processes (#4980).Pooler instance when
+ the operator image is upgraded (#5006)postmaster was accessible (#4920).Pooler (#4967).AccessModes in the template only when unspecified (#4845).cnpg plugin:status command execution.status command (#4915).Release date: Jun 12, 2024
+Warning
+Version 1.22 is approaching its End-of-Life (EOL) on Jul 24, 2024. +If you haven't already, please begin planning for an upgrade promptly to +ensure continued support and security.
+Enabled configuration of standby-sensitive parameters during recovery using a + physical backup (#4564)
+Enabled the configuration of the liveness probe timeout via the
+ .spec.livenessProbeTimeout option (#4719)
cnpg plugin for kubectl:
--color
+ option, which accepts always, never, and auto (default) as values
+ (#4775)Prevented fenced instances from entering an unnecessary loop and consuming + all available CPU (#4625)
+Resolved an issue where the instance manager on the primary would + indefinitely wait for the instance to start after encountering a failure + following a stop operation (#4434)
+Fixed an issue where the interaction between hot_standby_feedback and
+ managed cluster-level replication slots was preventing the autovacuum from
+ operating correctly; this issue was causing disk space to remain occupied by
+ dead tuples (#4811)
Fixed a panic in the backup controller that occurred when pod container + statuses were missing (#4765)
+Prevented unnecessary shutdown of the instance manager (#4670)
+Prevented unnecessary reloads of PostgreSQL configuration when unchanged (#4531)
+Prevented unnecessary reloads of the ident map by ensuring a consistent and + unique method of writing its content (#4648)
+Avoided conflicts during phase registration by patching the status of the + resource instead of updating it (#4637)
+Implemented a timeout when restarting PostgreSQL and lifting fencing (#4504)
+Ensured that a replica cluster is restarted after promotion to properly set + the archive mode (#4399)
+Removed an unneeded concurrent keep-alive routine that was causing random + failures in volume snapshot backups (#4768)
+Ensured correct parsing of the additional rows field returned when the
+ pgaudit.log_rows option was enabled, preventing audit logs from being
+ incorrectly routed to the normal log stream (#4394)
cnpg plugin for kubectl:
cnpg status command (#4530)Release date: Apr 24, 2024
+wal_log_hints PostgreSQL parameter (#4218)
+ (#4218)barman-cloud-wal-restore in recovery
+ bootstrap (#4101)PGDATA in case of failure of the restore job (#4151)cnpg plugin for kubectl:destroy, fencing, and promote
+ plugin commands (#4280)cnpg status
+ (#4131)sync-sequences command
+ (#4346)Release date: Mar 14, 2024
+wal_level GUC in PostgreSQL (#4020)cnpg.io/skipWalArchiving annotation to disable WAL archiving when
+ set to enabled (#4055)cnpg plugin for kubectl with the publication and
+ subscription command groups to imperatively set up PostgreSQL native
+ logical replication (#4052)CERTIFICATE_DURATION and EXPIRING_CHECK_THRESHOLD
+ for automated management of TLS certificates handled by the operator (#3686)cnpg plugin for
+ kubectl (#3875)wal_sender_timeout when cloning a replica to prevent timeout errors
+ due to slow connections (#4080)Role instead of ClusterRole for operator permissions in OLM,
+ requiring fewer privileges when installed on a per-namespace basis (#3855,
+ #3990)--server-side option by default
+ when installing the operator (#3729).Release date: Feb 2, 2024
+Enhancements:
+ephemeralVolumeSource option (#3678)pgadmin4 command in the cnpg plugin for kubectl,
+ providing a straightforward method to demonstrate connecting to a given
+ database cluster and navigate its content in a local environment such as kind -
+ for evaluation purposes only (#3701).spec.postgresql.pg_ident stanza, through a list of user name maps (#3534)Fixes:
+pg_rewind failing due to
+ postgresql.auto.conf being read-only on clusters where the ALTER SYSTEM
+ SQL command is disabled - the default (#3728)initdb bootstrap method, by disabling the durability settings in the PostgreSQL
+ instance for the duration of the import process (#3743)pgpass connection fields (#3713).spec.projectedVolumeTemplate.sources (#3647)postgresql.auto.conf any
+ options now incorporated into override.conf (#3773)Release date: Dec 21, 2023
+Important changes from previous versions
+This release introduces a significant change, disabling the default usage
+of the ALTER SYSTEM command in PostgreSQL. For users upgrading from a
+previous version who wish to retain the old behavior: please refer to the
+upgrade documentation for detailed instructions.
Declarative Tablespaces: Introducing the tablespaces stanza in the
+ Cluster spec, enabling comprehensive lifecycle management of PostgreSQL
+ tablespaces for enhanced vertical scalability (#3410).
Temporary Tablespaces: Adding the .spec.tablespaces[*].temporary
+ option to facilitate the utilization of a tablespace for temporary database
+ operations, by incorporating the name into the temp_tablespaces PostgreSQL
+ parameter (#3464).
ssl_ciphers,
+ ssl_min_protocol_version, and ssl_max_protocol_version GUCs (#3408).ALTER SYSTEM command (#3535).podMonitorMetricRelabelings and podMonitorRelabelings options in the
+ .spec.monitoring stanza of the Cluster and Pooler resources (#3075).PGPASSFILE environment variable when
+ establishing a network connection to PostgreSQL (#3522).cnpg report plugin command by collecting a cluster's PVCs (#3357).cnpg status plugin command, providing information about
+ managed roles, including alerts (#3310).Pooler resource to 0 is now possible (#3517).cnpg.io/podRole label with a value of 'pooler' to every
+ pooler deployment, differentiating them from instance pods (#3396).Fixes:
+PodDisruptionBudget
+ resources (#3312 and #3434).cnpg backup plugin command when
+ either immediate-checkpoint or wait-for-archive options are requested (#3449).Pooler resource (#3428).RoleBinding for Pooler resources (#3391).imagePullSecret for Pooler resources (#3389).Pooler and addition of the required labels (#3349).Pooler labels to the deployment as well, not just the pods (#3350).Changes:
+ALTER SYSTEM command is now disabled by default (#3545). History of user-visible changes in the 1.23 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release Date: December 23, 2024
+Warning
+This is the final release in the 1.23.x series. +Users are strongly encouraged to upgrade to a newer minor version, as 1.23 +is no longer supported.
+.spec.probes stanza. (#6266)cnpg.io/userType label to secrets generated for predefined users,
+ specifically superuser and app. (#4392)spec.schedule field in ScheduledBackups,
+ raising warnings for potential misconfigurations. (#5396)cnpg plugin:User-Agent header in HTTP requests with the API server. (#6153)PGDATA volume during bootstrap. (#6265)Unrecoverable when
+ all previously generated PersistentVolumeClaims are missing. (#6170)synchronous_standby_names GUC when
+ .spec.postgresql.synchronous.method is set to first. (#5955)primary_slot_name definition from the override.conf file on
+ the primary to ensure it is always empty. (#6219)PGHOST, in PgBouncer pods
+ to enable seamless access to the pgbouncer virtual database using psql
+ from within the container. (#6247)cnpg plugin:kubectl context is properly passed in the psql command. (#6257)status command. (#5998)Release date: Oct 16, 2024
+pg_database_size from the status probe, as it caused
+ high resource utilization by scanning the entire PGDATA directory to
+ compute database sizes. The kubectl status plugin will now rely on du
+ to provide detailed size information retrieval (#5689).full_page_writes parameter in
+ PostgreSQL. This setting defaults to on, in line with PostgreSQL's
+ recommendations (#5516).logs pretty command in the cnpg plugin to read a log stream
+ from standard input and output a human-readable format, with options to
+ filter log entries (#5770)status command by allowing multiple -v options to
+ increase verbosity for more detailed output (#5765).--image
+ flag in the pgadmin4 plugin command, giving users control over the
+ Docker image used for pgAdmin4 deployments (#5515).TMPDIR and PSQL_HISTORY environment variables for pods
+ and jobs, improving temporary file and history management (#5503).logs cluster command (#5775).potential sync status in the status plugin (#5533).pgadmin4 command didn’t have
+ a writable home directory (#5800).Release date: Aug 22, 2024
+cnpg plugin updates:install generate command by adding a --control-plane option,
+ allowing deployment of the operator on control-plane nodes by setting
+ node affinity and tolerations (#5271).destroy command to delete also any job related to the target
+ instance (#5298).smartShutdownTimeout to zero, enabling immediate fast
+ shutdown and bypassing the smart shutdown process when required (#5347).Release date: Jul 29, 2024
+allow_alter_system parameter,
+ enabling or disabling the ALTER SYSTEM command through the
+ .spec.postgresql.enableAlterSystem option (#4921).reconcilePodSpec annotation on the Cluster and Pooler
+ resources to control the restart of pods following a change in the Pod
+ specification (#5069).Pooler metrics
+ collector (#5044).reusePVC flag is
+ set to false (#2056).pg_rewind execution for newly created replicas that employ
+ VolumeSnapshot during bootstrapping to avoid introducing a new shutdown
+ checkpoint entry in the WAL files. This ensures that replicas can reconnect to
+ the primary without issues, which would otherwise be hindered by the additional
+ checkpoint entry (#5081).postmaster to shut down before starting it again (#4938).Pooler service template can override the default service
+ (#4846).pg_database metric monitoring and alerting
+ processes (#4980).ClusterImageCatalogs (#5034).Pooler instance when
+ the operator image is upgraded (#5006)postmaster was accessible (#4920).Pooler (#4967).AccessModes in the template only when unspecified (#4845).cnpg plugin:status command execution.status command (#4915).Release date: Jun 12, 2024
+Enabled configuration of standby-sensitive parameters during recovery using a + physical backup (#4564)
+Enabled the configuration of the liveness probe timeout via the
+ .spec.livenessProbeTimeout option (#4719)
cnpg plugin for kubectl:
--color
+ option, which accepts always, never, and auto (default) as values
+ (#4775)Prevented fenced instances from entering an unnecessary loop and consuming + all available CPU (#4625)
+Resolved an issue where the instance manager on the primary would + indefinitely wait for the instance to start after encountering a failure + following a stop operation (#4434)
+Fixed an issue where the interaction between hot_standby_feedback and
+ managed cluster-level replication slots was preventing the autovacuum from
+ operating correctly; this issue was causing disk space to remain occupied by
+ dead tuples (#4811)
Fixed a panic in the backup controller that occurred when pod container + statuses were missing (#4765)
+Prevented unnecessary shutdown of the instance manager (#4670)
+Prevented unnecessary reloads of PostgreSQL configuration when unchanged (#4531)
+Prevented unnecessary reloads of the ident map by ensuring a consistent and + unique method of writing its content (#4648)
+Avoided conflicts during phase registration by patching the status of the + resource instead of updating it (#4637)
+Implemented a timeout when restarting PostgreSQL and lifting fencing (#4504)
+Ensured that a replica cluster is restarted after promotion to properly set + the archive mode (#4399)
+Removed an unneeded concurrent keep-alive routine that was causing random + failures in volume snapshot backups (#4768)
+Ensured correct parsing of the additional rows field returned when the
+ pgaudit.log_rows option was enabled, preventing audit logs from being
+ incorrectly routed to the normal log stream (#4394)
cnpg plugin for kubectl:
cnpg status command (#4530)Release date: Apr 29, 2024
+PodMonitor resources, which was
+ failing due to a regression (#4286)Release date: Apr 24, 2024
+Important changes to Community Supported Versions
+We've updated our support policy to streamline our focus on one supported +minor release at a time, rather than two. Additionally, we've extended the +supplementary support period for the previous minor release to 3 months.
+ClusterImageCatalog and
+ ImageCatalog CRDs to manage operand container images based on PostgreSQL
+ major version. This is facilitated through the Cluster's
+ .spec.imageCatalogRef stanza. This feature provides an alternative to the
+ imageName option and will eventually replace it as the default method to define
+ operand container images.replicationSlots.synchronizeReplicas..spec.enablePDB field to disable PDBs on the primary instance, allowing
+ proper eviction of the pod during maintenance operations. This is particularly
+ useful for single-instance deployments. This feature is intended to replace the
+ node maintenance window feature.wal_log_hints PostgreSQL parameter (#4218)
+ (#4218)status command to show the status of PDBs
+ (#4319)barman-cloud-wal-restore in recovery
+ bootstrap (#4101)PGDATA in case of failure of the restore job (#4151)cnpg plugin for kubectl:destroy, fencing, and promote
+ plugin commands (#4280)cnpg status
+ (#4131)sync-sequences command
+ (#4346)gcr.io/distroless/static-debian12:nonroot
+ (#4201)History of user-visible changes in the 1.24 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: May 23, 2025
+Warning
+This is the final release in the 1.24.x series. +Users are strongly encouraged to upgrade to a newer minor version, as 1.24 +is no longer supported.
+Added the KUBERNETES_CLUSTER_DOMAIN configuration option to the operator,
+ allowing users to specify the domain suffix for fully qualified domain names
+ (FQDNs) generated within the Kubernetes cluster. If not set, it defaults to
+ cluster.local. (#6989)
Implemented the cnpg.io/validation annotation, enabling users to disable
+ the validation webhook on CloudNativePG-managed resources. Use with caution,
+ as this allows unrestricted changes. (#7196)
Added support for collecting pg_stat_wal metrics in PostgreSQL 18. (#7005)
Added support for LZ4, XZ, and Zstandard compression methods when archiving + WAL files via Barman Cloud (deprecated). (#7151)
+imagePullPolicy to Always for the operator deployment to ensure that
+ images are always pulled from the registry, reducing the risk of using
+ outdated or potentially unsafe local images. (#7250)Fixed native replication slot synchronization and logical replication
+ failover for PostgreSQL 17 by appending the dbname parameter to
+ primary_conninfo in replica configurations (#7298).
Improved backup efficiency by introducing a fail-fast mechanism in WAL + archiving, allowing quicker detection of unexpected primary demotion and + avoiding unnecessary retries (#7483).
+Fixed an off-by-one error in parallel WAL archiving that could cause one + extra worker process to be spawned beyond the requested number (#7389).
+Resolved a race condition that caused the operator to perform two switchovers + when updating the PostgreSQL configuration. (#6991)
+Corrected the PodMonitor configuration by adjusting the matchLabels scope
+ for the targeted pooler and cluster pods. Previously, the matchLabels were
+ too broad, inadvertently inheriting labels from the cluster and leading to data
+ collection from unintended targets. (#7063)
Added a webhook warning for clusters with a missing unit (e.g., MB, GB) in
+ the shared_buffers configuration. This will become an error in future
+ releases. Users should update their configurations to include explicit units
+ (e.g., 512MB instead of 512). (#7160)
CloudNativePG Interface (CNPG-I):
+Implemented automatic reloading of TLS certificates for plugins when they + change. (#7029)
+Ensured the operator properly closes the plugin connection when + performing a backup using the plugin. (#7095, #7096)
+Improved performance and resilience of CNPG-I by removing timeouts for local + plugin operations, avoiding failures during longer backup or WAL archiving + executions (#7496).
+cnpg plugin:
Ensured that the primary Pod is recreated during an imperative restart when
+ primaryUpdateMethod is set to restart, aligning its definition with the
+ replicas. (#7122)
Updated the default PostgreSQL version to 17.5 for new cluster + definitions. (#7556)
+Updated the default PgBouncer version to 1.24.1 for new Pooler
+ deployments (#7399).
Release Date: February 28, 2025
+-r service with the Pooler. (#6868)--ttl flag for the pgbench plugin, enabling
+ automatic deletion of completed jobs after a user-defined duration. (#6701)cosign, and
+ OCI attestations are generated, incorporating the Software Bill of
+ Materials (SBOM) and provenance data. Additionally, OCI annotations
+ have been added to improve traceability and ensure the integrity of
+ the images..spec.probes
+ is defined, ensuring users can override all settings, including
+ failureThreshold. If unspecified in the startup probe, failureThreshold is
+ now correctly derived from .spec.startupDelay / periodSeconds (default: 10,
+ now overridable). The same logic applies to liveness probes via
+ .spec.livenessProbeTimeout. (#6656)isWALArchiver flag in the CNPG-I plugin configuration,
+ allowing users to designate a plugin as a WAL archiver. This enables seamless
+ migration from in-tree Barman Cloud support to the plugin while maintaining WAL
+ archive consistency. (#6593)override.conf is consistently included in postgresql.conf during
+ replica cluster bootstrapping, preventing replication failures due to missing
+ configuration settings. (#6808)override.conf is correctly initialized before invoking pg_rewind
+ to prevent failures during primary role changes. (#6670)cnpg plugin:promote plugin command. (#6476)kubectl cnpg report --logs <cluster> to collect logs from all
+ containers, including sidecars. (#6636)pgbench jobs can run when a Cluster uses an ImageCatalog.
+ (#6868)client-gen, enabling automated generation of
+ Go clients for all CloudNativePG CRDs. (#6695)Release Date: December 23, 2024
+.spec.probes stanza. (#6266)cnpg.io/userType label to secrets generated for predefined users,
+ specifically superuser and app. (#4392)spec.schedule field in ScheduledBackups,
+ raising warnings for potential misconfigurations. (#5396)cnpg plugin:User-Agent header in HTTP requests with the API server. (#6153)PGDATA volume during bootstrap. (#6265)Unrecoverable when
+ all previously generated PersistentVolumeClaims are missing. (#6170)synchronous_standby_names GUC when
+ .spec.postgresql.synchronous.method is set to first. (#5955)primary_slot_name definition from the override.conf file on
+ the primary to ensure it is always empty. (#6219)PGHOST, in PgBouncer pods
+ to enable seamless access to the pgbouncer virtual database using psql
+ from within the container. (#6247)cnpg plugin:kubectl context is properly passed in the psql command. (#6257)status command. (#5998)Release date: Oct 16, 2024
+pg_database_size from the status probe, as it caused
+ high resource utilization by scanning the entire PGDATA directory to
+ compute database sizes. The kubectl status plugin will now rely on du
+ to provide detailed size information retrieval (#5689).full_page_writes parameter in
+ PostgreSQL. This setting defaults to on, in line with PostgreSQL's
+ recommendations (#5516).logs pretty command in the cnpg plugin to read a log stream
+ from standard input and output a human-readable format, with options to
+ filter log entries (#5770)status command by allowing multiple -v options to
+ increase verbosity for more detailed output (#5765).--image
+ flag in the pgadmin4 plugin command, giving users control over the
+ Docker image used for pgAdmin4 deployments (#5515)..spec.postgresql.synchronous, ensure that the
+ synchronous_standby_names parameter is correctly set, even when no replicas
+ are reachable (#5831).TMPDIR and PSQL_HISTORY environment variables for pods
+ and jobs, improving temporary file and history management (#5503).logs cluster command (#5775).potential sync status in the status plugin (#5533).pgadmin4 command didn’t have
+ a writable home directory (#5800).Release date: Aug 22, 2024
+role label in the selectors of Service and
+ PodDisruptionBudget resources in favor of cnpg.io/instanceRole (#4897).required (#5156).Warning
+The PodAntiAffinity change will trigger a rollout of all the instances when +the operator is upgraded, even when online upgrades are enabled.
+managed.services
+ stanza (#4769 and #4952), allowing you to:synchronous_standby_names option, providing greater
+ control and flexibility (#5148)..spec.replica.minApplyDelay option, leveraging PostgreSQL's
+ recovery_min_apply_delay capability (#5181).postInitSQLRefs and postInitTemplateSQLRefs to allow users to
+ define postInit and postInitTemplate instructions as one or more config
+ maps or secrets (#5074).allow_alter_system parameter,
+ enabling or disabling the ALTER SYSTEM command through the
+.spec.postgresql.enableAlterSystem option (#4921).name key/value pair, which can replace the name automatically inherited
+ from the parent key (#4779).predicate_query field (#4503).barman-cloud-wal-archive and
+ barman-cloud-wal-restore (#5099).reconcilePodSpec annotation on the Cluster and Pooler
+ resources to control the restart of pods following a change in the Pod
+ specification (#5069).Pooler metrics
+ collector (#5044).cnpg plugin updates:install generate command by adding a --control-plane option,
+ allowing deployment of the operator on control-plane nodes by setting
+ node affinity and tolerations (#5271).destroy command to delete also any job related to the target
+ instance (#5298).status command to display demotionToken and
+ promotionToken when available, providing more detailed operational
+ insights with distributed topologies (#5149).publication
+ and subscription subcommands. This enhancement offers greater flexibility
+ for synchronizing data from an external cluster with multiple databases (#5113).reusePVC flag is
+ set to false (#2056).pg_rewind execution for newly created replicas that employ
+ VolumeSnapshot during bootstrapping to avoid introducing a new shutdown
+ checkpoint entry in the WAL files. This ensures that replicas can reconnect to
+ the primary without issues, which would otherwise be hindered by the additional
+ checkpoint entry (#5081).postmaster to shut down before starting it again (#4938).Pooler service template can override the default service
+ (#4846).pg_database metric monitoring and alerting
+ processes (#4980).ClusterImageCatalogs (#5034).Pooler instance when
+ the operator image is upgraded (#5006)postmaster was accessible (#4920).Pooler (#4967).AccessModes in the template only when unspecified (#4845).smartShutdownTimeout to zero, enabling immediate fast
+ shutdown and bypassing the smart shutdown process when required (#5347).cnpg plugin:status command execution.status command (#4915).History of user-visible changes in the 1.25 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Oct 23, 2025
+Warning
+This is the final release in the 1.25.x series. +Users are strongly encouraged to upgrade to a newer minor version, as 1.25 +is no longer supported.
+Adopted the new format of postgres-containers and postgis-containers
+ images and image catalog artifacts, and updated the default PostgreSQL
+ version to 18.0-system-trixie (PostgreSQL 18 is now supported).
+ (#8578,
+ #8760,
+ #8558)
Deprecated the monitoring.enablePodMonitor field in the Cluster and
+ Pooler resources. This field will be removed in a future release. Users who
+ rely on PodMonitor resources should create them manually instead.
+ (#8753)
Added support for overriding the PgBouncer auth_type, server_tls_sslmode,
+ and client_tls_sslmode settings, which were previously hardcoded. Default
+ values remain consistent with the former behavior but can now be customized
+ when required.
+ (#8674)
Added a CHECKPOINT step before PostgreSQL smart and fast shutdowns to
+ reduce shutdown duration and replica promotion time, especially on systems
+ with a high checkpoint_timeout.
+ (#8867)
Added a warning in the instance manager for deprecated or unsupported OS
+ versions, based on the official postgres-containers project.
+ (#8601)
Improved certificate parsing error reporting. Failures now log specific + errors instead of a generic message, aiding troubleshooting. This is + particularly relevant after the CVE-2025-58187 fix in Go 1.25.2 and 1.24.8, + which may trigger parsing failures for invalid DNS SANs. + (#8801)
+Added a check to ensure the destination WAL archive path is empty when
+ bootstrapping a cluster using the pg_basebackup method, consistent with
+ other bootstrap methods.
+ (#8895)
Added validation to prevent backups from running on hibernated clusters.
+ Backups attempted on such clusters now fail with reason
+ ClusterIsHibernated, following the standard prerequisite check pattern.
+ (#8870)
Added support for pprof profiling. Instances can now enable the pprof
+ tool by adding the alpha.cnpg.io/enableInstancePprof annotation to the
+ Cluster resource for advanced debugging.
+ (#7876)
cnpg plugin:
Updated the Flexible I/O Tester (FIO) image to
+ wallnerryan/fiotools-aio:v2, as provided by Ryan Wallner.
+ (#8847)
Enhanced the cnpg status backup command to provide more detailed status
+ information when using a barman-cloud-based backup plugin.
+ (#8780,
+ #8690)
Fixed backup restoration failures when using custom WAL segment sizes with + parallel WAL recovery. The operator no longer manages the end-of-WAL file + marker during restoration, preventing errors when backups span multiple WAL + segments. + (#8873)
+Fixed a bug in major upgrades where a volume snapshot from a previous minor + version could be incorrectly used to optimize replica creation. + (#8475)
+Fixed initdb to wait for the application user secret before bootstrapping
+ a new cluster, preventing potential race conditions.
+ (#8663)
Fixed the connection retry logic in the cnpgi plugin. The reconciliation
+ loop now detects connection pool changes correctly and uses exponential
+ backoff to reduce "closed pool" errors.
+ (#8554)
Fixed volume snapshot usage during replica scaling to work with backup plugins. + Previously, this optimization was only available with the in-tree backup + implementation, but now clusters using backup plugins can also leverage volume + snapshots when creating new replicas. + (#8506)
+Fixed the Pooler templating to correctly inherit settings for the
+ bootstrap controller init container.
+ (#8394)
Fixed webhook errors to use the correct API group (postgresql.cnpg.io) in
+ Pooler and backup webhooks, ensuring consistent API error reporting.
+ (#8485)
Fixed a potential nil pointer dereference in the hibernation reconciler when + handling errors. Contributed by @PascalBourdier. + (#8756)
+Fixed an issue in the environment cache where callers could inadvertently
+ modify shared data. The LoadEnv function now returns a copy of cached
+ environment slices to prevent mutations from affecting the cache.
+ (#8880)
Release date: Jul 25, 2025
+In memory of DJ Walker-Morgan.
+386 and ARM (v5/v6/v7) architectures from the cnpg plugin build
+ matrix, reducing the number of published binaries
+ (#7648).Improved validation of shared_buffers by correctly considering HugePages
+ settings, ensuring accurate memory configuration checks
+ (#7864).
Set oom_score_adj for PostgreSQL worker processes to improve prioritization
+ during out-of-memory situations
+ (#7891).
Added fqdn-uri and fqdn-jdbc-uri fields in user secrets to simplify
+ application connection string management and align with DNS-based connection
+ best practices (#7852).
Added the systemID field and related condition in the Cluster status to track
+ the PostgreSQL system identifier.
+ (#7717).
Added a mutex in the connection pooler to protect concurrent access to the + connections map, improving stability in high-concurrency environments + (#7804).
+Fixed replica cluster instance ordering by correctly detecting the designated + primary, improving replica cluster stability and switchover operations + (#8108).
+Added support for reconciling VolumeAttributesClass for PVCs, enhancing
+ storage compatibility and lifecycle management
+ (#7885).
Made the internal webserver routines non-blocking to improve responsiveness + under load (#8071).
+Fixed an issue where the ensureReplicationClientLeafCertificate error did
+ not display the correct secretName in the not found message
+ (#8086).
Prevented invalid ALTER SUBSCRIPTION statements by updating only
+ PostgreSQL‑supported parameters; unsupported options like copy_data are
+ ignored to avoid reconciliation failures
+ (7844).
Fixed an issue where the bootstrap-controller in the connection pooler did
+ not apply resources settings correctly
+ (#7922).
Ensured online backups fail cleanly if the targetPod becomes unhealthy
+ during backup, preventing partial or misleading backups
+ (#7944).
Ensured the Backup resource status is set properly after a failure, improving + observability and scripting reliability + (#7898).
+Release date: May 23, 2025
+Added the KUBERNETES_CLUSTER_DOMAIN configuration option to the operator,
+ allowing users to specify the domain suffix for fully qualified domain names
+ (FQDNs) generated within the Kubernetes cluster. If not set, it defaults to
+ cluster.local. (#6989)
Implemented the cnpg.io/validation annotation, enabling users to disable
+ the validation webhook on CloudNativePG-managed resources. Use with caution,
+ as this allows unrestricted changes. (#7196)
Added support for collecting pg_stat_wal metrics in PostgreSQL 18. (#7005)
Added support for LZ4, XZ, and Zstandard compression methods when archiving + WAL files via Barman Cloud (deprecated). (#7151)
+CloudNativePG Interface (CNPG-I):
+A plugin can now trigger instance rollouts by implementing the EVALUATE
+ verb, ensuring that plugin-induced changes are properly reconciled. (#7126)
Introduced support for WAL recovery via CNPG-I plugins during snapshot + restore. (#7284)
+imagePullPolicy to Always for the operator deployment to ensure that
+ images are always pulled from the registry, reducing the risk of using
+ outdated or potentially unsafe local images. (#7250)Fixed native replication slot synchronization and logical replication
+ failover for PostgreSQL 17 by appending the dbname parameter to
+ primary_conninfo in replica configurations (#7298).
Fixed a regression in WAL restore operations that prevented fallback to the
+ in-tree barmanObjectStore configuration defined in the externalCluster
+ source when a plugin failed to locate a WAL file (#7507).
Improved backup efficiency by introducing a fail-fast mechanism in WAL + archiving, allowing quicker detection of unexpected primary demotion and + avoiding unnecessary retries (#7483).
+Fixed an off-by-one error in parallel WAL archiving that could cause one + extra worker process to be spawned beyond the requested number (#7389).
+Resolved a race condition that caused the operator to perform two switchovers + when updating the PostgreSQL configuration. (#6991)
+Corrected the PodMonitor configuration by adjusting the matchLabels scope
+ for the targeted pooler and cluster pods. Previously, the matchLabels were
+ too broad, inadvertently inheriting labels from the cluster and leading to data
+ collection from unintended targets. (#7063)
Added a webhook warning for clusters with a missing unit (e.g., MB, GB) in
+ the shared_buffers configuration. This will become an error in future
+ releases. Users should update their configurations to include explicit units
+ (e.g., 512MB instead of 512). (#7160)
Treated timeout errors during volume snapshot creation as retryable to + prevent unnecessary backup failures. (#7010)
+Moved the defaulting logic for .spec.postgresql.synchronous.dataDurability
+ from the CRD to the webhook to avoid UI issues with OLM. (#7600)
CloudNativePG Interface (CNPG-I):
+Implemented automatic reloading of TLS certificates for plugins when they + change. (#7029)
+Ensured the operator properly closes the plugin connection when + performing a backup using the plugin. (#7095, #7096)
+Improved performance and resilience of CNPG-I by removing timeouts for local + plugin operations, avoiding failures during longer backup or WAL archiving + executions (#7496).
+cnpg plugin:
Increased the buffer size in the logs pretty command to better handle
+ larger log output (#7281).
Ensured the plugin-name parameter is required for plugin-based backups
+ and disallowed for non-plugin backup methods (#7506).
Ensured that the primary Pod is recreated during an imperative restart when
+ primaryUpdateMethod is set to restart, aligning its definition with the
+ replicas. (#7122)
Updated the default PostgreSQL version to 17.5 for new cluster + definitions. (#7556)
+Updated the default PgBouncer version to 1.24.1 for new Pooler
+ deployments (#7399).
Release Date: February 28, 2025
+-r service with the Pooler. (#6868)--ttl flag for the pgbench plugin, enabling
+ automatic deletion of completed jobs after a user-defined duration. (#6701)cosign, and
+ OCI attestations are generated, incorporating the Software Bill of
+ Materials (SBOM) and provenance data. Additionally, OCI annotations
+ have been added to improve traceability and ensure the integrity of
+ the images..spec.probes
+ is defined, ensuring users can override all settings, including
+ failureThreshold. If unspecified in the startup probe, failureThreshold is
+ now correctly derived from .spec.startupDelay / periodSeconds (default: 10,
+ now overridable). The same logic applies to liveness probes via
+ .spec.livenessProbeTimeout. (#6656)Database, Publication, and Subscription CRDs
+ became stuck in cluster resource has been deleted, skipping reconciliation
+ after cluster rehydration. This patch forces status.observedGeneration to
+ zero, ensuring proper reconciliation. (#6607) isWALArchiver flag in the CNPG-I plugin configuration,
+ allowing users to designate a plugin as a WAL archiver. This enables seamless
+ migration from in-tree Barman Cloud support to the plugin while maintaining WAL
+ archive consistency. (#6593)override.conf is consistently included in postgresql.conf during
+ replica cluster bootstrapping, preventing replication failures due to missing
+ configuration settings. (#6808)override.conf is correctly initialized before invoking pg_rewind
+ to prevent failures during primary role changes. (#6670)cnpg plugin:promote plugin command. (#6476)kubectl cnpg report --logs <cluster> to collect logs from all
+ containers, including sidecars. (#6636)pgbench jobs can run when a Cluster uses an ImageCatalog.
+ (#6868)client-gen, enabling automated generation of
+ Go clients for all CloudNativePG CRDs. (#6695)Release Date: December 23, 2024
+Declarative Database Management: Introduce the Database Custom Resource
+ Definition (CRD), enabling users to create and manage PostgreSQL databases
+ declaratively within a cluster. (#5325)
Logical Replication Management: Add Publication and Subscription CRDs
+ for declarative management of PostgreSQL logical replication. These simplify
+ replication setup and facilitate online migrations to CloudNativePG. (#5329)
Experimental Support for CNPG-I: Introducing CNPG-I (CloudNativePG + Interface), a standardized framework designed to extend CloudNativePG + functionality through third-party plugins and foster the growth of the CNPG + ecosystem. + The Barman Cloud Plugin serves as a live + example, illustrating how plugins can be developed to enhance backup and + recovery workflows. Although CNPG-I support is currently experimental, it + offers a powerful approach to extending CloudNativePG without modifying the + operator’s core code—akin to PostgreSQL extensions. We welcome community + feedback and contributions to shape this exciting new capability.
+dataDurability option to the .spec.postgresql.synchronous stanza,
+ allowing users to choose between required (default) or preferred
+ durability in synchronous replication. (#5878).spec.probes stanza. (#6266)pg_dump and pg_restore options to enhance database
+ import flexibility. (#6214)maxConcurrentReconciles in the CloudNativePG controller and
+ set the default to 10, improving the operator's ability to efficiently manage
+ larger deployments out of the box. (#5678)cnpg.io/userType label to secrets generated for predefined users,
+ specifically superuser and app. (#4392)spec.schedule field in ScheduledBackups,
+ raising warnings for potential misconfigurations. (#5396)cnpg plugin:backup command to support plugins. (#6045)User-Agent header in HTTP requests with the API server. (#6153)PGDATA volume during bootstrap. (#6265)Unrecoverable when
+ all previously generated PersistentVolumeClaims are missing. (#6170)synchronous_standby_names GUC when
+ .spec.postgresql.synchronous.method is set to first. (#5955)primary_slot_name definition from the override.conf file on
+ the primary to ensure it is always empty. (#6219)PGHOST, in PgBouncer pods
+ to enable seamless access to the pgbouncer virtual database using psql
+ from within the container. (#6247)cnpg plugin:kubectl context is properly passed in the psql command. (#6257)status command. (#5998)History of user-visible changes in the 1.26 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Oct 23, 2025
+Delayed the decommissioning of native in-core support for Barman Cloud to at + least version 1.29. + (#8670)
+Adopted the new format of postgres-containers and postgis-containers
+ images and image catalog artifacts, and updated the default PostgreSQL
+ version to 18.0-system-trixie (PostgreSQL 18 is now supported).
+ (#8578,
+ #8760,
+ #8558)
Deprecated the monitoring.enablePodMonitor field in the Cluster and
+ Pooler resources. This field will be removed in a future release. Users who
+ rely on PodMonitor resources should create them manually instead.
+ (#8753)
Added support for overriding the PgBouncer auth_type, server_tls_sslmode,
+ and client_tls_sslmode settings, which were previously hardcoded. Default
+ values remain consistent with the former behavior but can now be customized
+ when required.
+ (#8674)
Added a CHECKPOINT step before PostgreSQL smart and fast shutdowns to
+ reduce shutdown duration and replica promotion time, especially on systems
+ with a high checkpoint_timeout.
+ (#8867)
Added a warning in the instance manager for deprecated or unsupported OS
+ versions, based on the official postgres-containers project.
+ (#8601)
Improved certificate parsing error reporting. Failures now log specific + errors instead of a generic message, aiding troubleshooting. This is + particularly relevant after the CVE-2025-58187 fix in Go 1.25.2 and 1.24.8, + which may trigger parsing failures for invalid DNS SANs. + (#8801)
+Added a check to ensure the destination WAL archive path is empty when
+ bootstrapping a cluster using the pg_basebackup method, consistent with
+ other bootstrap methods.
+ (#8895)
Added validation to prevent backups from running on hibernated clusters.
+ Backups attempted on such clusters now fail with reason
+ ClusterIsHibernated, following the standard prerequisite check pattern.
+ (#8870)
Added support for pprof profiling. Instances can now enable the pprof
+ tool by adding the alpha.cnpg.io/enableInstancePprof annotation to the
+ Cluster resource for advanced debugging.
+ (#7876)
cnpg plugin:
Updated the Flexible I/O Tester (FIO) image to
+ wallnerryan/fiotools-aio:v2, as provided by Ryan Wallner.
+ (#8847)
Enhanced the cnpg status backup command to provide more detailed status
+ information when using a barman-cloud-based backup plugin.
+ (#8780,
+ #8690)
Fixed backup restoration failures when using custom WAL segment sizes with + parallel WAL recovery. The operator no longer manages the end-of-WAL file + marker during restoration, preventing errors when backups span multiple WAL + segments. + (#8873)
+Fixed a bug in major upgrades where a volume snapshot from a previous minor + version could be incorrectly used to optimize replica creation. + (#8475)
+Fixed initdb to wait for the application user secret before bootstrapping
+ a new cluster, preventing potential race conditions.
+ (#8663)
Fixed the connection retry logic in the cnpgi plugin. The reconciliation
+ loop now detects connection pool changes correctly and uses exponential
+ backoff to reduce "closed pool" errors.
+ (#8554)
Fixed volume snapshot usage during replica scaling to work with backup plugins. + Previously, this optimization was only available with the in-tree backup + implementation, but now clusters using backup plugins can also leverage volume + snapshots when creating new replicas. + (#8506)
+Fixed the Pooler templating to correctly inherit settings for the
+ bootstrap controller init container.
+ (#8394)
Fixed webhook errors to use the correct API group (postgresql.cnpg.io) in
+ Pooler and backup webhooks, ensuring consistent API error reporting.
+ (#8485)
Fixed a potential nil pointer dereference in the hibernation reconciler when + handling errors. Contributed by @PascalBourdier. + (#8756)
+Fixed an issue in the environment cache where callers could inadvertently
+ modify shared data. The LoadEnv function now returns a copy of cached
+ environment slices to prevent mutations from affecting the cache.
+ (#8880)
Release date: Jul 25, 2025
+In memory of DJ Walker-Morgan.
+CloudNativePG is moving toward a plugin-based, backup and recovery agnostic
+architecture (initiated in 1.26.0 with Barman Cloud). As part of this
+transition, the following fields in the .status section of the Cluster
+resource are now deprecated:
firstRecoverabilityPointfirstRecoverabilityPointByMethodlastSuccessfulBackuplastSuccessfulBackupByMethodlastFailedBackupAdditionally, the following Prometheus metrics are deprecated:
+cnpg_collector_first_recoverability_pointcnpg_collector_last_available_backup_timestampcnpg_collector_last_failed_backup_timestampThese fields and metrics will no longer update when using plugin-based backups +(e.g., Barman Cloud via CNPG-I). They remain functional for users still using +in-core Barman Cloud and volume snapshot backups.
+++Note: We, as maintainers, are sorry for any inconvenience caused by not +highlighting this change during the 1.26.0 release. As we transition to a +plugin-based backup and recovery architecture, we encourage you to test +your chosen plugin thoroughly in a staging environment before moving to +production to ensure your workflows and observability integration continue +to meet your needs. Thank you for your understanding and for working with us +as CloudNativePG evolves to provide a more modular and robust experience.
+
386 and ARM (v5/v6/v7) architectures from the cnpg plugin build
+ matrix, reducing the number of published binaries
+ (#7648).Improved validation of shared_buffers by correctly considering HugePages
+ settings, ensuring accurate memory configuration checks
+ (#7864).
Set oom_score_adj for PostgreSQL worker processes to improve prioritization
+ during out-of-memory situations
+ (#7891).
Added fqdn-uri and fqdn-jdbc-uri fields in user secrets to simplify
+ application connection string management and align with DNS-based connection
+ best practices (#7852).
Added the systemID field and related condition in the Cluster status to track
+ the PostgreSQL system identifier.
+ (#7717).
Added a mutex in the connection pooler to protect concurrent access to the + connections map, improving stability in high-concurrency environments + (#7804).
+Fixed replica cluster instance ordering by correctly detecting the designated + primary, improving replica cluster stability and switchover operations + (#8108).
+Added support for reconciling VolumeAttributesClass for PVCs, enhancing
+ storage compatibility and lifecycle management
+ (#7885).
Made the internal webserver routines non-blocking to improve responsiveness + under load (#8071).
+Fixed an issue where the ensureReplicationClientLeafCertificate error did
+ not display the correct secretName in the not found message
+ (#8086).
Prevented invalid ALTER SUBSCRIPTION statements by updating only
+ PostgreSQL‑supported parameters; unsupported options like copy_data are
+ ignored to avoid reconciliation failures
+ (7844).
Fixed an issue where the bootstrap-controller in the connection pooler did
+ not apply resources settings correctly
+ (#7922).
Ensured online backups fail cleanly if the targetPod becomes unhealthy
+ during backup, preventing partial or misleading backups
+ (#7944).
Ensured the Backup resource status is set properly after a failure, improving + observability and scripting reliability + (#7898).
+Improved liveness probe handling to avoid unnecessary timeouts when they are + not required (#7902).
+Release date: May 23, 2025
+CloudNativePG is now officially a CNCF project: CloudNativePG has been + accepted into the Cloud Native Computing Foundation (CNCF), marking a + significant milestone in its evolution. As part of this transition, the project + is now governed under CloudNativePG, a Series of LF Projects, LLC, ensuring + long-term sustainability and community-driven innovation. (#7203)
+Deprecation of Native Barman Cloud Support: Native support for Barman + Cloud backups and recovery is now deprecated and will be fully removed in + CloudNativePG version 1.28.0. Although still available in the current release, + users are strongly encouraged to begin migrating their existing clusters to the + new Barman Cloud Plugin to + ensure a smooth and seamless transition. The plugin should also be used for all + new deployments. This change marks the first step toward making CloudNativePG a + backup-agnostic solution, a goal that will be fully realized when volume + snapshot support is also moved to a plugin-based architecture. (#6876)
+End of Support for Barman 3.4 and Earlier: CloudNativePG no longer + supports Barman versions 3.4 and earlier, including the capability detection + framework. Users running older operand versions (from before April 2023) must + update their operand before upgrading the operator to avoid compatibility + issues. (#7220)
+Hibernation Command Changes: The hibernate on and hibernate off
+ commands in the cnpg plugin for kubectl now serve as shortcuts for
+ declarative hibernation. The previous imperative approach has been removed in
+ favor of this method. Additionally, the hibernate status command has been
+ removed, as its functionality is now covered by the standard status
+ command. Warning: Do not upgrade to version 1.26 of both the plugin and
+ the operator unless you are prepared to migrate to the declarative
+ hibernation method. (#7155)
Declarative Offline In-Place Major Upgrades of PostgreSQL: Introduced
+ support for offline in-place major upgrades when a new operand container
+ image with a higher PostgreSQL major version is applied to a cluster. During
+ the upgrade, all cluster pods are shut down to ensure data consistency. A new
+ job is created to validate upgrade conditions, run pg_upgrade, and set up new
+ directories for PGDATA, WAL files, and tablespaces as needed. Once the
+ upgrade is complete, replicas are re-created. Failed upgrades can be rolled
+ back declaratively. (#6664)
Improved Startup and Readiness Probes for Replicas: Enhanced support for + Kubernetes startup and readiness probes in PostgreSQL instances, providing + greater control over replicas based on the streaming lag. (#6623)
+Declarative management of extensions and schemas: Introduced the
+ extensions and schemas stanzas in the Database resource to declaratively
+ create, modify, and drop PostgreSQL extensions and schemas within a database. (#7062)
Introduced an opt-in experimental feature to enhance the liveness probe with
+ network isolation detection for primary instances. This feature can be
+ activated via the alpha.cnpg.io/livenessPinger annotation (#7466).
Introduced the STANDBY_TCP_USER_TIMEOUT operator configuration setting,
+ allowing users to specify the tcp_user_timeout parameter on all standby
+ instances managed by the operator. (#7036)
Introduced the DRAIN_TAINTS operator configuration option, enabling users
+ to customize which node taints indicate a node is being drained. This
+ replaces the previous fixed behavior of only recognizing
+ node.kubernetes.io/unschedulable as a drain signal. (#6928)
Added a new field in the status of the Cluster resource to track the
+ latest known Pod IP (#7546).
Added the pg_extensions metric, providing information about installed
+ PostgreSQL extensions and their latest available versions. (#7195)
Added the KUBERNETES_CLUSTER_DOMAIN configuration option to the operator,
+ allowing users to specify the domain suffix for fully qualified domain names
+ (FQDNs) generated within the Kubernetes cluster. If not set, it defaults to
+ cluster.local. (#6989)
Implemented the cnpg.io/validation annotation, enabling users to disable
+ the validation webhook on CloudNativePG-managed resources. Use with caution,
+ as this allows unrestricted changes. (#7196)
Added support for patching PostgreSQL instance pods using the
+ cnpg.io/podPatch annotation with a JSON Patch. This may introduce
+ discrepancies between the operator’s expectations and Kubernetes behavior, so
+ it should be used with caution. (#6323)
Added support for collecting pg_stat_wal metrics in PostgreSQL 18. (#7005)
Removed the ENABLE_AZURE_PVC_UPDATES configuration, as it is no longer
+ required to resize Azure volumes correctly. The Azure CSI driver includes the
+ necessary fix as of version 1.11.0. (#7297)
The .spec.backup.barmanObjectStore and .spec.backup.retentionPolicy
+ fields are now deprecated in favor of the external Barman Cloud Plugin, and a
+ warning is now emitted by the admission webhook when these fields are used in
+ the Cluster specification (#7500).
Added support for LZ4, XZ, and Zstandard compression methods when archiving + WAL files via Barman Cloud (deprecated). (#7151)
+CloudNativePG Interface (CNPG-I):
+A plugin can now trigger instance rollouts by implementing the EVALUATE
+ verb, ensuring that plugin-induced changes are properly reconciled. (#7126)
Introduced support for WAL recovery via CNPG-I plugins during snapshot + restore. (#7284)
+imagePullPolicy to Always for the operator deployment to ensure that
+ images are always pulled from the registry, reducing the risk of using
+ outdated or potentially unsafe local images. (#7250)Fixed native replication slot synchronization and logical replication
+ failover for PostgreSQL 17 by appending the dbname parameter to
+ primary_conninfo in replica configurations (#7298).
Fixed a regression in WAL restore operations that prevented fallback to the
+ in-tree barmanObjectStore configuration defined in the externalCluster
+ source when a plugin failed to locate a WAL file (#7507).
Improved backup efficiency by introducing a fail-fast mechanism in WAL + archiving, allowing quicker detection of unexpected primary demotion and + avoiding unnecessary retries (#7483).
+Fixed an off-by-one error in parallel WAL archiving that could cause one + extra worker process to be spawned beyond the requested number (#7389).
+Resolved a race condition that caused the operator to perform two switchovers + when updating the PostgreSQL configuration. (#6991)
+Corrected the PodMonitor configuration by adjusting the matchLabels scope
+ for the targeted pooler and cluster pods. Previously, the matchLabels were
+ too broad, inadvertently inheriting labels from the cluster and leading to data
+ collection from unintended targets. (#7063)
Added a webhook warning for clusters with a missing unit (e.g., MB, GB) in
+ the shared_buffers configuration. This will become an error in future
+ releases. Users should update their configurations to include explicit units
+ (e.g., 512MB instead of 512). (#7160)
Treated timeout errors during volume snapshot creation as retryable to + prevent unnecessary backup failures. (#7010)
+Moved the defaulting logic for .spec.postgresql.synchronous.dataDurability
+ from the CRD to the webhook to avoid UI issues with OLM. (#7600)
CloudNativePG Interface (CNPG-I):
+Implemented automatic reloading of TLS certificates for plugins when they + change. (#7029)
+Ensured the operator properly closes the plugin connection when + performing a backup using the plugin. (#7095, #7096)
+Fixed an issue that prevented WALs from being archived on a former + primary node when using a plugin. (#6964)
+Improved performance and resilience of CNPG-I by removing timeouts for local + plugin operations, avoiding failures during longer backup or WAL archiving + executions (#7496).
+cnpg plugin:
Increased the buffer size in the logs pretty command to better handle
+ larger log output (#7281).
Ensured the plugin-name parameter is required for plugin-based backups
+ and disallowed for non-plugin backup methods (#7506).
Ensured that the primary Pod is recreated during an imperative restart when
+ primaryUpdateMethod is set to restart, aligning its definition with the
+ replicas. (#7122)
Updated the default PostgreSQL version to 17.5 for new cluster + definitions. (#7556)
+Updated the default PgBouncer version to 1.24.1 for new Pooler
+ deployments (#7399).
History of user-visible changes in the 1.27 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Oct 23, 2025
+Delayed the decommissioning of native in-core support for Barman Cloud to at + least version 1.29. + (#8670)
+Adopted the new format of postgres-containers and postgis-containers
+ images and image catalog artifacts, and updated the default PostgreSQL
+ version to 18.0-system-trixie (PostgreSQL 18 is now supported).
+ (#8578,
+ #8760,
+ #8558)
Deprecated the monitoring.enablePodMonitor field in the Cluster and
+ Pooler resources. This field will be removed in a future release. Users who
+ rely on PodMonitor resources should create them manually instead.
+ (#8753)
Added support for overriding the PgBouncer auth_type, server_tls_sslmode,
+ and client_tls_sslmode settings, which were previously hardcoded. Default
+ values remain consistent with the former behavior but can now be customized
+ when required.
+ (#8674)
Added a CHECKPOINT step before PostgreSQL smart and fast shutdowns to
+ reduce shutdown duration and replica promotion time, especially on systems
+ with a high checkpoint_timeout.
+ (#8867)
Added a warning in the instance manager for deprecated or unsupported OS
+ versions, based on the official postgres-containers project.
+ (#8601)
Improved certificate parsing error reporting. Failures now log specific + errors instead of a generic message, aiding troubleshooting. This is + particularly relevant after the CVE-2025-58187 fix in Go 1.25.2 and 1.24.8, + which may trigger parsing failures for invalid DNS SANs. + (#8801)
+Added a check to ensure the destination WAL archive path is empty when
+ bootstrapping a cluster using the pg_basebackup method, consistent with
+ other bootstrap methods.
+ (#8895)
Added validation to prevent backups from running on hibernated clusters.
+ Backups attempted on such clusters now fail with reason
+ ClusterIsHibernated, following the standard prerequisite check pattern.
+ (#8870)
Added support for pprof profiling. Instances can now enable the pprof
+ tool by adding the alpha.cnpg.io/enableInstancePprof annotation to the
+ Cluster resource for advanced debugging.
+ (#7876)
cnpg plugin:
Updated the Flexible I/O Tester (FIO) image to
+ wallnerryan/fiotools-aio:v2, as provided by Ryan Wallner.
+ (#8847)
Enhanced the cnpg status backup command to provide more detailed status
+ information when using a barman-cloud-based backup plugin.
+ (#8780,
+ #8690)
Fixed backup restoration failures when using custom WAL segment sizes with + parallel WAL recovery. The operator no longer manages the end-of-WAL file + marker during restoration, preventing errors when backups span multiple WAL + segments. + (#8873)
+Fixed a bug in major upgrades where a volume snapshot from a previous minor + version could be incorrectly used to optimize replica creation. + (#8475)
+Fixed initdb to wait for the application user secret before bootstrapping
+ a new cluster, preventing potential race conditions.
+ (#8663)
Fixed configuration hash calculation to ignore internal configuration fields, + preventing unnecessary reconciliations. + (#8868)
+Fixed the connection retry logic in the cnpgi plugin. The reconciliation
+ loop now detects connection pool changes correctly and uses exponential
+ backoff to reduce "closed pool" errors.
+ (#8554)
Fixed volume snapshot usage during replica scaling to work with backup plugins. + Previously, this optimization was only available with the in-tree backup + implementation, but now clusters using backup plugins can also leverage volume + snapshots when creating new replicas. + (#8506)
+Fixed the Pooler templating to correctly inherit settings for the
+ bootstrap controller init container.
+ (#8394)
Fixed webhook errors to use the correct API group (postgresql.cnpg.io) in
+ Pooler and backup webhooks, ensuring consistent API error reporting.
+ (#8485)
Fixed a potential nil pointer dereference in the hibernation reconciler when + handling errors. Contributed by @PascalBourdier. + (#8756)
+Fixed an issue in the environment cache where callers could inadvertently
+ modify shared data. The LoadEnv function now returns a copy of cached
+ environment slices to prevent mutations from affecting the cache.
+ (#8880)
Release date: Aug 12, 2025
+livenessProbeTimeout (default: 30 seconds).Dynamic loading of PostgreSQL extensions: Introduced the
+ .spec.postgresql.extensions stanza for mounting PostgreSQL extensions,
+ packaged as OCI-compliant container images, as read-only and immutable volumes
+ inside instance pods. This allows dynamic extension management without
+ rebuilding base images.
+ (#7991).
Logical decoding slot synchronization in HA clusters: Added the
+ synchronizeLogicalDecoding field under
+ spec.replicationSlots.highAvailability to enable automatic synchronization of
+ logical decoding slots across high-availability clusters, ensuring logical
+ replication subscribers continue seamlessly after a publisher failover
+ (#7931).
Primary Isolation Check: Promoted to stable the liveness pinger
+ experimental feature introduced in 1.26, adding the
+ .spec.probes.liveness.isolationCheck section to enable primary isolation
+ checks in the liveness probe by default. This improves the detection and
+ handling of primary connectivity issues in Kubernetes environments
+ (#7845).
Introduced an opt-in experimental feature that enables quorum-based failover
+ to improve safety and data durability during failover events. This feature,
+ also called failover quorum, can be activated via the
+ alpha.cnpg.io/failoverQuorum annotation.
+ (#7572).
Added support for user maps for predefined users such as streaming_replica,
+ allowing the use of self-managed client certificates with different Common
+ Names in environments with strict policies or shared CAs, while still enabling
+ replicas to join clusters using the streaming_replica role
+ (#7725).
Added a new PhaseFailurePlugin phase in the Cluster status to improve
+ observability of plugin-related failures
+ (#7988).
Made the Backup.spec field immutable after creation, ensuring consistency
+ and predictability in backup operations
+ (#7904).
Added fqdn-uri and fqdn-jdbc-uri fields in the user secret to simplify
+ the retrieval of fully qualified domain name-based connection strings
+ (#7852).
CNPG-I:
+ +barman-cloud plugin using the
+ switchover strategy. Former primary Pods now restart correctly after WAL
+ archiving fails due to missing plugin support.
+ (#8236)History of user-visible changes in the 1.28 minor release of CloudNativePG.
+For a complete list of changes, please refer to the +commits +on the release branch in GitHub.
+Release date: Nov 7, 2025
+Quorum-Based Failover Promoted to Stable: Promoted the quorum-based
+ failover feature, introduced experimentally in 1.27.0, to a stable API.
+ This data-driven failover mechanism is now configured via the
+ spec.postgresql.synchronous.failoverQuorum field, graduating from the
+ previous alpha.cnpg.io/failoverQuorum annotation.
+ (#8589)
Declarative Foreign Data Management: Introduced comprehensive declarative
+ management for Foreign Data Wrappers (FDW) by extending the Database CRD.
+ This feature adds the .spec.fdws and .spec.servers fields, allowing you to
+ manage FDW extensions and their corresponding foreign servers directly from the
+ Database resource. This work was implemented by Ying Zhu (@EdwinaZhu) as part
+ of the LFX Mentorship Program 2025 Term 2.
+ (#7942,
+ #8401)
Enabled simultaneous image and configuration changes, allowing you to update + the container image (including PostgreSQL version or extensions) and + PostgreSQL configuration settings in the same operation. The operator first + applies the image change, followed by the configuration changes in a subsequent + rollout, ensuring safe and consistent cluster updates. + (#8115)
+Introduced securityContext at the pod level and containerSecurityContext
+ for individual containers (including postgres, init, and sidecars).
+ This provides granular control over security settings, replacing the previous
+ cluster-wide postgres and operator user settings. Contributed by @x0ddf.
+ (#6614)
Adopted standard Kubernetes recommended labels (e.g.,
+ app.kubernetes.io/name) for all resources generated by CloudNativePG
+ (Clusters, Backups, Poolers, etc.). Contributed by @JefeDavis.
+ (#8087)
Introduced a new caching layer for user-defined monitoring queries to reduce + load on the PostgreSQL database. + (#8003)
+Introduced the alpha.cnpg.io/unrecoverable=true annotation for replica
+ pods. When applied, this annotation instructs the operator to permanently
+ delete the instance by removing its Pod and PVCs, after which it will recreate
+ the replica from the primary.
+ (#8178)
Enhanced PgBouncer integration by automatically setting auth_dbname to the
+ pgbouncer database, simplifying auth setup.
+ (#8671)
Allowed providing stage-specific pg_restore options (preRestore,
+ postRestore, dataRestore) during database import. Contributed by
+ @hanshal101.
+ (#7690)
Added the PostgreSQL majorVersion to the Backup object's status for
+ easier identification and management.
+ (#8464)
Allowed providing fine-grained custom TLS configurations for PgBouncer. The
+ Pooler CRD was extended with clientTLSSecret, clientCASecret,
+ serverTLSSecret, and serverCASecret fields under .spec.pgbouncer.
+ These fields enable users to supply their own certificates for both
+ client-to-pooler and pooler-to-server connections, taking precedence over the
+ operator-generated certificates.
+ (#8692)
Added optional TLS support for the operator's metrics server (port 8080).
+ This feature is opt-in and enabled by setting the METRICS_CERT_DIR
+ environment variable, which instructs the operator to look for tls.crt and
+ tls.key files in the specified directory. When unset, the server continues to
+ use HTTP for backward compatibility.
+ (#8997)
Enabled cnpg report operator to work with minimal permissions by making
+ only the operator deployment required. All other resources (pods, secrets,
+ config maps, events, webhooks, and OLM data) are now optional and collected on
+ a best-efforts basis. The command gracefully handles permission errors for
+ those resources by logging clear warnings and continuing report generation with
+ available data, rather than failing completely. This enables least-privileged
+ access, where users may have limited, namespace-scoped permissions.
+ (#8982)
Fixed the CREATE PUBLICATION SQL generation for multi-table publications to
+ be backward-compatible with PostgreSQL 13+. The previously generated syntax
+ was only valid for PostgreSQL 15+ and caused syntax errors on older versions.
+ (#8888)
Fixed backup failures in complex pod definitions by reliably selecting the
+ postgres container by name instead of by index.
+ (#8964)
cnpg plugin:
cnpg report log collection, especially when fetching previous
+ logs. The collector now correctly fetches previous and current logs in
+ separate requests and gracefully handles missing previous logs (e.g., on
+ containers with no restart history), ensuring current logs are always
+ collected.
+ (#8992)A replica cluster is a CloudNativePG Cluster resource designed to
+replicate data from another PostgreSQL instance, ideally also managed by
+CloudNativePG.
Typically, a replica cluster is deployed in a different Kubernetes cluster in +another region. These clusters can be configured to perform cascading +replication and can rely on object stores for data replication from the source, +as detailed further down.
+There are primarily two use cases for replica clusters:
+Disaster Recovery and High Availability: Enhance disaster recovery and, + to some extent, high availability of a CloudNativePG cluster across different + Kubernetes clusters, typically located in different regions. In CloudNativePG + terms, this is known as a "Distributed Topology".
+Read-Only Workloads: Create standalone replicas of a PostgreSQL cluster + for purposes such as reporting or Online Analytical Processing (OLAP). These + replicas are primarily for read-only workloads. In CloudNativePG terms, this + is referred to as a "Standalone Replica Cluster".
+For example, the diagram below — taken from the "Architecture" section +— illustrates a distributed PostgreSQL topology spanning two Kubernetes +clusters, with a symmetric replica cluster primarily serving disaster recovery +purposes.
+
CloudNativePG builds on the PostgreSQL replication framework, allowing you to +create and synchronize a PostgreSQL cluster from an existing source cluster +using the replica cluster feature — described in this section. The source can +be a primary cluster or another replica cluster (cascading replication).
+A replica cluster operates in continuous recovery mode, meaning no changes to
+the database, including the catalog and global objects like roles or databases,
+are permitted. These changes are deferred until the Cluster transitions to
+primary. During this phase, global objects such as roles remain as defined in
+the source cluster. CloudNativePG applies any local redefinitions once the
+cluster is promoted.
If you are not planning to promote the cluster (e.g., for read-only workloads) +or if you intend to detach completely from the source cluster +once the replica cluster is promoted, you don't need to take any action. +This is normally the case of the "Standalone Replica Cluster".
+If you are planning to promote the cluster at some point, CloudNativePG will +manage the following roles and passwords when transitioning from replica +cluster to primary:
+If your intention is to seamlessly ensure that the above roles and passwords
+don't change, you need to define the necessary secrets for the above in each
+Cluster.
+This is normally the case of the "Distributed Topology".
The first step is to bootstrap the replica cluster using one of the following +methods:
+pg_basebackupFor detailed instructions on cloning a PostgreSQL server using pg_basebackup
+(streaming) or recovery (volume snapshot or object store), refer to the
+"Bootstrap" section.
Once the base backup for the replica cluster is available, you need to define +how changes will be replicated from the origin using PostgreSQL continuous +recovery. There are three main options:
+restore_command.When configuring the external cluster, you have the following options:
+plugin section:
restore_job
+ and the wal protocols.connectionParameters section:
pg_basebackup section.primary_conninfo option in the
+ designated primary instance, initiating a WAL receiver process to connect
+ to the source cluster and receive data.You still have access to the barmanObjectStore section, although deprecated:
restore_command in the designated primary instance.recovery section if volume snapshots are not feasible.The replica cluster can perform backups to a reserved object store from the +designated primary, supporting symmetric architectures in a distributed +environment. This architectural choice is crucial as it ensures the cluster is +prepared for promotion during a controlled data center switchover or a failover +following an unexpected event.
+You have the flexibility to design your preferred distributed architecture for +a PostgreSQL database, choosing from:
+To set up a replica cluster from a source cluster, follow these steps to create +a cluster YAML file and configure it accordingly:
+Define External Clusters:
+externalClusters section, specify the replica cluster.Bootstrap the Replica Cluster:
+pg_basebackup section for bootstrapping
+ via streaming replication.recovery section to
+ bootstrap from a volume snapshot or an object store..spec.replica stanza:primary, source, and
+ self fields along with the distributed topology defined in
+ externalClusters. This allows CloudNativePG to declaratively control the
+ demotion of a primary cluster and the subsequent promotion of a replica cluster
+ using a promotion token.enabled option and set the source field to point to an
+ externalClusters name. This configuration is suitable for creating replicas
+ primarily intended for read-only workloads.Both the Distributed Topology and the Standalone Replica Cluster strategies for +continuous recovery are thoroughly explained below.
+As Dwight Eisenhower famously said, "Planning is everything", and this holds +true for designing PostgreSQL architectures in Kubernetes.
+First, conceptualize your distributed topology on paper, and then translate it +into a CloudNativePG API configuration. This configuration primarily involves:
+externalClusters section, which must be included in every Cluster
+ definition within your distributed PostgreSQL setup..spec.replica stanza, specifically the primary, source, and
+ (optionally) self fields.For example, suppose you want to deploy a PostgreSQL cluster distributed across +two Kubernetes clusters located in Southern Europe and Central Europe.
+In this scenario, assume you have CloudNativePG installed in the Southern
+Europe Kubernetes cluster, with a PostgreSQL Cluster named cluster-eu-south
+acting as the primary. This cluster has continuous backup configured with a
+local object store. This object store is also accessible by the PostgreSQL
+Cluster named cluster-eu-central, installed in the Central European
+Kubernetes cluster. Initially, cluster-eu-central functions as a replica
+cluster. Following a symmetric approach, it also has a local object store for
+continuous backup, which needs to be read by cluster-eu-south.
In this example, recovery is performed solely through WAL shipping, without any +streaming replication between the two clusters. However, you can configure the +setup to use streaming replication alone or adopt a hybrid approach—streaming +replication with WAL shipping as a fallback—as described in the +“Configuring replication” +section.
+Here’s how you would configure the externalClusters section for both
+Cluster resources, relying on Barman Cloud Plugin for the object store:
# Distributed topology configuration
+externalClusters:
+ - name: cluster-eu-south
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: cluster-eu-south
+ serverName: cluster-eu-south
+ - name: cluster-eu-central
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: cluster-eu-central
+ serverName: cluster-eu-central
+
+The .spec.replica stanza for the cluster-eu-south PostgreSQL primary
+Cluster should be configured as follows:
replica:
+ primary: cluster-eu-south
+ source: cluster-eu-central
+
+Meanwhile, the .spec.replica stanza for the cluster-eu-central PostgreSQL
+replica Cluster should be configured as:
replica:
+ primary: cluster-eu-south
+ source: cluster-eu-south
+
+In this configuration, when the primary field matches the name of the
+Cluster resource (or .spec.replica.self if a different one is used), the
+current cluster is considered the primary in the distributed topology.
+Otherwise, it is set as a replica from the source (in this case, using the
+Barman object store).
This setup allows you to efficiently manage a distributed PostgreSQL +architecture across multiple Kubernetes clusters, ensuring both high +availability and disaster recovery through controlled switchover of a primary +PostgreSQL cluster using declarative configuration.
+Controlled switchover in a distributed topology is a two-step process +involving:
+These processes are described in the next sections.
+Important
+Before you proceed, ensure you review the "About PostgreSQL Roles" section
+above and use identical role definitions, including secrets, in all
+Cluster objects participating in the distributed topology.
CloudNativePG provides the functionality to demote a primary cluster to a
+replica cluster. This action is typically planned when transitioning the
+primary role from one data center to another. The process involves demoting the
+current primary cluster (e.g., cluster-eu-south) to a replica cluster and
+subsequently promoting the designated replica cluster (e.g.,
+cluster-eu-central) to primary when fully synchronized.
Provided you have defined an external cluster in the current primary Cluster
+resource that points to the replica cluster that's been selected to become the
+new primary, all you need to do is change the primary field as follows:
replica:
+ primary: cluster-eu-central
+ source: cluster-eu-central
+
+When the primary PostgreSQL cluster is demoted, write operations are no +longer possible. CloudNativePG then:
+Archives the WAL file containing the shutdown checkpoint as a .partial
+ file in the WAL archive.
Generates a demotionToken in the status, a base64-encoded JSON structure
+ containing relevant information from pg_controldata such as the system
+ identifier, the timestamp, timeline ID, REDO location, and REDO WAL file of the
+ latest checkpoint.
The first step is necessary to demote/promote using solely the WAL archive to +feed the continuous recovery process (without streaming replication).
+The second step, generation of the .status.demotionToken, will ensure a
+smooth demotion/promotion process, without any data loss and without rebuilding
+the former primary.
At this stage, the former primary has transitioned to a replica cluster,
+awaiting WAL data from the new global primary: cluster-eu-central.
To proceed with promoting the other cluster, you need to retrieve the
+demotionToken from cluster-eu-south using the following command:
kubectl get cluster cluster-eu-south \
+ -o jsonpath='{.status.demotionToken}'
+
+You can obtain the demotionToken using the cnpg plugin by checking the
+cluster's status. The token is listed under the Demotion token section.
Note
+The demotionToken obtained from cluster-eu-south will serve as the
+promotionToken for cluster-eu-central.
You can verify the role change using the cnpg plugin, checking the status of
+the cluster:
kubectl cnpg status cluster-eu-south
+
+To promote a PostgreSQL replica cluster (e.g., cluster-eu-central) to a
+primary cluster and make the designated primary an actual primary instance,
+you need to perform the following steps simultaneously:
.spec.replica.primary to the name of the current replica cluster
+ to be promoted (e.g., cluster-eu-central)..spec.replica.promotionToken with the value obtained from the
+ former primary cluster (refer to "Demoting a Primary to a Replica Cluster").The updated replica section in cluster-eu-central's spec should look like
+this:
replica:
+ primary: cluster-eu-central
+ promotionToken: <PROMOTION_TOKEN>
+ source: cluster-eu-south
+
+Warning
+It is crucial to apply the changes to the primary and promotionToken
+fields simultaneously. If the promotion token is omitted, a failover will be
+triggered, necessitating a rebuild of the former primary.
After making these adjustments, CloudNativePG will initiate the promotion of
+the replica cluster to a primary cluster. Initially, CloudNativePG will wait
+for the designated primary cluster to replicate all Write-Ahead Logging (WAL)
+information up to the specified Log Sequence Number (LSN) contained in the
+token. Once this target is achieved, the promotion process will commence. The
+new primary cluster will switch timelines, archive the history file and new
+WAL, thereby unblocking the replication process in the cluster-eu-south
+cluster, which will then operate as a replica.
To verify the role change, use the cnpg plugin to check the status of the
+cluster:
kubectl cnpg status cluster-eu-central
+
+This command will provide you with the current status of cluster-eu-central,
+confirming its promotion to primary.
By following these steps, you ensure a smooth and controlled promotion process, +minimizing disruption and maintaining data integrity across your PostgreSQL +clusters.
+Important
+Standalone Replica Clusters were previously known as Replica Clusters +before the introduction of the Distributed Topology strategy in CloudNativePG +1.24.
+In CloudNativePG, a Standalone Replica Cluster is a PostgreSQL cluster in +continuous recovery with the following configurations:
+.spec.replica.enabled set to true.spec.replica.source field,
+ pointing to an externalClusters nameWhen .spec.replica.enabled is set to false, the replica cluster exits
+continuous recovery mode and becomes a primary cluster, completely detached
+from the original source.
Warning
+Disabling replication is an irreversible operation. Once replication is +disabled and the designated primary is promoted to primary, the replica cluster +and the source cluster become two independent clusters definitively.
+Important
+Standalone replica clusters are suitable for several use cases, primarily +involving read-only workloads. If you are planning to setup a disaster +recovery solution, look into "Distributed Topology" above.
+Although Standalone Replica Clusters can be used for disaster recovery +purposes, they differ from the "Distributed Topology" strategy in several key +ways:
+Failover is identical in both strategies, requiring the former primary to be +re-cloned if it ever comes back up.
+pg_basebackupThis first example defines a standalone replica cluster using streaming +replication in both bootstrap and continuous recovery. The replica cluster +connects to the source cluster using TLS authentication.
+You can check the sample YAML
+in the samples/ subdirectory.
Note the bootstrap and replica sections pointing to the source cluster.
bootstrap:
+ pg_basebackup:
+ source: cluster-example
+
+ replica:
+ enabled: true
+ source: cluster-example
+
+The previous configuration assumes that the application database and its owning
+user are set to the default, app. If the PostgreSQL cluster being restored
+uses different names, you must specify them as documented in Configure the application database.
+You should also consider copying over the application user secret from
+the original cluster and keep it synchronized with the source.
+See "About PostgreSQL Roles" for more details.
In the externalClusters section, remember to use the right namespace for the
+host in the connectionParameters sub-section.
+The -replication and -ca secrets should have been copied over if necessary,
+in case the replica cluster is in a separate namespace.
externalClusters:
+ - name: <MAIN-CLUSTER>
+ connectionParameters:
+ host: <MAIN-CLUSTER>-rw.<NAMESPACE>.svc
+ user: streaming_replica
+ sslmode: verify-full
+ dbname: postgres
+ sslKey:
+ name: <MAIN-CLUSTER>-replication
+ key: tls.key
+ sslCert:
+ name: <MAIN-CLUSTER>-replication
+ key: tls.crt
+ sslRootCert:
+ name: <MAIN-CLUSTER>-ca
+ key: ca.crt
+
+The second example defines a replica cluster that bootstraps from an object
+store using the recovery section and continuous recovery using both streaming
+replication and the given object store. For streaming replication, the replica
+cluster connects to the source cluster using basic authentication.
You can check the sample YAML
+for it in the samples/ subdirectory.
Note the bootstrap and replica sections pointing to the source cluster.
bootstrap:
+ recovery:
+ source: cluster-example
+
+ replica:
+ enabled: true
+ source: cluster-example
+
+The previous configuration assumes that the application database and its owning
+user are set to the default, app. If the PostgreSQL cluster being restored
+uses different names, you must specify them as documented in Configure the application database.
+You should also consider copying over the application user secret from
+the original cluster and keep it synchronized with the source.
+See "About PostgreSQL Roles" for more details.
In the externalClusters section, take care to use the right namespace in the
+endpointURL and the connectionParameters.host.
+And do ensure that the necessary secrets have been copied if necessary, and that
+a backup of the source cluster has been created already.
externalClusters:
+ - name: <MAIN-CLUSTER>
+ # Example with Barman Cloud Plugin
+ plugin:
+ name: barman-cloud.cloudnative-pg.io
+ parameters:
+ barmanObjectName: <MAIN-CLUSTER>
+ serverName: <MAIN-CLUSTER>
+ …
+ connectionParameters:
+ host: <MAIN-CLUSTER>-rw.default.svc
+ user: postgres
+ dbname: postgres
+ password:
+ name: <MAIN-CLUSTER>-superuser
+ key: password
+
+Note
+To use streaming replication between the source cluster and the replica +cluster, we need to make sure there is network connectivity between the two +clusters, and that all the necessary secrets which hold passwords or +certificates are properly created in advance.
+If you use volume snapshots and your storage class provides +snapshots cross-cluster availability, you can leverage that to +bootstrap a replica cluster through a volume snapshot of the +source cluster.
+The third example defines a replica cluster that bootstraps
+from a volume snapshot using the recovery section. It uses
+streaming replication (via basic authentication) and the object
+store to fetch the WAL files.
You can check the sample YAML
+for it in the samples/ subdirectory.
The example assumes that the application database and its owning
+user are set to the default, app. If the PostgreSQL cluster being restored
+uses different names, you must specify them as documented in Configure the
+application database.
+You should also consider copying over the application user secret from
+the original cluster and keep it synchronized with the source.
+See "About PostgreSQL Roles" for more details.
CloudNativePG supports the creation of delayed replicas through the
+.spec.replica.minApplyDelay option,
+leveraging PostgreSQL's
+recovery_min_apply_delay.
Delayed replicas are designed to intentionally lag behind the primary database
+by a specified amount of time. This delay is configurable using the
+.spec.replica.minApplyDelay option, which maps to the underlying
+recovery_min_apply_delay parameter in PostgreSQL.
The primary objective of delayed replicas is to mitigate the impact of
+unintended SQL statement executions on the primary database. This is especially
+useful in scenarios where operations such as UPDATE or DELETE are performed
+without a proper WHERE clause.
To configure a delay in a replica cluster, adjust the
+.spec.replica.minApplyDelay option. This parameter determines how much time
+the replicas will lag behind the primary. For example:
# ...
+ replica:
+ enabled: true
+ source: cluster-example
+ # Enforce a delay of 8 hours
+ minApplyDelay: '8h'
+ # ...
+
+The above example helps safeguard against accidental data modifications by +providing a buffer period of 8 hours to detect and correct issues before they +propagate to the replicas.
+Monitor and adjust the delay as needed based on your recovery time objectives +and the potential impact of unintended primary database operations.
+The main use cases of delayed replicas can be summarized into:
+mitigating human errors: reduce the risk of data corruption or loss + resulting from unintentional SQL operations on the primary database
+recovery time optimization: facilitate quicker recovery from unintended + changes by having a delayed replica that allows you to identify and rectify + issues before changes are applied to other replicas.
+enhanced data protection: safeguard critical data by introducing a time + buffer that provides an opportunity to intervene and prevent the propagation of + undesirable changes.
+Warning
+The minApplyDelay option of delayed replicas cannot be used in
+conjunction with promotionToken.
By integrating delayed replicas into your replication strategy, you can enhance +the resilience and data protection capabilities of your PostgreSQL environment. +Adjust the delay duration based on your specific needs and the criticality of +your data.
+Important
+Always measure your goals. Depending on your environment, it might be more +efficient to rely on volume snapshot-based recovery for faster outcomes. +Evaluate and choose the approach that best aligns with your unique requirements +and infrastructure.
+Physical replication is one of the strengths of PostgreSQL and one of the +reasons why some of the largest organizations in the world have chosen it for +the management of their data in business continuity contexts. Primarily used to +achieve high availability, physical replication also allows scale-out of +read-only workloads and offloading of some work from the primary.
+Important
+This section is about replication within the same Cluster resource
+managed in the same Kubernetes cluster. For information about how to
+replicate with another Postgres Cluster resource, even across different
+Kubernetes clusters, please refer to the
+"Replica clusters" section.
Having contributed throughout the years to the replication feature in +PostgreSQL, we have decided to build high availability in CloudNativePG on top +of the native physical replication technology, and integrate it directly in the +Kubernetes API.
+In Kubernetes terms, this is referred to as application-level replication, +in contrast with storage-level replication.
+PostgreSQL has a very robust and mature native framework for replicating data +from the primary instance to one or more replicas, built around the concept of +transactional changes continuously stored in the WAL (Write Ahead Log).
+Started as the evolution of crash recovery and point in time recovery +technologies, physical replication was first introduced in PostgreSQL 8.2 +(2006) through WAL shipping from the primary to a warm standby in continuous +recovery.
+PostgreSQL 9.0 (2010) introduced WAL streaming and read-only replicas through +hot standby. In 2011, PostgreSQL 9.1 brought synchronous replication at the +transaction level, supporting RPO=0 clusters. Cascading +replication was added in PostgreSQL 9.2 (2012). The foundations for +logical replication were established in PostgreSQL +9.4 (2014), and version 10 (2017) introduced native support for the +publisher/subscriber pattern to replicate data from an origin to a destination. The +table below summarizes these milestones.
+| Version | +Year | +Feature | +
|---|---|---|
| 8.2 | +2006 | +Warm Standby with WAL shipping | +
| 9.0 | +2010 | +Hot Standby and physical streaming replication | +
| 9.1 | +2011 | +Synchronous replication (priority-based) | +
| 9.2 | +2012 | +Cascading replication | +
| 9.4 | +2014 | +Foundations of logical replication | +
| 10 | +2017 | +Logical publisher/subscriber and quorum-based synchronous replication | +
This table highlights key PostgreSQL replication features and their respective +versions.
+At the moment, CloudNativePG natively and transparently manages physical
+streaming replicas within a cluster in a declarative way, based on the number of
+provided instances in the spec:
replicas = instances - 1 (where instances > 0)
+
+Immediately after the initialization of a cluster, the operator creates a user
+called streaming_replica as follows:
CREATE USER streaming_replica WITH REPLICATION;
+-- NOSUPERUSER INHERIT NOCREATEROLE NOCREATEDB NOBYPASSRLS
+
+Out of the box, the operator automatically sets up streaming replication within
+the cluster over an encrypted channel and enforces TLS client certificate
+authentication for the streaming_replica user - as highlighted by the
+following excerpt taken from pg_hba.conf:
# Require client certificate authentication for the streaming_replica user
+hostssl postgres streaming_replica all cert map=cnpg_streaming_replica
+hostssl replication streaming_replica all cert map=cnpg_streaming_replica
+
+Certificates
+For details on how CloudNativePG manages certificates, please refer +to the "Certificates" section +in the documentation.
+If configured, the operator manages replication slots for all the replicas in the +HA cluster, ensuring that WAL files required by each standby are retained on +the primary's storage, even after a failover or switchover.
+Replication slots for High Availability
+For details on how CloudNativePG automatically manages replication slots for the +High Availability replicas, please refer to the +"Replication slots for High Availability" section +below.
+In case continuous backup is configured in the cluster, CloudNativePG
+transparently configures replicas to take advantage of restore_command when in
+continuous recovery. As a result, PostgreSQL can use the WAL archive as a
+fallback option whenever pulling WALs via streaming replication fails.
CloudNativePG supports both +quorum-based and priority-based synchronous replication for PostgreSQL.
+Warning
+By default, synchronous replication pauses write operations if the required +number of standby nodes for WAL replication during transaction commits is +unavailable. This behavior prioritizes data durability and aligns with +PostgreSQL DBA best practices. However, if self-healing is a higher priority +than strict data durability in your setup, this setting can be adjusted. For +details on managing this behavior, refer to the Data Durability and Synchronous Replication +section.
+Important
+The failover quorum feature (experimental) +can be used alongside synchronous replication to improve data durability +and safety during failover events.
+Direct configuration of the synchronous_standby_names option is not
+permitted. However, CloudNativePG automatically populates this option with the
+names of local pods, while also allowing customization to extend synchronous
+replication beyond the Cluster resource.
+This can be achieved through the
+.spec.postgresql.synchronous stanza.
Synchronous replication is disabled by default (the synchronous stanza is not
+defined). When defined, two options are mandatory:
method: either any (quorum) or first (priority)number: the number of synchronous standby servers that transactions must
+ wait for responses fromIn PostgreSQL, quorum-based synchronous replication ensures that transaction
+commits wait until their WAL records are replicated to a specified number of
+standbys. To enable this, set the method to any.
This replication method is the most common setup for a CloudNativePG cluster.
+The example below, based on a typical cluster-example configuration with
+three instances, sets up quorum-based synchronous replication with at least one
+instance:
postgresql:
+ synchronous:
+ method: any
+ number: 1
+
+With this configuration, CloudNativePG automatically sets the content of
+synchronous_standby_names as follows:
ANY 1 (cluster-example-2, cluster-example-3, cluster-example-1)
+
+This section outlines how to migrate from the deprecated quorum-based +synchronous replication format to the newer, more robust implementation in +CloudNativePG.
+Given the following manifest:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: angus
+spec:
+ instances: 3
+ minSyncReplicas: 1
+ maxSyncReplicas: 1
+
+ storage:
+ size: 1Gi
+
+You can update it to the new format as follows:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: angus
+spec:
+ instances: 3
+
+ storage:
+ size: 1Gi
+
+ postgresql:
+ synchronous:
+ method: any
+ number: 1
+ dataDurability: required
+
+To prioritize self-healing over strict data durability, set dataDurability
+to preferred instead.
PostgreSQL's priority-based synchronous replication makes transaction commits
+wait until their WAL records are replicated to the requested number of
+synchronous standbys chosen based on their priorities. Standbys listed earlier
+in the synchronous_standby_names option are given higher priority and
+considered synchronous. If a current synchronous standby disconnects, it is
+immediately replaced by the next-highest-priority standby. To use this method,
+set method to first.
Important
+Currently, this method is most useful when extending
+synchronous replication beyond the current cluster using the
+maxStandbyNamesFromCluster, standbyNamesPre, and standbyNamesPost
+options explained below.
synchronous_standby_names ContentBy default, CloudNativePG populates synchronous_standby_names with the names
+of local pods in a Cluster resource, ensuring synchronous replication within
+the PostgreSQL cluster. You can customize the content of
+synchronous_standby_names based on your requirements and replication method
+(quorum or priority) using the following optional parameters in the
+.spec.postgresql.synchronous stanza:
maxStandbyNamesFromCluster: the maximum number of pod names from the local
+ Cluster object that can be automatically included in the
+ synchronous_standby_names option in PostgreSQL.standbyNamesPre: a list of standby names (specifically application_name)
+ to be prepended to the list of local pod names automatically listed by the
+ operator.standbyNamesPost: a list of standby names (specifically application_name)
+ to be appended to the list of local pod names automatically listed by the
+ operator.Warning
+You are responsible for ensuring the correct names in standbyNamesPre and
+standbyNamesPost. CloudNativePG expects that you manage any standby with
+an application_name listed here, ensuring their high availability.
+Incorrect entries can jeopardize your PostgreSQL database uptime.
Here are some examples, all based on a cluster-example with three instances:
If you set:
+postgresql:
+ synchronous:
+ method: any
+ number: 1
+ maxStandbyNamesFromCluster: 1
+ standbyNamesPre:
+ - angus
+
+The content of synchronous_standby_names will be:
ANY 1 (angus, cluster-example-2)
+
+If you set:
+postgresql:
+ synchronous:
+ method: any
+ number: 1
+ maxStandbyNamesFromCluster: 0
+ standbyNamesPre:
+ - angus
+ - malcolm
+
+The content of synchronous_standby_names will be:
ANY 1 (angus, malcolm)
+
+If you set:
+postgresql:
+ synchronous:
+ method: first
+ number: 2
+ maxStandbyNamesFromCluster: 1
+ standbyNamesPre:
+ - angus
+ standbyNamesPost:
+ - malcolm
+
+The synchronous_standby_names option will look like:
FIRST 2 (angus, cluster-example-2, malcolm)
+
+The dataDurability option in the .spec.postgresql.synchronous stanza
+controls the trade-off between data safety and availability for synchronous
+replication. It can be set to required or preferred, with the default being
+required if not specified.
Important
+preferred can only be used when standbyNamesPre and standbyNamesPost
+are unset.
When dataDurability is set to required, PostgreSQL only considers
+transactions committed once WAL (Write-Ahead Log) records have been replicated
+to the specified number of synchronous standbys. This setting prioritizes data
+safety over availability, meaning write operations will pause if the required
+number of synchronous standbys is unavailable. This ensures zero data loss
+(RPO=0) but may reduce database availability during network disruptions or
+standby failures.
Synchronous standbys are selected in this priority order:
+The list is then truncated based on maxStandbyNamesFromCluster if this value
+is set, prioritizing healthy instances and ensuring synchronous_standby_names
+is populated.
Consider the following example:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: foo
+spec:
+ instances: 3
+ postgresql:
+ synchronous:
+ method: any
+ number: 1
+ dataDurability: required
+
+Initial state. The content of synchronous_standby_names is:
ANY 1 ("foo-2","foo-3","foo-1")
foo-2 becomes unavailable. It gets pushed back in priority:
ANY 1 ("foo-3","foo-2","foo-1")
foo-3 also becomes unavailable. The list contains no healthy standbys:
ANY 1 ("foo-2","foo-3","foo-1")
At this point no write operations will be allowed until at least one of the +standbys is available again.
+When the standbys are available again, synchronous_standby_names will
+ be back to the initial state.
When dataDurability is set to preferred, the required number of synchronous
+instances adjusts based on the number of available standbys. PostgreSQL will
+attempt to replicate WAL records to the designated number of synchronous
+standbys, but write operations will continue even if fewer than the requested
+number of standbys are available.
Important
+Make sure you have a clear understanding of what ready/available means +for a replica and set your expectations accordingly. By default, a replica is +considered ready when it has successfully connected to the source at least +once. However, CloudNativePG allows you to configure startup and readiness +probes for replicas based on maximum lag. For more details, please refer to +the "Postgres instance manager" section.
+This setting balances data safety with availability, enabling applications to +continue writing during temporary standby unavailability—hence, it’s also known +as self-healing mode.
+Warning
+This mode may result in data loss if all standbys become unavailable.
+With preferred data durability, only healthy replicas are included in
+synchronous_standby_names.
Consider the following example. For demonstration, we’ll use a cluster named
+bar with 5 instances and 2 synchronous standbys:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: bar
+spec:
+ instances: 5
+ postgresql:
+ synchronous:
+ method: any
+ number: 2
+ dataDurability: preferred
+
+Initial state. The content of synchronous_standby_names is:
ANY 2 ("bar-2","bar-3", "bar-4", "bar-5")
bar-2 and bar-3 become unavailable. They are removed from the list:
ANY 2 ("bar-4", "bar-5")
bar-4 also becomes unavailable. It gets removed from the list. Since the
+ number of available standbys is less than the requested number, the requested
+ amount gets reduced:
ANY 1 ("bar-5")
bar-5 also becomes unavailable. synchronous_standby_names becomes empty,
+ disabling synchronous replication completely. Write operations will continue,
+ but with the risk of potential data loss in case of a primary failure.
synchronous_standby_names will be back to
+ the initial state.Warning
+Prior to CloudNativePG 1.24, only the quorum-based synchronous replication
+implementation was supported. Although this method is now deprecated, it
+will not be removed anytime soon.
+The new method prioritizes data durability over self-healing and offers more
+robust features, including priority-based synchronous replication and full
+control over the synchronous_standby_names option.
+It is recommended to gradually migrate to the new configuration method for
+synchronous replication, as explained in the previous paragraph.
Important
+The deprecated method and the new method are mutually exclusive.
+CloudNativePG supports the configuration of quorum-based synchronous
+streaming replication via two configuration options called minSyncReplicas
+and maxSyncReplicas, which are the minimum and the maximum number of expected
+synchronous standby replicas available at any time.
+For self-healing purposes, the operator always compares these two values with
+the number of available replicas to determine the quorum.
Important
+By default, synchronous replication selects among all the available
+replicas indistinctively. You can limit on which nodes your synchronous
+replicas can be scheduled, by working on node labels through the
+syncReplicaElectionConstraint option as described in the next section.
Synchronous replication is disabled by default (minSyncReplicas and
+maxSyncReplicas are not defined).
+In case both minSyncReplicas and maxSyncReplicas are set, CloudNativePG
+automatically updates the synchronous_standby_names option in
+PostgreSQL to the following value:
ANY q (pod1, pod2, ...)
+
+Where:
+q is an integer automatically calculated by the operator to be:
+ 1 <= minSyncReplicas <= q <= maxSyncReplicas <= readyReplicaspod1, pod2, ... is the list of all PostgreSQL pods in the clusterWarning
+To provide self-healing capabilities, the operator can ignore
+minSyncReplicas if such value is higher than the currently available
+number of replicas. Synchronous replication is automatically disabled
+when readyReplicas is 0.
As stated in the
+PostgreSQL documentation,
+the method ANY specifies a quorum-based synchronous replication and makes
+transaction commits wait until their WAL records are replicated to at least the
+requested number of synchronous standbys in the list.
Important
+Even though the operator chooses self-healing over enforcement of
+synchronous replication settings, our recommendation is to plan for
+synchronous replication only in clusters with 3+ instances or,
+more generally, when maxSyncReplicas < (instances - 1).
CloudNativePG enables you to select which PostgreSQL instances are eligible to +participate in a quorum-based synchronous replication set through anti-affinity +rules based on the node labels where the PVC holding the PGDATA and the +Postgres pod are.
+Scheduling
+For more information on the general pod affinity and anti-affinity rules, +please check the "Scheduling" section.
+Warning
+The .spec.postgresql.syncReplicaElectionConstraint option only applies to the
+legacy implementation of synchronous replication
+(see "Synchronous Replication (Deprecated)").
As an example use-case for this feature: in a cluster with a single sync
+replica, we would be able to ensure the sync replica will be in a different
+availability zone from the primary instance, usually identified by
+the topology.kubernetes.io/zone
+label on a node.
+This would increase the robustness of the cluster in case of an outage in a
+single availability zone, especially in terms of recovery point objective
+(RPO).
The idea of anti-affinity is to ensure that sync replicas that participate in +the quorum are chosen from pods running on nodes that have different values for +the selected labels (in this case, the availability zone label) then the node +where the primary is currently in execution. If no node matches such criteria, +the replicas are eligible for synchronous replication.
+Important
+The self-healing enforcement still applies while defining additional +constraints for synchronous replica election +(see "Synchronous replication").
+The example below shows how this can be done through the
+syncReplicaElectionConstraint section within .spec.postgresql.
+nodeLabelsAntiAffinity allows you to specify those node labels that need to be
+evaluated to make sure that synchronous replication will be dynamically
+configured by the operator between the current primary and the replicas which
+are located on nodes having a value of the availability zone label different
+from that of the node where the primary is:
spec:
+ instances: 3
+ postgresql:
+ syncReplicaElectionConstraint:
+ enabled: true
+ nodeLabelsAntiAffinity:
+ - topology.kubernetes.io/zone
+
+As you can imagine, the availability zone is just an example, but you could +customize this behavior based on other labels that describe the node, such +as storage, CPU, or memory.
+Replication slots +are a native PostgreSQL feature introduced in 9.4 that provides an automated way +to ensure that the primary does not remove WAL segments until all the attached +streaming replication clients have received them, and that the primary does not +remove rows which could cause a recovery conflict even when the standby is ( +temporarily) disconnected.
+A replication slot exists solely on the instance that created it, and PostgreSQL +does not replicate it on the standby servers. As a result, after a failover or a +switchover, the new primary does not contain the replication slot from the old +primary. This can create problems for the streaming replication clients that +were connected to the old primary and have lost their slot.
+CloudNativePG provides a turn-key solution to synchronize the content of +physical replication slots from the primary to each standby, addressing two use +cases:
+CloudNativePG fills this gap by introducing the concept of cluster-managed +replication slots, starting with high availability clusters. This feature +automatically manages physical replication slots for each hot standby replica in +the High Availability cluster, both in the primary and the standby.
+In CloudNativePG, we use the terms:
+pg_replication_slots view in the primary, and updated at regular
+ intervals using pg_replication_slot_advance().This feature is enabled by default and can be disabled via configuration. For +details, please refer to the +"replicationSlots" section in the API reference. +Here follows a brief description of the main options:
+.spec.replicationSlots.highAvailability.enabledtrue, the feature is enabled (true is the default).spec.replicationSlots.highAvailability.slotPrefix_cnpg_).spec.replicationSlots.updateIntervalAlthough it is not recommended, if you desire a different behavior, you can +customize the above options.
+For example, the following manifest will create a cluster with replication +slots disabled.
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ # Disable replication slots for HA in the cluster
+ replicationSlots:
+ highAvailability:
+ enabled: false
+
+ storage:
+ size: 1Gi
+
+Although CloudNativePG doesn't support a way to declaratively define physical +replication slots, you can still create your own slots via SQL.
+Information
+At the moment, we don't have any plans to manage replication slots +in a declarative way, but it might change depending on the feedback +we receive from users. The reason is that replication slots exist +for a specific purpose and each should be managed by a specific application +the oversees the entire lifecycle of the slot on the primary.
+CloudNativePG can manage the synchronization of any user managed physical +replication slots between the primary and standbys, similarly to what it does +for the HA replication slots explained above (the only difference is that you +need to create the replication slot).
+This feature is enabled by default (meaning that any replication slot is
+synchronized), but you can disable it or further customize its behavior (for
+example by excluding some slots using regular expressions) through the
+synchronizeReplicas stanza. For example:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ replicationSlots:
+ synchronizeReplicas:
+ enabled: true
+ excludePatterns:
+ - "^foo"
+
+For details, please refer to the +"replicationSlots" section in the API reference. +Here follows a brief description of the main options:
+.spec.replicationSlots.synchronizeReplicas.enabled.spec.replicationSlots.synchronizeReplicas.excludePatternsWarning
+Users utilizing this feature should carefully monitor user-defined replication +slots to ensure they align with their operational requirements and do not +interfere with the failover process.
+You can also control the frequency with which a standby queries the
+pg_replication_slots view on the primary, and updates its local copy of
+the replication slots, like in this example:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ # Reduce the frequency of standby HA slots updates to once every 5 minutes
+ replicationSlots:
+ updateInterval: 300
+
+ storage:
+ size: 1Gi
+
+CloudNativePG can synchronize logical decoding (replication) slots across all +nodes in a high-availability cluster, ensuring seamless continuation of logical +replication after a failover or switchover. This feature is disabled by +default, and enabling it requires two steps.
+The first step is to enable logical decoding slot synchronization:
+ # ...
+ replicationSlots:
+ highAvailability:
+ synchronizeLogicalDecoding: true
+
+The second step involves configuring PostgreSQL parameters: the required +configuration depends on your PostgreSQL version, as explained below.
+When enabled, the operator automatically manages logical decoding slot states +during failover and switchover, preventing slot invalidation and avoiding data +loss for logical replication clients.
+For PostgreSQL 17 and newer, CloudNativePG transparently manages the
+synchronized_standby_slots parameter.
You must enable both sync_replication_slots and hot_standby_feedback in
+your PostgreSQL configuration:
# ...
+postgresql:
+ parameters:
+ # ...
+ hot_standby_feedback: 'on'
+ sync_replication_slots: 'on'
+
+Additionally, you must create the logical replication Subscription with the
+failover option enabled, for example:
apiVersion: postgresql.cnpg.io/v1
+kind: Subscription
+# ...
+spec:
+# ...
+ parameters:
+ failover: 'true'
+# ...
+
+When configured, logical WAL sender processes send decoded changes to plugins +only after the specified replication slots confirm receiving and flushing the +relevant WAL, ensuring that:
+For more details on logical replication slot synchronization, see the +PostgreSQL documentation on Logical Replication Failover.
+For PostgreSQL 16 and older versions, CloudNativePG uses the
+pg_failover_slots extension
+to maintain synchronization of logical replication slots across failovers.
When replication slots is enabled, you might end up running out of disk space +due to PostgreSQL trying to retain WAL files requested by a replication slot. +This might happen due to a standby that is (temporarily?) down, or lagging, or +simply an orphan replication slot.
+Starting with PostgreSQL 13, you can take advantage of the
+max_slot_wal_keep_size
+configuration option controlling the maximum size of WAL files that replication
+slots are allowed to retain in the pg_wal directory at checkpoint time. By
+default, in PostgreSQL max_slot_wal_keep_size is set to -1, meaning that
+replication slots may retain an unlimited amount of WAL files. As a result, our
+recommendation is to explicitly set max_slot_wal_keep_size
+when replication slots support is enabled. For example:
# ...
+ postgresql:
+ parameters:
+ max_slot_wal_keep_size: "10GB"
+ # ...
+
+Replication slots must be carefully monitored in your infrastructure. By default,
+we provide the pg_replication_slots metric in our Prometheus exporter with
+key information such as the name of the slot, the type, whether it is active,
+the lag from the primary.
Monitoring
+Please refer to the "Monitoring" section for details on +how to monitor a CloudNativePG deployment.
+In a typical Kubernetes cluster, pods run with unlimited resources. By default, +they might be allowed to use as much CPU and RAM as needed.
+CloudNativePG allows administrators to control and manage resource usage by the pods of the cluster,
+through the resources section of the manifest, with two knobs:
requests: initial requirementlimits: maximum usage, in case of dynamic increase of resource needsFor example, you can request an initial amount of RAM of 32MiB (scalable to 128MiB) and 50m of CPU (scalable to 100m) +as follows:
+ resources:
+ requests:
+ memory: "32Mi"
+ cpu: "50m"
+ limits:
+ memory: "128Mi"
+ cpu: "100m"
+
+Memory requests and limits are associated with containers, but it is useful to think of a pod as having a memory request +and limit. The pod's memory request is the sum of the memory requests for all the containers in the pod.
+Pod scheduling is based on requests and not on limits. A pod is scheduled to run on a Node only if the Node has enough +available memory to satisfy the pod's memory request.
+For each resource, we divide containers into 3 Quality of Service (QoS) classes, in decreasing order of priority:
+For more details, please refer to the "Configure Quality of Service for Pods" +section in the Kubernetes documentation.
+For a PostgreSQL workload it is recommended to set a "Guaranteed" QoS.
+Info
+When the quality of service is set to "Guaranteed", CloudNativePG sets the
+PG_OOM_ADJUST_VALUE for the postmaster process to 0, in line with the
+PostgreSQL documentation.
+This allows the postmaster to retain its low Out-Of-Memory (OOM) score of
+-997, while its child processes run with an OOM score adjustment of 0. As a
+result, if the OOM killer is triggered, it will terminate the child processes
+before the postmaster. This behavior helps keep the PostgreSQL instance
+alive for as long as possible and enables a clean shutdown procedure in the
+event of an eviction.
To avoid resources related issues in Kubernetes, we can refer to the best practices for "out of resource" handling +while creating a cluster:
+OOM Killed and CPU throttle or any other
+ resource-related issues on running instances.You can refer to the following example manifest:
+apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: postgresql-resources
+spec:
+
+ instances: 3
+
+ postgresql:
+ parameters:
+ shared_buffers: "256MB"
+
+ resources:
+ requests:
+ memory: "1024Mi"
+ cpu: 1
+ limits:
+ memory: "1024Mi"
+ cpu: 1
+
+ storage:
+ size: 1Gi
+
+In the above example, we have specified shared_buffers parameter with a value of 256MB - i.e., how much memory is
+dedicated to the PostgreSQL server for caching data (the default value for this parameter is 128MB in case
+it's not defined).
A reasonable starting value for shared_buffers is 25% of the memory in your system.
+For example: if your shared_buffers is 256 MB, then the recommended value for your container memory size is 1 GB,
+which means that within a pod all the containers will have a total of 1 GB memory that Kubernetes will always preserve,
+enabling our containers to work as expected.
+For more details, please refer to the "Resource Consumption"
+section in the PostgreSQL documentation.
Managing Compute Resources for Containers
+For more details on resource management, please refer to the +"Managing Compute Resources for Containers" +page from the Kubernetes documentation.
+The operator allows you to change the PostgreSQL version used in a cluster +while applications continue running against it.
+Rolling upgrades are triggered when:
+you change the imageName attribute in the cluster specification;
you change the list of extension images in the .spec.postgresql.extensions
+ stanza of the cluster specification;
the image catalog is updated with a new image for the + major version used by the cluster;
+a change in the PostgreSQL configuration requires a restart to apply;
+you change the Cluster .spec.resources values;
the operator is updated, ensuring Pods run the latest instance manager + (unless in-place updates are enabled).
+During a rolling upgrade, the operator upgrades all replicas one Pod at a time, +starting from the one with the highest serial.
+The primary is always the last node to be upgraded.
+Rolling updates are configurable and can be either entirely automated
+(unsupervised) or requiring human intervention (supervised).
The upgrade keeps the CloudNativePG identity, without re-cloning the +data. Pods will be deleted and created again with the same PVCs and a new +image, if required.
+During the rolling update procedure, each service's endpoint moves to reflect the +cluster's status, so that applications can ignore the node that is being +updated.
+unsupervised)When primaryUpdateStrategy is set to unsupervised, the rolling update
+process is managed by Kubernetes and is entirely automated. Once the replicas
+have been upgraded, the selected primaryUpdateMethod operation will initiate
+on the primary. This is the default behavior.
The primaryUpdateMethod option accepts one of the following values:
restart: if possible, perform an automated restart of the pod where the
+ primary instance is running. Otherwise, the restart request is ignored and a
+ switchover issued. This is the default behavior.
switchover: a switchover operation is automatically performed, setting the
+ most aligned replica as the new target primary, and shutting down the former
+ primary pod.
There's no one-size-fits-all configuration for the update method, as that +depends on several factors like the actual workload of your database, the +requirements in terms of RPO and +RTO, whether your PostgreSQL architecture is shared +or shared nothing, and so on.
+Indeed, being PostgreSQL a primary/standby architecture database management
+system, the update process inevitably generates a downtime for your
+applications. One important aspect to consider for your context is the time it
+takes for your pod to download the new PostgreSQL container image, as that
+depends on your Kubernetes cluster settings and specifications. The
+switchover method makes sure that the promoted instance already runs the
+target image version of the container. The restart method instead might require
+to download the image from the origin registry after the primary pod has been
+shut down. It is up to you to determine whether, for your database, it is best
+to use restart or switchover as part of the rolling update procedure.
supervised)When primaryUpdateStrategy is set to supervised, the rolling update process
+is suspended immediately after all replicas have been upgraded.
This phase can only be completed with either a manual switchover or an in-place +restart. Keep in mind that image upgrades can not be applied with an in-place restart, +so a switchover is required in such cases.
+You can trigger a switchover with:
+kubectl cnpg promote [cluster] [new_primary]
+
+You can trigger a restart with:
+kubectl cnpg restart [cluster] [current_primary]
+
+You can find more information in the cnpg plugin page.
The examples show configuration files for setting up +your PostgreSQL cluster.
+Important
+These examples are for demonstration and experimentation +purposes. You can execute them on a personal Kubernetes cluster with Minikube +or Kind, as described in Quick start.
+Reference
+For a list of available options, see API reference.
+cluster-example.yaml
+ A basic example of a cluster.cluster-example-custom.yaml
+ A basic cluster that uses the default storage class and custom parameters for
+ the postgresql.conf and pg_hba.conf files.cluster-storage-class.yaml:
+ A basic cluster that uses a specified storage class of standard.cluster-pvc-template.yaml:
+ A basic cluster with an explicit persistent volume claim template.cluster-example-full.yaml:
+ A cluster that sets most of the available options.cluster-example-initdb-sql-refs.yaml:
+ A cluster example that executes a set of queries defined in a secret and a
+ ConfigMap right after the database is created.pg_hba configurationcluster-example-pg-hba.yaml:
+ A basic cluster that enables the user app to authenticate using certificates.cluster-example-projected-volume.yaml
+ A basic cluster with the existing Secret and ConfigMap mounted into Postgres
+ pod using projected volume mount.cluster-example-security-context.yaml
+ A cluster demonstrating how to customize both Pod and Container security contexts.
+ This is useful when working with Pod Security Standards or meeting specific security requirements.cluster-storage-class-with-backup.yaml
+ A cluster with backups configured.cluster-storage-class-with-backup.yaml
+applied and healthy.backup-example.yaml:
+ An example of a backup that runs against the previous sample.backup.barmanObjectStore with your minio parameters or your cloud solution.cluster-example-with-backup.yaml
+ A basic cluster with backups configured.backup.barmanObjectStore with your Scaleway parameters.cluster-example-with-backup-scaleway.yaml
+ A basic cluster with backups configured to work with Scaleway Object Storage..cluster-storage-class-with-backup.yaml
+applied and healthy, and a backup
+cluster-example-trigger-backup.yaml
+applied and completed.cluster-example-replica-from-backup-simple.yaml:
+ A replica cluster following a cluster with backup configured.cluster-example-with-volume-snapshot.yaml
+applied and healthy, and a volume snapshot
+backup-with-volume-snapshot.yaml
+applied and completed.cluster-example-replica-from-volume-snapshot.yaml:
+ A replica cluster following a cluster with volume snapshot configured.cluster-example.yaml
+applied and healthy.cluster-example-replica-streaming.yaml:
+ A replica cluster following cluster-example with streaming replication.postgis-example.yaml:
+ An example of a PostGIS cluster. See PostGIS for details.cluster-example-with-roles.yaml:
+ Declares a role with the managed stanza. Includes password management with
+ Kubernetes secrets.cluster-example-managed-services.yaml:
+ Declares a service with the managed stanza. Includes default service disabled and new
+ rw service template of LoadBalancer type defined.cluster-example-with-tablespaces.yamlbackup.barmanObjectStore with your minio parameters or your cloud solution.cluster-example-with-tablespaces-backup.yamlbootstrap.recovery.backup.name with the backup name.cluster-restore-with-tablespaces.yamlFor a list of available options, see API reference.
+pooler-external.yamlTwo test manifests contain everything needed to set up logical replication:
+cluster-example-logical-source.yamlSets up a cluster, cluster-example with some tables created in the app
+database, and, importantly, adds replication to the app user.
+A publication is created for the cluster on the app database: note that the
+publication will be reconciled only after the cluster's primary is up and
+running.
cluster-example-logical-destination.yamlSets up a cluster cluster-example-dest with:
externalClusters stanza. Note that it uses
+ the app role to connect, which assumes the source cluster grants it
+ replication privilege.schemaOnly enabledA subscription is created on the destination cluster: note that the subscription +will be reconciled only after the destination cluster's primary is up and +running.
+After both clusters have been reconciled, together with the publication and +subscription objects, you can verify that that tables in the source cluster, +and the data in them, have been replicated in the destination cluster
+In addition, there are some standalone example manifests:
+cluster-example.publication-example.yamlcluster-example.publication-example-objects.yamlcluster-example set up as source, with
+a publication pub-all. A cluster cluster-example-dest set up as a
+destination cluster, including the externalClusters stanza with
+connection parameters to the source cluster, including a role with
+replication privilege.subscription-example.yamlAll the above manifests create publications or subscriptions on the app
+database. The Database CRD offers a convenient way to create databases
+declaratively. With it, logical replication could be set up for arbitrary
+databases.
+Which brings us to the next section.
cluster-example.database-example.yamlcluster-example running Postgres 16database-example-icu.yamlScheduling, in Kubernetes, is the process responsible for placing a new pod on +the best node possible, based on several criteria.
+Kubernetes documentation
+Please refer to the +Kubernetes documentation +for more information on scheduling, including all the available policies. On +this page we assume you are familiar with concepts like affinity, +anti-affinity, node selectors, and so on.
+You can control how the CloudNativePG cluster's instances should be
+scheduled through the affinity
+section in the definition of the cluster, which supports:
Kubernetes provides mechanisms to control where pods are scheduled using +affinity and anti-affinity rules. These rules allow you to specify whether +a pod should be scheduled on particular nodes (affinity) or avoided on +specific nodes (anti-affinity) based on the workloads already running there. +This capability is technically referred to as inter-pod +affinity/anti-affinity.
+By default, CloudNativePG configures cluster instances to preferably be
+scheduled on different nodes, while pgBouncer instances might still run on
+the same nodes.
For example, given the following Cluster specification:
apiVersion: postgresql.cnpg.io/v1
+kind: Cluster
+metadata:
+ name: cluster-example
+spec:
+ instances: 3
+ imageName: ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie
+
+ affinity:
+ enablePodAntiAffinity: true # Default value
+ topologyKey: kubernetes.io/hostname # Default value
+ podAntiAffinityType: preferred # Default value
+
+ storage:
+ size: 1Gi
+
+The affinity configuration applied in the instance pods will be:
affinity:
+ podAntiAffinity:
+ preferredDuringSchedulingIgnoredDuringExecution:
+ - podAffinityTerm:
+ labelSelector:
+ matchExpressions:
+ - key: cnpg.io/cluster
+ operator: In
+ values:
+ - cluster-example
+ - key: cnpg.io/podRole
+ operator: In
+ values:
+ - instance
+ topologyKey: kubernetes.io/hostname
+ weight: 100
+
+With this setup, Kubernetes will prefer to schedule a 3-node PostgreSQL +cluster across three different nodes, assuming sufficient resources are +available.
+You can modify the default behavior by adjusting the settings mentioned above.
+For example, setting podAntiAffinityType to required will enforce
+requiredDuringSchedulingIgnoredDuringExecution instead of
+preferredDuringSchedulingIgnoredDuringExecution.
However, be aware that this strict requirement may cause pods to remain pending +if resources are insufficient—this is particularly relevant when using Cluster Autoscaler +for automated horizontal scaling in a Kubernetes cluster.
+Inter-pod Affinity and Anti-Affinity
+For more details, refer to the Kubernetes documentation.
+In cloud environments, you might consider using topology.kubernetes.io/zone
+as the topologyKey to ensure pods are distributed across different
+availability zones rather than just nodes. For more options, see
+Well-Known Labels, Annotations, and Taints.
If needed, you can disable the operator-generated anti-affinity policies by
+setting enablePodAntiAffinity to false.
For scenarios requiring more precise control, you can specify custom pod
+affinity or anti-affinity rules using the additionalPodAffinity and
+additionalPodAntiAffinity configuration attributes. These custom rules will
+be added to those generated by the operator, if enabled, or used directly if
+the operator-generated rules are disabled.
Note
+When using additionalPodAntiAffinity or additionalPodAffinity, you must
+provide the full podAntiAffinity or podAffinity structure expected by the
+Pod specification. The following YAML example demonstrates how to configure
+only one instance of PostgreSQL per worker node, regardless of which PostgreSQL
+cluster it belongs to:
additionalPodAntiAffinity:
+ requiredDuringSchedulingIgnoredDuringExecution:
+ - labelSelector:
+ matchExpressions:
+ - key: postgresql
+ operator: Exists
+ values: []
+ topologyKey: "kubernetes.io/hostname"
+
+nodeSelectorKubernetes allows nodeSelector to provide a list of labels (defined as
+key-value pairs) to select the nodes on which a pod can run. Specifically,
+the node must have each indicated key-value pair as labels for the
+pod to be scheduled and run.
Similarly, CloudNativePG consents you to define a nodeSelector in the
+affinity section, so that you can request a PostgreSQL cluster to run only
+on nodes that have those labels.
Kubernetes allows you to specify (through taints) whether a node should repel
+all pods not explicitly tolerating (through tolerations) their taints.
So, by setting a proper set of tolerations for a workload matching a specific
+node's taints, Kubernetes scheduler will now take into consideration the
+tainted node, while deciding on which node to schedule the workload.
+Tolerations can be configured for all the pods of a Cluster through the
+.spec.affinity.tolerations section, which accepts the usual Kubernetes syntax
+for tolerations.
Taints and Tolerations
+More information on taints and tolerations can be found in the +Kubernetes documentation.
+Important
+Before proceeding, please ensure you have read the +"Architecture" section of the documentation.
+While you can deploy PostgreSQL on Kubernetes in various ways, we recommend +following these essential principles for production environments:
+node-role.kubernetes.io/postgres label and taint,
+ as detailed in the Reserving Nodes for PostgreSQL Workloads
+ section.As explained in greater detail in the previous sections, CloudNativePG +provides the flexibility to configure pod anti-affinity, node selectors, and +tolerations.
+Below is a sample configuration to ensure that a PostgreSQL Cluster is
+deployed on postgres nodes, with its instances distributed across different
+nodes:
# <snip>
+ affinity:
+ enablePodAntiAffinity: true
+ topologyKey: kubernetes.io/hostname
+ podAntiAffinityType: required
+ nodeSelector:
+ node-role.kubernetes.io/postgres: ""
+ tolerations:
+ - key: node-role.kubernetes.io/postgres
+ operator: Exists
+ effect: NoSchedule
+ # <snip>
+
+Despite its simplicity, this setup ensures optimal distribution and isolation +of PostgreSQL workloads, leading to enhanced performance and reliability in +your production environment.
+ +' + escapeHtml(summary) +'
' + noResultsText + '
'); + } +} + +function doSearch () { + var query = document.getElementById('mkdocs-search-query').value; + if (query.length > min_search_length) { + if (!window.Worker) { + displayResults(search(query)); + } else { + searchWorker.postMessage({query: query}); + } + } else { + // Clear results for short queries + displayResults([]); + } +} + +function initSearch () { + var search_input = document.getElementById('mkdocs-search-query'); + if (search_input) { + search_input.addEventListener("keyup", doSearch); + } + var term = getSearchTermFromLocation(); + if (term) { + search_input.value = term; + doSearch(); + } +} + +function onWorkerMessage (e) { + if (e.data.allowSearch) { + initSearch(); + } else if (e.data.results) { + var results = e.data.results; + displayResults(results); + } else if (e.data.config) { + min_search_length = e.data.config.min_search_length-1; + } +} + +if (!window.Worker) { + console.log('Web Worker API not supported'); + // load index in main thread + $.getScript(joinUrl(base_url, "search/worker.js")).done(function () { + console.log('Loaded worker'); + init(); + window.postMessage = function (msg) { + onWorkerMessage({data: msg}); + }; + }).fail(function (jqxhr, settings, exception) { + console.error('Could not load worker.js'); + }); +} else { + // Wrap search in a web worker + var searchWorker = new Worker(joinUrl(base_url, "search/worker.js")); + searchWorker.postMessage({init: true}); + searchWorker.onmessage = onWorkerMessage; +} diff --git a/assets/documentation/1.28.0-rc1/search/search_index.json b/assets/documentation/1.28.0-rc1/search/search_index.json new file mode 100644 index 000000000..d03af5f66 --- /dev/null +++ b/assets/documentation/1.28.0-rc1/search/search_index.json @@ -0,0 +1 @@ +{"config":{"indexing":"full","lang":["en"],"min_search_length":3,"prebuild_index":false,"separator":"[\\s\\-]+"},"docs":[{"location":"","text":"CloudNativePG CloudNativePG (CNPG) is an open-source operator designed to manage PostgreSQL workloads on any supported Kubernetes cluster. It fosters cloud-neutrality through seamless deployment in private, public, hybrid, and multi-cloud environments via its distributed topology feature. Built around DevOps principles, CloudNativePG embraces declarative configuration and immutable infrastructure, ensuring reliability and automation in database management. At its core, CloudNativePG introduces a custom Kubernetes resource called Cluster , representing a PostgreSQL cluster with: A single primary instance for write operations. Optional replicas for High Availability and read scaling. These instances reside within a Kubernetes namespace, allowing applications to connect seamlessly using operator-managed services. Failovers and switchovers occur transparently, eliminating the need for manual intervention. For applications inside the Kubernetes cluster, CNPG provides a microservice database approach, enabling co-location of PostgreSQL clusters and applications in the same namespace for optimized access. For applications outside the cluster, CNPG offers flexible connectivity through service templates and LoadBalancer services for direct TCP exposure. Additionally, web applications can take advantage of the native connection pooler based on PgBouncer. CloudNativePG was originally built by EDB , then released open source under Apache License 2.0. The source code repository is in GitHub . Note Based on the Operator Capability Levels model , users can expect a \"Level V - Auto Pilot\" subset of capabilities from the CloudNativePG Operator. Supported Kubernetes distributions Each minor release of CloudNativePG is designed to work with a range of Kubernetes versions, usually the ones supported by the CNCF at the time the minor version was first released. Please refer to the \"Supported releases\" page for details. Container images The CloudNativePG community maintains container images for both the operator and PostgreSQL (the operand). Operator The CloudNativePG operator container images are available on the cloudnative-pg project's GitHub Container Registry in three different flavors: Debian 12 distroless Red Hat UBI 9 micro (suffix -ubi9 ) Red Hat UBI images are primarily intended for OLM consumption. All container images are signed and include SBOM and provenance attestations, provided separately for each architecture. Operands The CloudNativePG project provides and maintains PostgreSQL operand container images, built on top of the official Debian slim base image , for both linux/amd64 and linux/arm64 architectures. Images are published for all Debian supported releases ( stable , oldstable ) and for PostgreSQL versions supported by PGDG . They are distributed via the postgres-containers GitHub Container Registry . Three image flavors are available, each extending the previous one: minimal standard system (deprecated) Important The system images are deprecated and will be removed once in-core Barman Cloud support is phased out. They remain usable for now, but you may want to plan a future migration to minimal or standard images with the Barman Cloud plugin, or another supported backup solution. By default, this version of CloudNativePG deploys ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie . All images are signed and shipped with SBOM and provenance attestations. Weekly automated builds ensure that critical vulnerabilities (CVEs) are promptly fixed. For details and support, see the postgres-containers project . Main features Direct integration with the Kubernetes API server for High Availability, eliminating the need for external tools. Self-healing capabilities, including: Automated failover by promoting the most aligned replica. Automatic recreation of failed replicas. Planned switchover of the primary instance by promoting a selected replica. Declarative management of key PostgreSQL configurations, including: PostgreSQL settings. Roles, users, and groups. Databases, extensions, schemas, foreign data wrappers (FDW), and foreign servers. Tablespaces (including temporary tablespaces). Flexible instance definition, supporting any number of instances (minimum 1 primary server). Scale-up/down capabilities to dynamically adjust cluster size. Read-Write and Read-Only Services, ensuring applications connect correctly: Read-Write Service : Routes connections to the primary server. Read-Only Service : Distributes connections among replicas for read workloads. Support for quorum-based and priority-based PostgreSQL Synchronous Replication. Replica clusters enabling PostgreSQL distributed topologies across multiple Kubernetes clusters (private, public, hybrid, and multi-cloud). Delayed Replica clusters for point-in-time access to historical data. Persistent volume management, including: Support for Local Persistent Volumes with PVC templates. Reuse of Persistent Volumes storage in Pods. Separate volumes for WAL files and tablespaces. Backup and Recovery via CNPG-I Plugins: Pluggable architecture for continuous physical backup and recovery. Hot and cold base backups. WAL archiving. Full and Point-In-Time Recovery (PITR). Scheduled and on-demand backups. Backup from standbys to reduce primary load. Community-Supported Barman Cloud Plugin: WAL archiving to object stores with support for full/PITR recovery. Retention policies based on configurable recovery windows. Supported as a CNPG-I plugin (recommended approach). Native Backup Methods: Continuous backup and full/PITR recovery via volume snapshots (if supported by the storage class). Native integration with Barman Cloud for object store backups via .spec.backup.barmanObjectStore ( deprecated since v1.26 ). Offline in-place major upgrades of PostgreSQL Offline and online import of PostgreSQL databases, including major upgrades: Offline Import : Direct restore from existing databases. Online Import : PostgreSQL native logical replication via the Subscription resource. High Availability physical replication slots, including synchronization of user-defined replication slots and logical decoding failover. Parallel WAL archiving and restore, ensuring high-performance data synchronization in high-write environments. TLS support, including: Secure connections and client certificate authentication. Custom TLS certificates (integrated with cert-manager ). Startup and readiness probes, including replica probes based on desired lag from the primary. Declarative rolling updates for: PostgreSQL minor versions. Operator upgrades (in-place or rolling updates). Standard output logging of PostgreSQL error messages in JSON format for easier integration with log aggregation tools. Prometheus-compatible metrics exporter ( metrics port 9187) for custom monitoring. cnpg plugin for kubectl to simplify cluster operations. Cluster hibernation for resource efficiency in inactive states. Fencing of PostgreSQL clusters (full cluster or subset) to isolate instances when needed. Connection pooling with PgBouncer for improved database efficiency. OLM (Operator Lifecycle Manager) installation support for streamlined deployments. Multi-arch container images, including Software Bill of Materials (SBOM) and provenance attestations for security compliance. Info CloudNativePG does not use StatefulSet s for managing data persistence. Instead, it directly manages Persistent Volume Claims (PVCs). See \"Custom Pod Controller\" for more details. About this guide Follow the instructions in the \"Quickstart\" to test CloudNativePG on a local Kubernetes cluster using Kind, or Minikube. In case you are not familiar with some basic terminology on Kubernetes and PostgreSQL, please consult the \"Before you start\" section . The CloudNativePG documentation is licensed under a Creative Commons Attribution 4.0 International License. Postgres, PostgreSQL, and the Slonik Logo are trademarks or registered trademarks of the PostgreSQL Community Association of Canada, and used with their permission. CloudNativePG is a Cloud Native Computing Foundation Sandbox project .","title":"CloudNativePG"},{"location":"#cloudnativepg","text":"CloudNativePG (CNPG) is an open-source operator designed to manage PostgreSQL workloads on any supported Kubernetes cluster. It fosters cloud-neutrality through seamless deployment in private, public, hybrid, and multi-cloud environments via its distributed topology feature. Built around DevOps principles, CloudNativePG embraces declarative configuration and immutable infrastructure, ensuring reliability and automation in database management. At its core, CloudNativePG introduces a custom Kubernetes resource called Cluster , representing a PostgreSQL cluster with: A single primary instance for write operations. Optional replicas for High Availability and read scaling. These instances reside within a Kubernetes namespace, allowing applications to connect seamlessly using operator-managed services. Failovers and switchovers occur transparently, eliminating the need for manual intervention. For applications inside the Kubernetes cluster, CNPG provides a microservice database approach, enabling co-location of PostgreSQL clusters and applications in the same namespace for optimized access. For applications outside the cluster, CNPG offers flexible connectivity through service templates and LoadBalancer services for direct TCP exposure. Additionally, web applications can take advantage of the native connection pooler based on PgBouncer. CloudNativePG was originally built by EDB , then released open source under Apache License 2.0. The source code repository is in GitHub . Note Based on the Operator Capability Levels model , users can expect a \"Level V - Auto Pilot\" subset of capabilities from the CloudNativePG Operator.","title":"CloudNativePG"},{"location":"#supported-kubernetes-distributions","text":"Each minor release of CloudNativePG is designed to work with a range of Kubernetes versions, usually the ones supported by the CNCF at the time the minor version was first released. Please refer to the \"Supported releases\" page for details.","title":"Supported Kubernetes distributions"},{"location":"#container-images","text":"The CloudNativePG community maintains container images for both the operator and PostgreSQL (the operand).","title":"Container images"},{"location":"#operator","text":"The CloudNativePG operator container images are available on the cloudnative-pg project's GitHub Container Registry in three different flavors: Debian 12 distroless Red Hat UBI 9 micro (suffix -ubi9 ) Red Hat UBI images are primarily intended for OLM consumption. All container images are signed and include SBOM and provenance attestations, provided separately for each architecture.","title":"Operator"},{"location":"#operands","text":"The CloudNativePG project provides and maintains PostgreSQL operand container images, built on top of the official Debian slim base image , for both linux/amd64 and linux/arm64 architectures. Images are published for all Debian supported releases ( stable , oldstable ) and for PostgreSQL versions supported by PGDG . They are distributed via the postgres-containers GitHub Container Registry . Three image flavors are available, each extending the previous one: minimal standard system (deprecated) Important The system images are deprecated and will be removed once in-core Barman Cloud support is phased out. They remain usable for now, but you may want to plan a future migration to minimal or standard images with the Barman Cloud plugin, or another supported backup solution. By default, this version of CloudNativePG deploys ghcr.io/cloudnative-pg/postgresql:18.0-system-trixie . All images are signed and shipped with SBOM and provenance attestations. Weekly automated builds ensure that critical vulnerabilities (CVEs) are promptly fixed. For details and support, see the postgres-containers project .","title":"Operands"},{"location":"#main-features","text":"Direct integration with the Kubernetes API server for High Availability, eliminating the need for external tools. Self-healing capabilities, including: Automated failover by promoting the most aligned replica. Automatic recreation of failed replicas. Planned switchover of the primary instance by promoting a selected replica. Declarative management of key PostgreSQL configurations, including: PostgreSQL settings. Roles, users, and groups. Databases, extensions, schemas, foreign data wrappers (FDW), and foreign servers. Tablespaces (including temporary tablespaces). Flexible instance definition, supporting any number of instances (minimum 1 primary server). Scale-up/down capabilities to dynamically adjust cluster size. Read-Write and Read-Only Services, ensuring applications connect correctly: Read-Write Service : Routes connections to the primary server. Read-Only Service : Distributes connections among replicas for read workloads. Support for quorum-based and priority-based PostgreSQL Synchronous Replication. Replica clusters enabling PostgreSQL distributed topologies across multiple Kubernetes clusters (private, public, hybrid, and multi-cloud). Delayed Replica clusters for point-in-time access to historical data. Persistent volume management, including: Support for Local Persistent Volumes with PVC templates. Reuse of Persistent Volumes storage in Pods. Separate volumes for WAL files and tablespaces. Backup and Recovery via CNPG-I Plugins: Pluggable architecture for continuous physical backup and recovery. Hot and cold base backups. WAL archiving. Full and Point-In-Time Recovery (PITR). Scheduled and on-demand backups. Backup from standbys to reduce primary load. Community-Supported Barman Cloud Plugin: WAL archiving to object stores with support for full/PITR recovery. Retention policies based on configurable recovery windows. Supported as a CNPG-I plugin (recommended approach). Native Backup Methods: Continuous backup and full/PITR recovery via volume snapshots (if supported by the storage class). Native integration with Barman Cloud for object store backups via .spec.backup.barmanObjectStore ( deprecated since v1.26 ). Offline in-place major upgrades of PostgreSQL Offline and online import of PostgreSQL databases, including major upgrades: Offline Import : Direct restore from existing databases. Online Import : PostgreSQL native logical replication via the Subscription resource. High Availability physical replication slots, including synchronization of user-defined replication slots and logical decoding failover. Parallel WAL archiving and restore, ensuring high-performance data synchronization in high-write environments. TLS support, including: Secure connections and client certificate authentication. Custom TLS certificates (integrated with cert-manager ). Startup and readiness probes, including replica probes based on desired lag from the primary. Declarative rolling updates for: PostgreSQL minor versions. Operator upgrades (in-place or rolling updates). Standard output logging of PostgreSQL error messages in JSON format for easier integration with log aggregation tools. Prometheus-compatible metrics exporter ( metrics port 9187) for custom monitoring. cnpg plugin for kubectl to simplify cluster operations. Cluster hibernation for resource efficiency in inactive states. Fencing of PostgreSQL clusters (full cluster or subset) to isolate instances when needed. Connection pooling with PgBouncer for improved database efficiency. OLM (Operator Lifecycle Manager) installation support for streamlined deployments. Multi-arch container images, including Software Bill of Materials (SBOM) and provenance attestations for security compliance. Info CloudNativePG does not use StatefulSet s for managing data persistence. Instead, it directly manages Persistent Volume Claims (PVCs). See \"Custom Pod Controller\" for more details.","title":"Main features"},{"location":"#about-this-guide","text":"Follow the instructions in the \"Quickstart\" to test CloudNativePG on a local Kubernetes cluster using Kind, or Minikube. In case you are not familiar with some basic terminology on Kubernetes and PostgreSQL, please consult the \"Before you start\" section . The CloudNativePG documentation is licensed under a Creative Commons Attribution 4.0 International License. Postgres, PostgreSQL, and the Slonik Logo are trademarks or registered trademarks of the PostgreSQL Community Association of Canada, and used with their permission. CloudNativePG is a Cloud Native Computing Foundation Sandbox project .","title":"About this guide"},{"location":"applications/","text":"Connecting from an application Applications are supposed to work with the services created by CloudNativePG in the same Kubernetes cluster. For more information on services and how to manage them, please refer to the \"Service management\" section. Hint It is highly recommended using those services in your applications, and avoiding connecting directly to a specific PostgreSQL instance, as the latter can change during the cluster lifetime. You can use these services in your applications through: DNS resolution environment variables For the credentials to connect to PostgreSQL, you can use the secrets generated by the operator. Connection Pooling Please refer to the \"Connection Pooling\" section for information about how to take advantage of PgBouncer as a connection pooler, and create an access layer between your applications and the PostgreSQL clusters. DNS resolution You can use the Kubernetes DNS service to point to a given server. The Kubernetes DNS service is required by the operator. You can do that by using the name of the service if the application is deployed in the same namespace as the PostgreSQL cluster. In case the PostgreSQL cluster resides in a different namespace, you can use the full qualifier: service-name.namespace-name . DNS is the preferred and recommended discovery method. Environment variables If you deploy your application in the same namespace that contains the PostgreSQL cluster, you can also use environment variables to connect to the database. For example, suppose that your PostgreSQL cluster is called pg-database , you can use the following environment variables in your applications: PG_DATABASE_R_SERVICE_HOST : the IP address of the service pointing to all the PostgreSQL instances for read-only workloads PG_DATABASE_RO_SERVICE_HOST : the IP address of the service pointing to all hot-standby replicas of the cluster PG_DATABASE_RW_SERVICE_HOST : the IP address of the service pointing to the primary instance of the cluster Secrets The PostgreSQL operator will generate up to two basic-auth type secrets for every PostgreSQL cluster it deploys: [cluster name]-app (unless you have provided an existing secret through .spec.bootstrap.initdb.secret.name ) [cluster name]-superuser (if .spec.enableSuperuserAccess is set to true and you have not specified a different secret using .spec.superuserSecret ) Each secret contain the following: username password hostname to the RW service port number database name a working .pgpass file uri jdbc-uri fqdn-uri fqdn-jdbc-uri The FQDN to be used in the URIs is calculated using the Kubernetes cluster domain specified in the KUBERNETES_CLUSTER_DOMAIN configuration parameter. See the operator configuration documentation for more information about that. The -app credentials are the ones that should be used by applications connecting to the PostgreSQL cluster, and correspond to the user owning the database. The -superuser ones are supposed to be used only for administrative purposes, and correspond to the postgres user. Important Superuser access over the network is disabled by default.","title":"Connecting from an application"},{"location":"applications/#connecting-from-an-application","text":"Applications are supposed to work with the services created by CloudNativePG in the same Kubernetes cluster. For more information on services and how to manage them, please refer to the \"Service management\" section. Hint It is highly recommended using those services in your applications, and avoiding connecting directly to a specific PostgreSQL instance, as the latter can change during the cluster lifetime. You can use these services in your applications through: DNS resolution environment variables For the credentials to connect to PostgreSQL, you can use the secrets generated by the operator. Connection Pooling Please refer to the \"Connection Pooling\" section for information about how to take advantage of PgBouncer as a connection pooler, and create an access layer between your applications and the PostgreSQL clusters.","title":"Connecting from an application"},{"location":"applications/#dns-resolution","text":"You can use the Kubernetes DNS service to point to a given server. The Kubernetes DNS service is required by the operator. You can do that by using the name of the service if the application is deployed in the same namespace as the PostgreSQL cluster. In case the PostgreSQL cluster resides in a different namespace, you can use the full qualifier: service-name.namespace-name . DNS is the preferred and recommended discovery method.","title":"DNS resolution"},{"location":"applications/#environment-variables","text":"If you deploy your application in the same namespace that contains the PostgreSQL cluster, you can also use environment variables to connect to the database. For example, suppose that your PostgreSQL cluster is called pg-database , you can use the following environment variables in your applications: PG_DATABASE_R_SERVICE_HOST : the IP address of the service pointing to all the PostgreSQL instances for read-only workloads PG_DATABASE_RO_SERVICE_HOST : the IP address of the service pointing to all hot-standby replicas of the cluster PG_DATABASE_RW_SERVICE_HOST : the IP address of the service pointing to the primary instance of the cluster","title":"Environment variables"},{"location":"applications/#secrets","text":"The PostgreSQL operator will generate up to two basic-auth type secrets for every PostgreSQL cluster it deploys: [cluster name]-app (unless you have provided an existing secret through .spec.bootstrap.initdb.secret.name ) [cluster name]-superuser (if .spec.enableSuperuserAccess is set to true and you have not specified a different secret using .spec.superuserSecret ) Each secret contain the following: username password hostname to the RW service port number database name a working .pgpass file uri jdbc-uri fqdn-uri fqdn-jdbc-uri The FQDN to be used in the URIs is calculated using the Kubernetes cluster domain specified in the KUBERNETES_CLUSTER_DOMAIN configuration parameter. See the operator configuration documentation for more information about that. The -app credentials are the ones that should be used by applications connecting to the PostgreSQL cluster, and correspond to the user owning the database. The -superuser ones are supposed to be used only for administrative purposes, and correspond to the postgres user. Important Superuser access over the network is disabled by default.","title":"Secrets"},{"location":"architecture/","text":"Architecture Hint For a deeper understanding, we recommend reading our article on the CNCF blog post titled \"Recommended Architectures for PostgreSQL in Kubernetes\" , which provides valuable insights into best practices and design considerations for PostgreSQL deployments in Kubernetes. This documentation page provides an overview of the key architectural considerations for implementing a robust business continuity strategy when deploying PostgreSQL in Kubernetes. These considerations include: Deployments in stretched vs. non-stretched clusters : Evaluating the differences between deploying in stretched clusters (across 3 or more availability zones) versus non-stretched clusters (within a single availability zone). Reservation of postgres worker nodes : Isolating PostgreSQL workloads by dedicating specific worker nodes to postgres tasks, ensuring optimal performance and minimizing interference from other workloads. PostgreSQL architectures within a single Kubernetes cluster : Designing effective PostgreSQL deployments within a single Kubernetes cluster to meet high availability and performance requirements. PostgreSQL architectures across Kubernetes clusters for disaster recovery : Planning and implementing PostgreSQL architectures that span multiple Kubernetes clusters to provide comprehensive disaster recovery capabilities. Synchronizing the state PostgreSQL is a database management system and, as such, it needs to be treated as a stateful workload in Kubernetes. While stateless applications mainly use traffic redirection to achieve High Availability (HA) and Disaster Recovery (DR), in the case of a database, state must be replicated in multiple locations, preferably in a continuous and instantaneous way, by adopting either of the following two strategies: storage-level replication , normally persistent volumes application-level replication , in this specific case PostgreSQL CloudNativePG relies on application-level replication, for a simple reason: the PostgreSQL database management system comes with robust and reliable built-in physical replication capabilities based on Write Ahead Log (WAL) shipping , which have been used in production by millions of users all over the world for over a decade. PostgreSQL supports both asynchronous and synchronous streaming replication over the network, as well as asynchronous file-based log shipping (normally used as a fallback option, for example, to store WAL files in an object store). Replicas are usually called standby servers and can also be used for read-only workloads, thanks to the Hot Standby feature. Important We recommend against storage-level replication with PostgreSQL , although CloudNativePG allows you to adopt that strategy. For more information, please refer to the talk given by Chris Milsted and Gabriele Bartolini at KubeCon NA 2022 entitled \"Data On Kubernetes, Deploying And Running PostgreSQL And Patterns For Databases In a Kubernetes Cluster\" where this topic was covered in detail. Kubernetes architecture Kubernetes natively provides the possibility to span separate physical locations - also known as data centers, failure zones, or more frequently availability zones - connected to each other via redundant, low-latency, private network connectivity. Being a distributed system, the recommended minimum number of availability zones for a Kubernetes cluster is three (3), in order to make the control plane resilient to the failure of a single zone. For details, please refer to \"Running in multiple zones\" . This means that each data center is active at any time and can run workloads simultaneously. Note Most of the public Cloud Providers' managed Kubernetes services already provide 3 or more availability zones in each region. Multi-availability zone Kubernetes clusters The multi-availability zone Kubernetes architecture with three (3) or more zones is the one that we recommend for PostgreSQL usage. This scenario is typical of Kubernetes services managed by Cloud Providers. Such an architecture enables the CloudNativePG operator to control the full lifecycle of a Cluster resource across the zones within a single Kubernetes cluster, by treating all the availability zones as active: this includes, among other operations, scheduling the workloads in a declarative manner (based on affinity rules, tolerations and node selectors), automated failover, self-healing, and updates. All will work seamlessly across the zones in a single Kubernetes cluster. Please refer to the \"PostgreSQL architecture\" section below for details on how you can design your PostgreSQL clusters within the same Kubernetes cluster through shared-nothing deployments at the storage, worker node, and availability zone levels. Additionally, you can leverage Kubernetes clusters to deploy distributed PostgreSQL topologies hosting \"passive\" PostgreSQL replica clusters in different regions and managing them via declarative configuration. This setup is ideal for disaster recovery (DR), read-only operations, or cross-region availability. Important Each operator deployment can only manage operations within its local Kubernetes cluster. For operations across Kubernetes clusters, such as controlled switchover or unexpected failover, coordination must be handled manually (through GitOps, for example) or by using a higher-level cluster management tool. Single availability zone Kubernetes clusters If your Kubernetes cluster has only one availability zone, CloudNativePG still provides you with a lot of features to improve HA and DR outcomes for your PostgreSQL databases, pushing the single point of failure (SPoF) to the level of the zone as much as possible - i.e. the zone must have an outage before your CloudNativePG clusters suffer a failure. This scenario is typical of self-managed on-premise Kubernetes clusters, where only one data center is available. Single availability zone Kubernetes clusters are the only viable option when only two data centers are available within reach of a low-latency connection (typically in the same metropolitan area). Having only two zones prevents the creation of a multi-availability zone Kubernetes cluster, which requires a minimum of three zones. As a result, users must create two separate Kubernetes clusters in an active/passive configuration, with the second cluster primarily used for Disaster Recovery (see the replica cluster feature ). Hint If you are at an early stage of your Kubernetes journey, please share this document with your infrastructure team. The two data centers setup might be simply the result of a \"lift-and-shift\" transition to Kubernetes from a traditional bare-metal or VM based infrastructure, and the benefits that Kubernetes offers in a 3+ zone scenario might not have been known, or addressed at the time the infrastructure architecture was designed. Ultimately, a third physical location connected to the other two might represent a valid option to consider for organization, as it reduces the overall costs of the infrastructure by moving the day-to-day complexity from the application level down to the physical infrastructure level. Please refer to the \"PostgreSQL architecture\" section below for details on how you can design your PostgreSQL clusters within your single availability zone Kubernetes cluster through shared-nothing deployments at the storage and worker node levels only. For HA, in such a scenario it becomes even more important that the PostgreSQL instances be located on different worker nodes and do not share the same storage. For DR, you can push the SPoF above the single zone, by using additional Kubernetes clusters to define a distributed topology hosting \"passive\" PostgreSQL replica clusters . As with other Kubernetes workloads in this scenario, promotion of a Kubernetes cluster as primary must be done manually. Through the replica cluster feature , you can define a distributed PostgreSQL topology and coordinate a controlled switchover between data centers by first demoting the primary cluster and then promoting the replica cluster, without the need to re-clone the former primary. While failover is now fully declarative, automated failover across Kubernetes clusters is not within CloudNativePG's scope, as the operator can only function within a single Kubernetes cluster. Important CloudNativePG provides all the necessary primitives and probes to coordinate PostgreSQL active/passive topologies across different Kubernetes clusters through a higher-level operator or management tool. Reserving nodes for PostgreSQL workloads Whether you're operating in a multi-availability zone environment or, more critically, within a single availability zone, we strongly recommend isolating PostgreSQL workloads by dedicating specific worker nodes exclusively to postgres in production. A Kubernetes worker node dedicated to running PostgreSQL workloads is referred to as a Postgres node or postgres node. This approach ensures optimal performance and resource allocation for your database operations. Hint As a general rule of thumb, deploy Postgres nodes in multiples of three\u2014ideally with one node per availability zone. Three nodes is an optimal number because it ensures that a PostgreSQL cluster with three instances (one primary and two standby replicas) is distributed across different nodes, enhancing fault tolerance and availability. In Kubernetes, this can be achieved using node labels and taints in a declarative manner, aligning with Infrastructure as Code (IaC) practices: labels ensure that a node is capable of running postgres workloads, while taints help prevent any non- postgres workloads from being scheduled on that node. Important This methodology is the most straightforward way to ensure that PostgreSQL workloads are isolated from other workloads in terms of both computing resources and, when using locally attached disks, storage. While different PostgreSQL clusters may share the same node, you can take this a step further by using labels and taints to ensure that a node is dedicated to a single instance of a specific Cluster . Proposed node label CloudNativePG recommends using the node-role.kubernetes.io/postgres label. Since this is a reserved label ( *.kubernetes.io ), it can only be applied after a worker node is created. To assign the postgres label to a node, use the following command: kubectl label node