Data Snapshots and Recovery
GridGain can create snapshots of the data stored across the cluster. You can later use a snapshot to recover the cluster to the state recorded in it.
Creating Full Snapshots
To create a full snapshot, use the cluster snapshot create CLI command. In the command, you can specify the list of fully qualified table names to create snapshots of, or specify the --all option to create a snapshot of all tables. For example:
cluster snapshot create --type=full --tables=PERSON
The command above creates a snapshot of the Person table in the PUBLIC schema.
Creating Incremental Snapshots
An incremental snapshot contains only the changes made since the last full or incremental snapshot. The first snapshot in a chain must be a full snapshot, but all subsequent ones can be incremental. When you create an incremental snapshot, the latest valid snapshot is found for the tables you have specified, and the new snapshot is based on it.
Here is how you can create an incremental snapshot based on the full snapshot created above:
cluster snapshot create --type=incremental --tables=PERSON
You cannot add tables when creating an incremental snapshot. It must include the same tables as the base snapshot it builds on.
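The same-tables rule can be checked before running the command. The sketch below is illustrative only: `check_incremental_tables` is a hypothetical helper, not part of GridGain, and it assumes you already know which tables the base snapshot contains and that table names are compared exactly as written.

```python
def check_incremental_tables(base_tables, requested_tables):
    """Compare the tables requested for an incremental snapshot with its base.

    Hypothetical helper, not a GridGain API. Returns a pair of sorted lists:
    tables present in the base but not requested, and tables requested but
    absent from the base. Both lists are empty when the two sets match.
    """
    base = set(base_tables)
    requested = set(requested_tables)
    return sorted(base - requested), sorted(requested - base)


# Example: the base snapshot holds PERSON only, but CITY is requested as well.
missing, extra = check_incremental_tables(["PERSON"], ["PERSON", "CITY"])
if missing or extra:
    print(f"Table sets differ: missing={missing}, extra={extra}")
```

If either list is non-empty, the request does not match the base snapshot, and a new full snapshot is probably the appropriate next step.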
Creating Snapshots in the Past
You can also create a snapshot of the cluster state at a specific point in the past. For example:
cluster snapshot create --type=full --timestamp=2024-09-10T10:53:00+01:00 --all
The timestamp must be specified in ISO 8601 format.
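A timestamp in the same shape as the example above can be produced with Python's standard library. This is a minimal sketch: `snapshot_timestamp` is a hypothetical helper name, and dropping fractional seconds is an assumption made only to match the example, not a documented requirement.

```python
from datetime import datetime, timedelta, timezone


def snapshot_timestamp(moment: datetime) -> str:
    """Format a timezone-aware datetime like 2024-09-10T10:53:00+01:00.

    Hypothetical helper, not a GridGain API. Naive datetimes are rejected
    because they carry no UTC offset.
    """
    if moment.tzinfo is None:
        raise ValueError("moment must be timezone-aware")
    # Drop microseconds (an assumption) so the output matches the example.
    return moment.replace(microsecond=0).isoformat()


# Rebuild the timestamp used in the command above (UTC+01:00).
ts = snapshot_timestamp(
    datetime(2024, 9, 10, 10, 53, tzinfo=timezone(timedelta(hours=1)))
)
print(f"cluster snapshot create --type=full --timestamp={ts} --all")
```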
Restoring Snapshots
To restore snapshots, use the cluster snapshot restore command.
To make sure your snapshot is restored correctly, follow these guidelines:
- Make sure that the cluster topology is the same as the one the snapshot was taken on.
- Stop traffic to the cluster during restoration to avoid possible inconsistencies and failed operations.
When you are ready to restore data to the cluster, run the restore command. For example:
cluster snapshot restore --id=112646727522648064
The command above restores all tables in the snapshot with the specified ID. You can also choose to only restore specific tables stored in the snapshot, instead of all of them. In this case, specify the fully qualified table names of the tables to restore, for example:
cluster snapshot restore --id=112646727522648064 --tables=PERSON
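When restores are scripted, the command line can be assembled from the snapshot ID and an optional table list. The sketch below only builds the string and does not run the CLI. `restore_command` is a hypothetical helper, and joining several table names with commas is an assumption, since the examples here show a single table only.

```python
def restore_command(snapshot_id, tables=None):
    """Build a `cluster snapshot restore` command line as a string.

    Hypothetical helper, not a GridGain API. When `tables` is empty or
    None, the command restores every table stored in the snapshot.
    """
    parts = ["cluster", "snapshot", "restore", f"--id={snapshot_id}"]
    if tables:
        # Assumption: multiple table names are separated by commas.
        parts.append("--tables=" + ",".join(tables))
    return " ".join(parts)


print(restore_command(112646727522648064))
print(restore_command(112646727522648064, ["PERSON"]))
```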
Checking Snapshot Status
You can check snapshot status by using the cluster snapshot status command. By default, this command provides information about all snapshots in the cluster.
cluster snapshot status
You can narrow the information down by providing the snapshot ID. If you do, you can also use the --all-nodes option to see information about the snapshot on each node in the cluster. For example:
cluster snapshot status --id=112646727522648064 --all-nodes
The command above returns information about all operations on the specified snapshot, broken down by node.
The following information is provided:
Column | Description |
---|---|
Operation ID | The ID of the operation. For create operations, this is the snapshot ID. For delete operations, this is the ID of the delete operation. |
Start time | Time when the operation started, in UNIX time. |
Operation | The operation performed. |
Status | Current operation status. Possible values: |
Target Snapshot ID | The snapshot the operation was performed against. |
Base Snapshot ID | For incremental snapshots, the ID of the snapshot this snapshot is based on. |
Description | Operation description. |
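The Start time value is easier to read once converted to a calendar date. The table does not say whether the value is in seconds or milliseconds, so the sketch below takes the unit as a parameter. `start_time_to_utc` is a hypothetical helper, and the sample value is made up for illustration rather than taken from real command output.

```python
from datetime import datetime, timezone


def start_time_to_utc(value, unit="ms"):
    """Convert a UNIX timestamp to a timezone-aware UTC datetime.

    Hypothetical helper, not a GridGain API. `unit` must be "s" or "ms";
    which one applies to the Start time column is an assumption to verify
    against real output.
    """
    if unit not in ("s", "ms"):
        raise ValueError("unit must be 's' or 'ms'")
    seconds = value / 1000 if unit == "ms" else value
    return datetime.fromtimestamp(seconds, tz=timezone.utc)


# Illustrative value only, interpreted as seconds since the epoch.
print(start_time_to_utc(1725961980, unit="s").isoformat())
```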