Snapshots in ZFS (ZOL)

Posted on January 22, 2017 • 5 minutes • 1040 words

One of the impressive tools in ZOL (ZFS On Linux) is snapshots. Snapshots are a way to make a ‘instant backup’ of the files on the system. The name backup, is not completely correct. If the system dies, the snapshots won’t save you, since snapshots are on the same storage as the data, they don’t eat any unneeded extra space.

A snapshot is the state of the pool/dataset when you created the snapshot. In ZFS snapshots are cheap and fast, so its smart to use the feature to your advantige. The snapshot will remember how the files looked when you create the snapshot, and will only require extra space when changes to those files are made.

How to use

Creating a snapshot :

zfs snapshot pool@anythingyouwant
zfs snapshot pool/dataset@anythingyouwant

Destroy a snapshot :

zfs destroy pool@snapname
zfs destroy pool/dataset@snapname

**Rename **a snapshot :

zfs rename pool/dataset@name pool/dataset@new_name

**List **snapshots :

zfs list -t snapshot 
zfs list -t snapshot -r pool/dataset

**Rollback **from a snapshot :

zfs rollback pool/dataset@name

note : take care when doing a rollback.

Demo

Lets see this in a simple example. I first created a demo filesystem.__
__

zfs create tank/demo

Lets create a first snapshot on this empty system :

zfs snapshot tank/demo@initial

Getting a list of the current snapshots : _zfs list -t snapshot _There is an option to show snapshots on _zfs list _but thats makes it even more cloudy to work.

# zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
tank/demo@initial                                      0      -  35.9K  -

Now to generate some data, in ZFS (most COW FS) _fallocate _won’t work. So I use a fancy trick with head but in essence this just generates a file of approx. 10M.

head -c 10M </dev/urandom >data
head -c 10M </dev/urandom >data2
head -c 10M </dev/urandom >data3

Nothing special going on here :

# ls -lh
total 31M
-rw-r--r-- 1 root root 10M Jan 22 19:07 data
-rw-r--r-- 1 root root 10M Jan 22 19:07 data2
-rw-r--r-- 1 root root 10M Jan 22 19:07 data3

# zfs list
NAME                     USED  AVAIL  REFER  MOUNTPOINT
tank                    9.96T  11.1T  3.48G  /tank
tank/demo               30.1M  11.1T  30.0M  /tank/demo

As expected, when you see zfs list there is about 30M in use. Now since this is data that was not in our original (empty) set. The snapshot should not store any changes.

# zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
tank/demo@initial                                  20.2K      -  35.9K  -

note : the 20.2k are meta-data changes.

Now its time to take a snapshot on the important data we just generated :

# zfs snapshot tank/demo@today

# zfs list -t snapshot
NAME USED AVAIL REFER MOUNTPOINT
tank/demo@initial 20.2K - 35.9K -
tank/demo@today 0 - 30.0M -

Now lets try snapshots, I did a move, a deletion, a overwrite (update) and a new file creation.

mv data data_moved
rm -rf data2
cp data_moved data3
head -c 10M </dev/urandom >data4


# zfs list -t snapshot
NAME USED AVAIL REFER MOUNTPOINT
tank/demo@initial 20.2K - 35.9K -
tank/demo@today 20.0M - 30.0M -

Exactly what we expected, the snapshot took a “backup” of data2 and data3 which where removed and overwritten, since data1 was not changed or removed, it does not occupy extra space and data4 is a new file, so it was not there during creation of the snapshot.

Now how do we get the data back ? There are two options, first for small issue’s like accidental removals, you can use the hidden .zfs directory, for larger issue’s you can mount the snapshot on a seperate folder. In every directory on ZFS there is a hidden folder _.zfs. _Note that even ls -la will not pick that directory up.

# ls -ls .zfs/snapshot/
total 1
0 dr-xr-xr-x 1 root root 0 Jan 22 19:25 initial
1 drwxr-xr-x 2 root root 5 Jan 22 19:07 today

# ls -ls .zfs/snapshot/today/
total 30729
10243 -rw-r--r-- 1 root root 10485760 Jan 22 19:07 data
10243 -rw-r--r-- 1 root root 10485760 Jan 22 19:07 data2
10243 -rw-r--r-- 1 root root 10485760 Jan 22 19:07 data3

Voila, there are our data files, prior to removal/overwrite/rename. Note that with this method the snapshot is auto-mounted to /tank/demo/.zfs/snapshot/today as you might have guessed, you can also manually mount snapshots to a location. This can be done in the ’normal’ Linux way :

# mkdir today_snapshot
# mount -t zfs tank/demo@today today_snapshot

# ls -l today_snapshot/
total 30729
-rw-r--r-- 1 root root 10485760 Jan 22 19:07 data
-rw-r--r-- 1 root root 10485760 Jan 22 19:07 data2
-rw-r--r-- 1 root root 10485760 Jan 22 19:07 data3

To unmount just use umount

umount tank/demo@today

Our snapshot name was not chosen very long-term minded, you can rename them to something more verbose :

zfs rename tank/demo@today tank/demo@22jan2017

The last thing we want to do, is remove a snapshot, lets remove the initial snapshot, since there was no data :

zfs destroy tank/demo@initial

There is another option, which I don’t trust much beside this example and I would only advice it to use with extreme care. Rollback, can be done straight back to a state of the system at the time of the snapshot creation, however only back to the latest snapshot. If you want to go back further, you need to destroy the snapshots in between.

zfs rollback -r tank/demo@22jan2017

Note the -r will destroy any snapshots between current state and @22jan2017 snapshot.

Relevant

Snapshots are so cheap, people tend to forget to make them. However, cheap is not free, so making a cron to generate a snapshot every month/week/day/hour will eventually put your tank full with snapshots. Therefor I suggest using a snapshot systems that creates and prunes snapshots. I chose sanoid, a good system, this article can help you install sanoid.

With multiple datasets, and each having multiple snapshots, its sometimes difficult to get a good overview of howmany snapshots each dataset has. You can use this shell cmd to check :

zfs list -t snapshot | tail -n +2 | sed ’s/@/\t/g’ | cut -f 1 | uniq -c

Which generates something like :

# zfs list -t snapshot | tail -n +2 | sed 's/@/\t/g' | cut -f 1 | uniq -c
 30 huginn/complgen3
  2 jbod1/breva
  1 jbod1/users
  5 jbod1/users/flylab
  5 jbod1/users/proteo
  2 jbod1/users/servers
  5 jbod1/users/userA