-
Notifications
You must be signed in to change notification settings - Fork 28
Home
During hackweek interstellar I evaluated using the boost graph library (BGL). You should be familiar with the basics of graph theory and BGL when looking at libstorage-bgl-eval. The code is of prototype quality.
There are two usecases for graphs in libstorage. The first are the devices themself, the second and even more interesting are the actions performed when committing the queued changing to the system.
Storing the devices in libstorage as a graph looks like a natural approach.
You might have seen the diagram above in YaST. But here it is a direct dump of internal data structure instead of a cumbersome transformation.
The graph approach has several advantages compared to the current list of list design:
-
No pseude containers for e.g. MD RAID, tmpfs or NFS are needed.
-
Possibility to have special objects for IMSM and DDF RAID containers.
-
Some operations are trivial compared to the current libstorage, e.g. getting all devices using a device is just a plain BFS.
Together with a redesign of the current objects several features would be simple, e.g. using disks for filesystems, renaming LVM volume groups and logical volumes, switching to and from partitioned MD RAID.
When committing the queued changes to the system libstorage has to analyse what steps are needed and in which order they must be done. Unfortunately the code here has evolved and some if statements span several lines of code. Unfortunately bugs are also frequent here.
The new approch looks different:
-
Two device graphs (the first is the current system, the second the desired system) are compared and the required steps are stored as vertecies of a graph. E.g. a step is to create a partition.
-
Dependencies are added as edges in the graph. E.g. when creating two partitions on a disk the ordering is important (otherwise the partition numbers get swapped).
-
A topological sort calculates the order in which the steps can be executed.
Here is a simply action graph for deleting a volume group with one logical volume and two physical volumes.
I see several advantages of the new approch:
-
Better testability: The code to generate the actions and their dependencies can be checked in a testsuite (one testcase already exists). Only testing the actions themself needs to be done on an actual system.
-
Checking the dependencies instead of the ordering is more robust. The ordering can be correct "by luck" while dependencies are explicit.
-
The topological sort can also be used to find actions that can be done in parallel. This might be used for filesystem creation (although newer filesystems (e.g. btrfs, xfs, ext4) have fast mkfs commands).
Here is a more complex action graph with dependencies for partition creation.
Here is a even more complex action graph with dependencies for partition creation and mount ordering. One filesystems is mounted twice resulting in cross device dependencies. AFAIS the old approch cannot handle such complex situations and it would be very difficult to implement them.
The nop is added as a synchronization point incase an action depends on all mount point of /dev/sda1 being done. Whether that is really needed I cannot say yet.
When you compile libstorage-bgl-eval you can find more action graphs in the examples directory.