I love that cloudmin includes some basic system monitoring, however now that I have to come the cloudmin interface for 90% of my system management, I would prefer to see some more effective monitoring variables.
Today I ran into a problem where a HOST failed because the root partition was full, but the disk space alert was never triggered and I figured out why.
If you use the "disk space free/used" variable it can only effectively monitor the total disk space used or free. If you have an advanced partitioning scheme or simply mount a second partition for storage, this variable instantly becomes ineffective and unless you understand this weakness, you may make the same mistake as I in thinking you are monitoring the free space on your partitions. In this case, the alert was set to go off if ANY system has less that 5 GB free space. But because my backup partition on my host had lots of free space, my system ground to a halt with a full root partition because the 200 GB of free space on the backup partition caused the metric used to report the 200 or so free GB which would not trigger the alert for the crippled root partition.
I would like to see a new and more comprehensive set of drive space variables that can allow you to monitor free/used partition space on mounts, volumes, partitions, drives or any other aspect of drive space so that you can effectively catch a problem before it crashes the host or the vm.