A CAStor cluster manages itself. Processes running in real time organically balance the storage and CPU loads and check object replicas to make sure they match the policy set for each object. Should a stray cosmic ray flip a bit somewhere, the system recovers automatically. If a disk goes down, all other disks on all other nodes participate in recovering the data that was on the bad disk. The larger the cluster, the shorter the recovery time.
Recovery works down to the disk level, so a node can keep operating even when it has a bad disk. In fact, if your hardware allows hot swapping of disks, you can swap in a new disk while the system is up and running. Otherwise, wait until the system has transferred all of the node's contents to spare capacity before you retire, refurbish, and recycle the node. While you run your business, CAStor will run the cluster.
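To see why a larger cluster recovers faster, consider a rough back-of-the-envelope model. The sketch below is not CAStor code; it assumes that every surviving disk takes an equal share of the recovery work and sustains the same throughput. The function name `estimated_recovery_seconds` and the sample figures (a 1 TB disk, 50 MB/s per disk) are hypothetical values chosen only for illustration.

```python
def estimated_recovery_seconds(failed_disk_bytes, total_disks, per_disk_bytes_per_sec):
    """Idealized time to re-create the data held on one failed disk.

    Assumption (not a CAStor specification): the work is split evenly
    across all surviving disks, each running at the same throughput.
    """
    surviving_disks = total_disks - 1
    if surviving_disks < 1:
        raise ValueError("need at least one surviving disk to recover")
    return failed_disk_bytes / (surviving_disks * per_disk_bytes_per_sec)


if __name__ == "__main__":
    TB = 10**12  # bytes
    MB = 10**6   # bytes

    # Hypothetical cluster sizes; the same 1 TB disk fails in each case.
    for disks in (5, 21, 101):
        seconds = estimated_recovery_seconds(1 * TB, disks, 50 * MB)
        print(f"{disks:>3} disks: about {seconds / 60:.1f} minutes")
```

Under these assumptions the recovery time is inversely proportional to the number of surviving disks, which is the intuition behind the claim that bigger clusters heal faster. Real recovery times also depend on network bandwidth, replica placement, and the load the cluster is carrying at the time.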