Replies: 3 comments 12 replies
-
@Aashiqps is it possible for you to check behaviour with NFS primary storage? |
Beta Was this translation helpful? Give feedback.
-
Hey @Aashiqps , we are facing the same situation as well. Great to have found another Linstor user in dissagregated architecture. At this moment, we managed to get Volume Snapshots to work fine, after latest fixes from Linbit side. But we still the failover issues, where we cant seem to get all the Virtual Routers to start up successfully during a node failure (we simulated by pulling the power from the server). If the Virtual Router cant start up, then the VMs in that network cant continue to start up as well. We tested this using NFS Too just to isolate the network issue and the VM HA using NFS works just fine. When we go through the logs, we cant seem to identify whats the problem. Our latest findings is that according to the logs, the VR was successfully migrated to the new host, and its status transitioned to running. However, the 'ACS HighAvailabilityManager' triggered a stop/reboot action on the router and did not take any action to start it afterward. `2024-07-16 17:31:18,853 DEBUG [c.c.v.VmWorkJobDispatcher] (Work-Job-Executor-137:ctx-b555a5f3 job-385919/job-386245) (logid:1dc1e938) Run VM work job: com.cloud.vm.VmWorkStop for VM 54572, job origin: 385919 2024-07-16 17:31:37,947 INFO [c.c.h.HighAvailabilityManagerImpl] (HA-Worker-42:ctx-cfc0e084 work-103139) (logid:fe9b0f9a) VM But the challenge we are having, is that we dont see any error related to the storage. There should be somewhere a message that either a device can't be accessed or created, but we cant find any. But there are few things to take note when using Linstor:
More info here: Im curious to know your progress and if you managed to find any solution to it? Happy to communicate to help each other out. |
Beta Was this translation helpful? Give feedback.
-
@btzq Host HA is not working in our case, but VM HA is working. The instances, VR are automatically getting up when KVM host is recovered. VR dont migrate to other hosts if a power failure or network outage occurs (in my case). Check VM HA is working or not, by default it should work, just power off the instance ( not from UI ) and check instances are automatically starting or not. Reduced the time period of this parameter in global settings (kvm.ha.recover.wait.period). VM HA worked with both linstor and NFS primary |
Beta Was this translation helpful? Give feedback.
-
I have created a cloudstack with 1 management node and 3 kvm host nodes. For primary storage i am using linstor in disaggregated method and configured as diskless as i am using separate nodes ( 1 combined and 2 satellite nodes ) for linstor cluster and also another node for nfs secondary storage. The cloudstack is working but i am facing challenge in HA and snapshots. The vm created are not migrating to another host when i reboot, remove network cable or stop cloudstack agent in KVM host nodes, also the snapshots are not working. However for maintenance mode in host the vm, systemvm and vr are automatically migrating to another host. can anybody guide a solution.
Beta Was this translation helpful? Give feedback.
All reactions