Deadlock of the pool when destroying a replica #1737
Comments
It looks like there was a heartbeat failure, which caused the control-plane to mark the node as offline.
The DestroyReplica call is the one being starved of the lock, but who is holding the pool lock here?
The first
This ticket needs two fixes:
Control-plane changes: openebs/mayastor-control-plane#862
The control-plane fix was released in 2.7.1, but we should also ensure from the data-plane side that this can't happen, so I'm leaving this issue open.
Describe the bug
The pool lock was taken and never released. This means all gRPC calls for that pool will fail!
To Reproduce
It seems this may happen if we try to delete a replica that is part of a nexus!
Expected behavior
Don't lock the pool forever...
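One way to picture the expected behavior is a bounded lock acquisition: if the pool lock cannot be taken within a deadline, the call fails with an error instead of waiting forever, and the lock is released automatically when the guard goes out of scope. The sketch below is only an illustration using the Rust standard library; `LockError`, `lock_with_timeout`, `destroy_replica`, and the `Vec<String>` standing in for a pool are hypothetical names, not the data-plane's actual types or locking mechanism.

```rust
use std::sync::{Mutex, MutexGuard, TryLockError};
use std::thread;
use std::time::{Duration, Instant};

/// Hypothetical error returned when the pool lock cannot be taken in time.
#[derive(Debug, PartialEq)]
pub enum LockError {
    Timeout,
}

/// Poll `try_lock` until the lock is acquired or `timeout` elapses.
/// The returned guard releases the lock when it is dropped.
pub fn lock_with_timeout<T>(
    lock: &Mutex<T>,
    timeout: Duration,
) -> Result<MutexGuard<'_, T>, LockError> {
    let deadline = Instant::now() + timeout;
    loop {
        match lock.try_lock() {
            Ok(guard) => return Ok(guard),
            // A previous holder panicked; recover the guard and carry on.
            Err(TryLockError::Poisoned(poisoned)) => return Ok(poisoned.into_inner()),
            Err(TryLockError::WouldBlock) => {
                if Instant::now() >= deadline {
                    return Err(LockError::Timeout);
                }
                thread::sleep(Duration::from_millis(1));
            }
        }
    }
}

/// Hypothetical stand-in for a DestroyReplica handler: the "pool" is just a
/// list of replica names. Returns Ok(true) if the replica existed.
pub fn destroy_replica(
    pool: &Mutex<Vec<String>>,
    name: &str,
    timeout: Duration,
) -> Result<bool, LockError> {
    let mut replicas = lock_with_timeout(pool, timeout)?;
    let before = replicas.len();
    replicas.retain(|r| r.as_str() != name);
    Ok(replicas.len() != before)
    // `replicas` (the guard) is dropped here, releasing the pool lock.
}
```

With this shape, a caller that finds the lock held gets `LockError::Timeout` back and can report a retryable failure, rather than every subsequent call for that pool hanging behind a lock that is never released.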
Additional context
This was found on another report: #1734