-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: Consensus failure lottery #23683
Comments
@Reecepbcups wrote a workaround in Reecepbcups@72e651e It seems it was resolved in v50 but it recently came back up on many networks around the same time. |
also ref from the hub: cosmos/gaia#2313 (uses my above patch as a work around) |
Update Upon reaching the upgrade height, nodes automatically shut down (and restart) to apply the new upgrade. In smaller blockchain networks, this simultaneous shutdown of multiple nodes can lead to a loss of consensus if a sufficient number of nodes become temporarily inactive. The Cosmos SDK does not wait for a majority of nodes to reach the upgrade height before initiating the shutdown process. Consequently, if consensus is lost during this period, all nodes may shut down, resulting in the observed error. To maintain consensus during upgrades, we should monitor which nodes have been updated and which have not. After bumping the number of validators from |
Update The I'm starting to suspect Cosmovisor. Here's the interesting bit No Cosmovisor: The application exits itself
Cosmovisor: The application receives an interrupt from Cosmovisor
|
Is there an existing issue for this?
What happened?
We added CosmWasm support to our chain and tried to upgrade our devnet. However, we stumbled on
We wrote
interchaintest/chain_upgrade_test.go
in liftedinit/manifest-ledger#118 and sometime it fails with the error above, sometime everything is fine and the upgrade/test is successful.I'm not sure what's going on.
The upgrade handler can be found in
app/upgrades/next/upgrades.go
Cosmos SDK Version
0.50.11
How to reproduce?
Checkout the branch related to liftedinit/manifest-ledger#118
Run
make local-image
andmake ictest-chain-upgrade
.The test might pass. Run it again until it fails.
The text was updated successfully, but these errors were encountered: