MGMT-16781: Implement ocp postprocess IP rename #89

Merged on Feb 6, 2024 (1 commit)

@omertuc omertuc commented Feb 5, 2024

This change ensures that unnecessary and time-consuming OCP
control-plane component rollouts are skipped. They are triggered by the
node's IP change, which occurs when applying an IBU seed image onto the
target SNO.

Specifically, the changes are:

  • add a new recert configuration argument, --ip, of type string, which
    provides the new IP
  • add an ip_rename mod, which implements the following changes:
    • replace all occurrences of the previous (seed node) IP in etcd
    • replace all occurrences of the previous (seed node) IP in the
      filesystem (currently just a stub, since LCA already does this:
      https://github.com/openshift-kni/lifecycle-agent/blob/3f447f629cf73a25a350c1c2cc88d95bf2a31956/lca-cli/postpivot/postpivot.go#L232-L236)
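
The replacement step listed above can be sketched roughly as follows. This is only an illustration, not recert's actual code: the function names (`parse_args`, `replace_ip`, `rename_store`) are hypothetical, etcd is stood in for by a plain dict of string values, and only IPv4 is handled. The one detail worth showing is that a naive substring replace would also rewrite longer addresses that merely start with the old IP, so the sketch guards the match with lookarounds.

```python
import argparse
import ipaddress
import re


def parse_args(argv):
    """Parse a hypothetical command line that takes the new IP via --ip."""
    parser = argparse.ArgumentParser(description="IP rename sketch")
    # Validate early so a malformed address never reaches the replace step.
    parser.add_argument("--ip", type=ipaddress.ip_address, required=True)
    return parser.parse_args(argv)


def replace_ip(text, old_ip, new_ip):
    """Replace whole occurrences of old_ip in text with new_ip (IPv4 only).

    The lookarounds stop 192.0.2.1 from matching inside 192.0.2.10
    or inside 1192.0.2.1.
    """
    pattern = re.compile(r"(?<![0-9.])" + re.escape(str(old_ip)) + r"(?![0-9])")
    new = str(new_ip)
    # A function replacement avoids any backslash processing of `new`.
    return pattern.sub(lambda _match: new, text)


def rename_store(store, old_ip, new_ip):
    """Apply replace_ip to every value of a dict standing in for etcd."""
    return {key: replace_ip(value, old_ip, new_ip) for key, value in store.items()}


if __name__ == "__main__":
    args = parse_args(["--ip", "198.51.100.7"])
    # Hypothetical keys and values, for illustration only.
    seed_store = {
        "example/endpoint": "https://192.0.2.1:2379",
        "example/other-node": "https://192.0.2.10:2379",
    }
    for key, value in rename_store(seed_store, "192.0.2.1", args.ip).items():
        print(key, value)
```

In this sketch the first value is rewritten to the new address while the second is left alone, because `192.0.2.10` is a different address that only shares a prefix with the seed IP.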

openshift-ci bot commented Feb 5, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: omertuc


omertuc commented Feb 5, 2024

/test baremetalds-sno-recert-cluster-rename
/test e2e-aws-ovn-single-node-recert-serial
/test e2e-aws-ovn-single-node-recert-parallel

omertuc added a commit to omertuc/lifecycle-agent that referenced this pull request Feb 5, 2024
Similar to b6c5db1 but for IP instead
of hostname

This change leverages recert's latest OCP post-process IP feature, which
keeps OCP's control-plane cluster operators (e.g. etcd, kube-apiserver,
authentication) from triggering additional revisions, thus reducing the
time OCP needs to stabilize after recert.

For more details please check here:
- rh-ecosystem-edge/recert#89

omertuc commented Feb 5, 2024

/test baremetalds-sno-recert-cluster-rename
/test e2e-aws-ovn-single-node-recert-serial
/test e2e-aws-ovn-single-node-recert-parallel

openshift-ci bot commented Feb 6, 2024

@omertuc: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
|---|---|---|---|---|
| ci/prow/e2e-aws-ovn-single-node-recert-parallel | 88cb8c4 | link | true | /test e2e-aws-ovn-single-node-recert-parallel |

@omertuc omertuc merged commit 4912712 into rh-ecosystem-edge:main Feb 6, 2024
9 of 11 checks passed
omertuc added a commit to omertuc/lifecycle-agent that referenced this pull request Feb 6, 2024
Similar to b6c5db1 but for IP instead
of hostname

This change leverages recert's latest OCP post-process IP feature, which
keeps OCP's control-plane cluster operators (e.g. etcd, kube-apiserver,
authentication) from triggering additional revisions, thus reducing the
time OCP needs to stabilize after recert.

For more details please check here:
- rh-ecosystem-edge/recert#89

Also stop deleting `etcd-endpoints`, since they are now fixed by recert
and deleting them triggers a rollout
mresvanis changed the title from "Implement ocp postprocess IP rename" to "MGMT-16781: Implement ocp postprocess IP rename" on Feb 6, 2024
jc-rh pushed a commit to jc-rh/lifecycle-agent that referenced this pull request Feb 14, 2024