Investigate duplicate resource error from Jet AWS #220 #58

Closed
muvaf opened this issue Aug 4, 2022 · 1 comment
Labels
bug Something isn't working

Comments

@muvaf
Member

muvaf commented Aug 4, 2022

What happened?

A few users have reported this bug, which seems to be a fundamental issue with Terrajet: crossplane-contrib/provider-jet-aws#220

We should check whether the Upjet-based official providers are subject to it. I think we should fix it in both Terrajet and Upjet.

How can we reproduce it?

Details are in crossplane-contrib/provider-jet-aws#220

@muvaf muvaf added the bug Something isn't working label Aug 4, 2022
@sergenyalcin sergenyalcin self-assigned this Aug 15, 2022
@sergenyalcin
Member

I investigated this issue. Let me share my observations.

I started by looking at the shared gRPC server, because of the following comment: crossplane-contrib/provider-jet-aws#220 (comment)

That comment suggests a workaround: disabling the shared gRPC server. Some people watching the issue reported that the workaround works. But a few days later, I saw the following comment: crossplane-contrib/provider-jet-aws#220 (comment)

The commenter says the workaround did not work for them, which made me reconsider the starting point of my investigation.

To be honest, my first impression of this issue was biased by the first comment. Had I approached the investigation objectively from the start, I would have said that the root cause is related to Sync/Async usage. I have observed this type of error in resources that are Sync and take a long time to create or delete, and also when I put a heavy load on the cluster so that provisioning/deletion took a long time. When we changed such resources to Async, the problem went away.

In the root issue, there are two resources: FargateProfile (eks) and SecurityGroup (ec2). Both of them use the Sync functionality.

A short time ago, while testing the eks resources, I observed the same issue for FargateProfile and resolved it by changing this resource's behavior from Sync to Async. (The shared gRPC server has been enabled in my local environment for a long time.)

So I think we can resolve this type of problem by changing the behavior from Sync to Async.

I will also open a PR to address this problem for the SecurityGroup resource.
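
For illustration, here is a rough sketch of what switching a resource from Sync to Async could look like as a per-resource configuration override. The `Resource`, `Provider`, `Configurator`, and `EnableAsync` names below are hypothetical stand-ins written for this example so that it compiles on its own; they are loosely modeled on the provider configuration style and are not the actual Terrajet/Upjet API, which should be checked in the respective config package. The Terraform resource names are assumed to correspond to FargateProfile and SecurityGroup.

```go
package main

import "fmt"

// Resource is a hypothetical stand-in for a provider's per-resource
// configuration. Only the field relevant to this issue is modeled.
type Resource struct {
	Name     string
	UseAsync bool // false means the resource is handled synchronously
}

// Configurator mutates the configuration of a single resource.
type Configurator func(r *Resource)

// Provider is a hypothetical stand-in for the provider-level configuration.
type Provider struct {
	Resources     map[string]*Resource
	configurators map[string][]Configurator
}

// NewProvider registers the given resources with the default (Sync) behavior.
func NewProvider(names ...string) *Provider {
	p := &Provider{
		Resources:     make(map[string]*Resource),
		configurators: make(map[string][]Configurator),
	}
	for _, n := range names {
		p.Resources[n] = &Resource{Name: n}
	}
	return p
}

// AddResourceConfigurator queues a configurator for the named resource.
func (p *Provider) AddResourceConfigurator(name string, c Configurator) {
	p.configurators[name] = append(p.configurators[name], c)
}

// ConfigureResources applies all queued configurators to registered resources.
func (p *Provider) ConfigureResources() {
	for name, cs := range p.configurators {
		r, ok := p.Resources[name]
		if !ok {
			continue // no such resource registered; nothing to configure
		}
		for _, c := range cs {
			c(r)
		}
	}
}

// EnableAsync switches the named resources from Sync to Async.
func EnableAsync(p *Provider, names ...string) {
	for _, n := range names {
		p.AddResourceConfigurator(n, func(r *Resource) { r.UseAsync = true })
	}
}

func main() {
	names := []string{"aws_eks_fargate_profile", "aws_security_group", "aws_vpc"}
	p := NewProvider(names...)

	// Only the slow-to-provision resources from the root issue are switched.
	EnableAsync(p, "aws_eks_fargate_profile", "aws_security_group")
	p.ConfigureResources()

	for _, n := range names {
		fmt.Printf("%s async=%t\n", n, p.Resources[n].UseAsync)
	}
}
```

Keeping the override per resource, rather than flipping a global default, matches the approach described above: only resources whose creation or deletion takes a long time are moved to Async.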

@muvaf muvaf closed this as completed Aug 27, 2022