Test reverting of CUDA installation on Windows staging #840

Merged: 2 commits, Sep 27, 2024

@petemoore petemoore commented Sep 26, 2024

This reverts the CUDA changes on Windows staging, which seem to have broken the Windows GPU pools in #830.

Note that this reverts the changes only on staging, so that we don't test in production. Production GPU pools are probably broken too, but testing on staging first is a good habit to get into.

It might be worth trying one last time to build with the CUDA installation on staging. Alternatively, we could revert on staging, then use a pool where we run as root, open an RDP session on a worker, and execute the installation steps manually to see what goes wrong.

Weirdly, the installation seems to happen but then somehow breaks the worker: as far as I can tell, workers were getting spawned but were not calling Worker Manager to get credentials.

At the same time, I enabled interactive on all worker pools in community, because I can't think of a good reason we would ever want to disable it. If that changes, we can set it per worker pool later, but for now I think we should keep things simple and leave it enabled everywhere until that becomes a problem.
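
The "enabled everywhere, overridable per pool later" idea can be sketched roughly as follows. This is only an illustration, not the code in this PR: the `enableInteractive` key, the `workerConfig` layout, the `apply_defaults` helper, and the pool names are all assumptions made for the example.

```python
from copy import deepcopy

# Hypothetical global default; the key name is an assumption for illustration.
DEFAULT_WORKER_CONFIG = {"enableInteractive": True}


def apply_defaults(pools, defaults=DEFAULT_WORKER_CONFIG):
    """Return new pool definitions with missing worker config keys filled in.

    A value set explicitly on a pool always wins over the global default,
    so a single pool can opt out later without touching the others.
    """
    resolved = {}
    for name, pool in pools.items():
        merged = deepcopy(pool)  # never mutate the caller's definitions
        worker_config = merged.setdefault("workerConfig", {})
        for key, value in defaults.items():
            worker_config.setdefault(key, value)
        resolved[name] = merged
    return resolved


# Hypothetical pool names and definitions.
pools = {
    "proj-example/win-gpu": {"workerConfig": {}},
    "proj-example/linux": {"workerConfig": {"enableInteractive": False}},
    "proj-example/no-config": {},
}
resolved = apply_defaults(pools)
```

With this shape, the simple case needs no per-pool settings at all, and an explicit `False` on one pool is enough to opt it out if that ever becomes necessary.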

@petemoore petemoore self-assigned this Sep 26, 2024
@petemoore petemoore requested a review from a team as a code owner September 26, 2024 10:07
@lotas lotas left a comment

🤞

@petemoore petemoore changed the title Test reverting of CUDA installation on Windows _staging_ Test reverting of CUDA installation on Windows staging Sep 27, 2024
@petemoore petemoore merged commit 3cac800 into main Sep 27, 2024
3 checks passed
@petemoore petemoore deleted the revert-cuda-installation-windows branch September 27, 2024 08:17