-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
set pmix.locstr and pmix.cpuset #31
Comments
Bumping this, as testing on Dane suggests that pmix's calling into
|
Notes to whoever ends up doing this: The two keys mentioned above are referred to in ORTE code using these names
Don't go looking at slurm for a reference since it apparently does this wrong. References on this from Ralph in #58 are:
For better or worse, in flux-pmix I didn't use the preprocessor macros in the "standard" pmix.h to manipulate the info vectors. Instead the "infovec" class in flux-pmix does that stuff. That may be confusing when looking at orte code since it uses the macros. Be prepared for some studying of the above pmix-standard, the orte code, and flux-pmix and some head scratching as there seem to be many ways to organize the "info schema". Do you set the keys once per node, once for the whole system, once per task...? It may be helpful to look at the code in openmpi to see how it is looking up the keys and do it that way. |
A kludgy workaround that I hesitate to even mention would be to dump the shell's XML to a file and set |
Yeah we should do it right, but since openmpi may not be the only application-level code that does this, a shell option like |
I agree it'd be useful, but we'd have to be sure that users retain the power to subsequently
|
Yes, I think the idea is that the hwloc XML would only be saved to One nice thing about doing it in the flux-pmix plugin is that once this issue is closed, the automated use of |
Though @garlick makes a good point about other use cases for |
Problem: flux-pmix does not yet share hwloc with MPI, which forces MPI to go looking for it, sometimes at great cost to performance. For the time being, tell flux-core to share a hwloc xml file by setting the -o hwloc.xmlfile shell option. Later we will want to fix flux-framework#31 properly and drop this.
Oops this should remain open for "correct" solution. |
Problem: ompi asks for
pmix.locstr
andpmix.cpuset
(optionally).Furthermore,
pmix.locstr
is listed in the business card exchange use case in spec v5 sec B.1.2.I think if we provide these attributes, then every rank of ompi won't need to gather hwloc info from scratch, as an
strace
would seem to indicate they are doing now.The text was updated successfully, but these errors were encountered: