
[DOC] Add best practices/advice with respect to using pool allocators #1694

Open · wence- opened this issue Oct 4, 2024 · 7 comments
Labels: doc Documentation

wence- (Contributor) commented Oct 4, 2024

RMM has multiple pool-like allocators:

  • a pool_memory_resource that wraps a coalescing best-fit suballocator around an upstream resource;
  • an arena_memory_resource that similarly wraps an upstream resource, but divides the global allocation into size-binned arenas to mitigate fragmentation under mixed allocation/deallocation patterns;
  • and a cuda_async_memory_resource that uses the memory pool implementation provided by cudaMallocAsync. This one can avoid fragmentation because it controls the virtual address space.

Since these are all composable, one can happily wrap a pool_memory_resource around a cuda_async_memory_resource (or an arena, ...). But should one?
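
For concreteness, a minimal sketch of the composition in question (assuming current RMM headers and the pool_memory_resource constructor that takes an explicit initial size; the 1 GiB figure is arbitrary, for illustration only):

```cpp
#include <rmm/mr/device/cuda_async_memory_resource.hpp>
#include <rmm/mr/device/per_device_resource.hpp>
#include <rmm/mr/device/pool_memory_resource.hpp>

#include <cstddef>

int main() {
  // cudaMallocAsync-backed resource: the CUDA driver manages a pool for us.
  rmm::mr::cuda_async_memory_resource async_mr{};

  // An RMM coalescing suballocator stacked on top of it: "double pooling".
  // Whether this ever makes sense is exactly what the docs should address.
  rmm::mr::pool_memory_resource<rmm::mr::cuda_async_memory_resource> pool_mr{
      &async_mr, std::size_t{1} << 30 /* 1 GiB initial size, arbitrary */};

  // All subsequent RMM allocations on this device go through pool_mr.
  rmm::mr::set_current_device_resource(&pool_mr);
}
```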

It would be useful if the documentation provided some guidance on which combinations make sense, and what typical allocation scenarios best fit a particular pool.

We should also recommend best practices for picking initial pool sizes: a bad choice here can lead to excessive fragmentation.
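
For example, the guidance could discuss sizing the initial pool as a fraction of currently free device memory. A hypothetical helper (the name and approach are illustrative, not an RMM API):

```cpp
#include <cuda_runtime_api.h>

#include <cstddef>

// Hypothetical: pick an initial pool size as a fraction of free device
// memory, aligned down to 256 bytes (the CUDA allocation alignment RMM uses).
std::size_t fraction_of_free_device_memory(double fraction) {
  std::size_t free_bytes{}, total_bytes{};
  cudaMemGetInfo(&free_bytes, &total_bytes);
  auto bytes = static_cast<std::size_t>(static_cast<double>(free_bytes) * fraction);
  return bytes - bytes % 256;
}
```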

@wence- wence- added ? - Needs Triage Need team to review and classify doc Documentation labels Oct 4, 2024
@wence- wence- removed the ? - Needs Triage Need team to review and classify label Oct 4, 2024
harrism (Member) commented Oct 9, 2024

Side thought: Maybe we should experiment with replacing the cuda_memory_resource used for initial_resource with a cuda_async_memory_resource...

wence- (Contributor, Author) commented Oct 10, 2024

> Side thought: Maybe we should experiment with replacing the cuda_memory_resource used for initial_resource with a cuda_async_memory_resource...

Maybe, though we'd have the usual static-destruction problems, so we'd never explicitly free that memory pool.

It might also be problematic in the multi-library case: a library that is not configured with a specific pool would allocate from the initial_resource, which would now build a pool, and that pool could then conflict with pools that other libraries set up.

harrism (Member) commented Oct 10, 2024

I was thinking that the async resource uses the default pool by default, which we would not own. Maybe I'm misremembering how it's implemented.

vyasr (Contributor) commented Oct 10, 2024

The async resource will use the pool managed by the CUDA driver, which we do not own and would probably be fine. Ideally everyone would use that and then all pooling would be handled by the driver. If we use the async mr by default and a different library does not but constructs their own pool manually using a different underlying allocation routine (e.g. cudaMalloc instead of cudaMallocAsync), then we could conflict.

wence- (Contributor, Author) commented Oct 11, 2024

In cuda_async_memory_resource we call cudaMemPoolCreate to get a handle to a pool, and we use that pool to make our allocations. So it sounds like we own that pool.
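
Roughly, in simplified form (a sketch, not the exact RMM code; the function name is made up):

```cpp
#include <cuda_runtime_api.h>

cudaMemPool_t create_owned_pool(int device_id) {
  cudaMemPoolProps props{};
  props.allocType     = cudaMemAllocationTypePinned;  // required for device pools
  props.handleTypes   = cudaMemHandleTypeNone;
  props.location.type = cudaMemLocationTypeDevice;
  props.location.id   = device_id;

  cudaMemPool_t pool{};
  cudaMemPoolCreate(&pool, &props);  // we create the pool, so we own it
  // Allocations are then served from this explicit pool via
  // cudaMallocFromPoolAsync(&ptr, bytes, pool, stream).
  return pool;
}
```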

vyasr (Contributor) commented Oct 11, 2024

My mistake, I didn't realize that we were allocating from a specific pool that we created. The failure mode should still be relatively graceful if two processes both use the async allocation routines and one pool blocks another's growth. I don't think it will be as graceful if you mix and match async with non-async allocation, but I could be wrong there.

harrism (Member) commented Oct 14, 2024

I believe the reason cuda_async_memory_resource owns its pool is that we also provide a non-owning MR: cuda_async_view_memory_resource. We could use the latter as the default resource with the default pool. That MR requires passing a cudaMemPool_t pool handle, and the default pool handle can be retrieved with cudaDeviceGetDefaultMemPool.
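
As a sketch (the helper name is made up; the cudaMemPool_t constructor is the existing cuda_async_view_memory_resource API):

```cpp
#include <rmm/mr/device/cuda_async_view_memory_resource.hpp>

#include <cuda_runtime_api.h>

rmm::mr::cuda_async_view_memory_resource make_default_pool_view(int device_id) {
  cudaMemPool_t default_pool{};
  cudaDeviceGetDefaultMemPool(&default_pool, device_id);
  // Non-owning view over the driver's default pool; RMM never destroys it.
  return rmm::mr::cuda_async_view_memory_resource{default_pool};
}
```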

Perhaps, however, we should wait to make this default change until we can start using the cuda_async_memory_resource from libcu++, which has a different design.
