
[FEA] Add host_buffer class #260

Open
jrhemstad opened this issue Jan 28, 2020 · 24 comments
Labels
0 - Backlog (In queue waiting for assignment) · feature request (New feature or request)

Comments

@jrhemstad (Contributor) commented Jan 28, 2020

Is your feature request related to a problem? Please describe.

It has come up in several independent conversations with @jakirkham and others that it would be nice for RMM to provide a counterpart to device_buffer for host-allocated memory. See the conversations in #141 and #216.

In short, it would be convenient to have a common abstraction for host memory allocations used by RAPIDS libraries. This would enable things like pinned host memory allocations for faster device-to-host memory spilling.

Describe the solution you'd like

Add an rmm::host_buffer class to act as the host memory counterpart to device_buffer, i.e., an untyped, uninitialized host memory allocation.
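
As a rough illustration (not actual RMM code, since the class doesn't exist yet), such a host_buffer might mirror the owning, move-only shape of device_buffer; all names below are illustrative:

```cpp
// Illustrative sketch only -- rmm::host_buffer does not exist yet.
#include <cstddef>

namespace rmm {

// Stand-in for the host_memory_resource base class proposed further below.
class host_memory_resource {
 public:
  virtual ~host_memory_resource() = default;
  void* allocate(std::size_t bytes) { return do_allocate(bytes); }
  void deallocate(void* p, std::size_t bytes) { do_deallocate(p, bytes); }

 private:
  virtual void* do_allocate(std::size_t bytes)           = 0;
  virtual void do_deallocate(void* p, std::size_t bytes) = 0;
};

// Untyped, uninitialized, owning host allocation: the host-side analog of
// device_buffer. Copying is disabled, mirroring device_buffer's ownership
// semantics.
class host_buffer {
 public:
  host_buffer(std::size_t size, host_memory_resource* mr)
    : data_{mr->allocate(size)}, size_{size}, mr_{mr} {}
  ~host_buffer() { mr_->deallocate(data_, size_); }

  host_buffer(host_buffer const&)            = delete;
  host_buffer& operator=(host_buffer const&) = delete;

  void* data() noexcept { return data_; }
  std::size_t size() const noexcept { return size_; }

 private:
  void* data_;
  std::size_t size_;
  host_memory_resource* mr_;
};

}  // namespace rmm
```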

Additional context

Note that this opens a few sizable cans of worms: questions that will eventually need to be answered. From my comment here:

  • Would rmm::alloc be renamed to rmm::device_alloc and we add a rmm::host_alloc?
  • Does a host memory resource accept streams for alloc/free? If not, then host/device_memory_resource cannot share the same base class.
  • Do we enforce using RMM host memory resources anywhere host memory is being allocated in the same way we do with device memory? (e.g., are we going to provide a rmm::host_vector to replace std::vector?)
  • Are there alignment requirements for host allocations?
  • Will there be a separate default memory resource for host allocations?
  • Do we need host memory pools?

Here's what I think is the simplest, least-effort path forward:

  • Provide a host_memory_resource base class mirrored from device_memory_resource.
  • Provide a host_buffer class that accepts a host_memory_resource* to use for allocation (a sketch of both follows this list).
  • Do NOT provide mirrors of the default device memory resource infrastructure (e.g., get_default_resource/set_default_resource())
  • Do NOT provide a mirror for rmm::alloc/free for host memory allocations
  • If a host memory pool is required, only support it in C++17 and beyond.
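
A minimal sketch of those first two pieces, assuming (one of the open questions above) that host resources do not take streams; pinned_memory_resource is a hypothetical concrete resource motivated by the spilling use case:

```cpp
// Sketch only, not actual RMM code: a host-side mirror of the
// device_memory_resource interface, minus streams.
#include <cstddef>
#include <new>

#include <cuda_runtime_api.h>

namespace rmm {

class host_memory_resource {
 public:
  virtual ~host_memory_resource() = default;

  void* allocate(std::size_t bytes,
                 std::size_t alignment = alignof(std::max_align_t)) {
    return do_allocate(bytes, alignment);
  }
  void deallocate(void* p, std::size_t bytes,
                  std::size_t alignment = alignof(std::max_align_t)) {
    do_deallocate(p, bytes, alignment);
  }

 private:
  virtual void* do_allocate(std::size_t bytes, std::size_t alignment) = 0;
  virtual void do_deallocate(void* p, std::size_t bytes,
                             std::size_t alignment) = 0;
};

// Hypothetical concrete resource: page-locked (pinned) host memory, the
// motivating use case for faster device-to-host spilling.
class pinned_memory_resource final : public host_memory_resource {
 private:
  void* do_allocate(std::size_t bytes, std::size_t /*alignment*/) override {
    void* p{nullptr};
    if (cudaMallocHost(&p, bytes) != cudaSuccess) { throw std::bad_alloc{}; }
    return p;
  }
  void do_deallocate(void* p, std::size_t /*bytes*/,
                     std::size_t /*alignment*/) override {
    cudaFreeHost(p);
  }
};

}  // namespace rmm
```

A host_buffer would then take a host_memory_resource* in its constructor, exactly as device_buffer takes a device_memory_resource*.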
@jrhemstad added the feature request label on Jan 28, 2020
@jakirkham (Member)

cc @kkraus14 @pentschev

@kkraus14 (Contributor)

> If a host memory pool is required, only support it in C++17 and beyond.

I assume this would be a crazy effort to backport to C++14? We typically get pushback from requiring newer compiler versions / system libraries.

@jrhemstad (Contributor, Author) commented Jan 28, 2020

> I assume this would be a crazy effort to backport to C++14? We typically get pushback from requiring newer compiler versions / system libraries.

Yeah, it's not really possible to backport. You'd be better off just re-implementing all of the memory pool logic or using some other open source host memory pool.

That said, we can probably insulate user libraries from needing C++17. We can wrap the C++17 bits in include guards and throw an error if someone tries to use the host memory pools pre-C++17.
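
For instance, the insulation could look something like the sketch below, assuming the pool were built on C++17's std::pmr; host_memory_pool is a hypothetical name, not RMM API:

```cpp
// Sketch of the insulation idea: C++17 consumers get the real pool, C++14
// consumers still compile but any attempt to use the pool throws.
#include <stdexcept>

#if __cplusplus >= 201703L
#include <memory_resource>

namespace rmm {
// Illustrative choice: alias the standard C++17 pool resource.
using host_memory_pool = std::pmr::unsynchronized_pool_resource;
}  // namespace rmm

#else

namespace rmm {
// Pre-C++17: the name exists so headers still parse, but construction fails.
struct host_memory_pool {
  host_memory_pool() {
    throw std::logic_error{"rmm::host_memory_pool requires C++17"};
  }
};
}  // namespace rmm

#endif
```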

Note that providing a host memory pool is orthogonal to providing a host_buffer.

@harrism (Member) commented Jan 28, 2020

I would leave host memory pools to future work.

@kkraus14 (Contributor)

> Yeah, it's not really possible to backport. You'd be better off just re-implementing all of the memory pool logic or using some other open source host memory pool.
>
> That said, we can probably insulate user libraries from needing C++17. We can wrap the C++17 bits in include guards and throw an error if someone tries to use the host memory pools pre-C++17.
>
> Note that providing a host memory pool is orthogonal to providing a host_buffer.

Yeah, I understand; it's just that there will likely be enterprise customers who want both the memory pool and C++14 support, but beggars can't be choosers 😄.

@jakirkham (Member)

Would using Boost help?

@harrism (Member) commented Jan 30, 2020

We don't want RMM to depend on Boost.

@jakirkham (Member)

Yeah, that makes sense. Just thinking about what alternatives we might have 🙂

@jakirkham (Member)

What if we take this as an opportunity to start using the Conda compilers? That would put us on GCC 7.3.0, which has C++17 support (unless I've missed something). Besides, this was something we were planning to do anyway. (rapidsai/cudf#1210)

@harrism (Member) commented Jan 30, 2020

I think we should just stick to C++14, and leave host memory pools for future work.

@kkraus14 (Contributor)

> What if we take this as an opportunity to start using the Conda compilers? That would put us on GCC 7.3.0, which has C++17 support (unless I've missed something). Besides, this was something we were planning to do anyway. (rapidsai/cudf#1210)

That's only for conda packages, though. We have people who want to build from source without conda, and we can't necessarily guarantee they'll have a new enough compiler for C++17. We've already had pushback on C++14 😄.

@jrhemstad (Contributor, Author) commented Jan 30, 2020

Agreed, the host memory pool is future work; I just wanted to bring it up since we're talking about adding host memory management to RMM.

@jakirkham (Member)

OK, makes sense. Thanks for the context, and thanks for putting up with suggestions you've likely already considered 🙂

@harrism (Member) commented Mar 12, 2020

Can this issue be closed now that host_memory_resource exists and rmm::alloc/free are going to be dropped?

@jrhemstad (Contributor, Author)

We still need a host_buffer analog to device_buffer.

@harrism (Member) commented Mar 12, 2020

Ah yes, I got that confused.

@github-actions

This issue has been marked rotten due to no recent activity in the past 90d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

@github-actions

This issue has been marked stale due to no recent activity in the past 30d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be marked rotten if there is no activity in the next 60d.

@randerzander

Still desired

@jakirkham (Member)

(For context, this came up today when discussing how to improve spilling and serialization performance.)

cc @quasiben (for awareness)

@jrhemstad added the 0 - Backlog label and removed the inactive-30d and inactive-90d labels on Jun 11, 2021
@jrhemstad (Contributor, Author)

There's unlikely to be any movement here until NVIDIA/libcudacxx#158 is complete in the next month or so.

@jakirkham (Member)

Thanks for the update Jake 🙂

@jrhemstad (Contributor, Author)

This conversation is quite stale now, so I'll give a quick update on the status quo:

  • The rmm::(host/device)_memory_resource base class interface is on its way out in favor of the cuda::mr functionality
  • cuda::mr should be thought of as taking what we learned over the last several years with RMM, generalizing it, and then centralizing it in libcu++
  • RMM is in the process of migrating to the new cuda::mr interface: Use cuda::mr::memory_resource instead of raw device_memory_resource #1095
  • Today, libcu++ only offers the interface. It doesn't offer any concrete implementations, nor any data structures or containers that use a memory resource for allocation. This is admittedly a bit limited today.
  • Our immediate next steps are to 1) add concrete memory resource implementations, and 2) create allocators, data structures, and containers that use a cuda::mr resource for allocation. Step 2 would ultimately satisfy the original request of having simple host_buffer and device_buffer classes, though instead of those we'd likely have cuda::buffer<device_accessible> and cuda::buffer<host_accessible> (rough sketch below).
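
As a rough sketch of that last idea (none of this is real libcu++ API; cuda::buffer doesn't exist yet, and the property tags and resource interface below are simplified stand-ins for cuda::mr):

```cpp
// Simplified stand-ins for the cuda::mr property mechanism; purely
// illustrative, not libcu++ code.
#include <cstddef>
#include <cstdlib>

namespace demo {

struct host_accessible {};
struct device_accessible {};

// A resource advertises where its allocations are accessible via a tag type.
template <typename Property>
class resource {
 public:
  virtual ~resource() = default;
  virtual void* allocate(std::size_t bytes)           = 0;
  virtual void deallocate(void* p, std::size_t bytes) = 0;
};

// One buffer template replaces the host_buffer/device_buffer pair: the
// property parameter says where the memory is usable.
template <typename Property>
class buffer {
 public:
  buffer(std::size_t size, resource<Property>* mr)
    : data_{mr->allocate(size)}, size_{size}, mr_{mr} {}
  ~buffer() { mr_->deallocate(data_, size_); }

  buffer(buffer const&)            = delete;
  buffer& operator=(buffer const&) = delete;

  void* data() noexcept { return data_; }
  std::size_t size() const noexcept { return size_; }

 private:
  void* data_;
  std::size_t size_;
  resource<Property>* mr_;
};

// Trivial host-accessible resource to show usage:
//   demo::malloc_resource mr;
//   demo::buffer<demo::host_accessible> buf{1024, &mr};
class malloc_resource final : public resource<host_accessible> {
 public:
  void* allocate(std::size_t bytes) override { return std::malloc(bytes); }
  void deallocate(void* p, std::size_t) override { std::free(p); }
};

}  // namespace demo
```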

@harrism (Member) commented Feb 26, 2024

Would you distinguish between asynchronous and synchronous host_buffer and device_buffer classes (so there would be four classes)? Or would you have two classes with both sync and async methods? Or would host_buffer always be synchronous and device_buffer always asynchronous?
