Add functions to manipulate the GC heap counters #55390

vchuravy · 2024-08-06T08:53:19Z

jl_malloc and jl_free can be used to inform the GC about memory
pressure caused by external libraries. This PR adds two functions
that only do the accounting and not the memory allocation/freeing.

The intended audience is users of libraries that do their own memory
management, but still have establishes sizes and deallocation points.
An example of this is the unified memory API in CUDA.

`jl_malloc` and `jl_free` can be used to inform the GC about memory pressure caused by external libraries. This PR adds two functions that only do the accounting and not the memory allocation/freeing. The intended audience is users of libraries that do their own memory management, but still have establishes sizes and deallocation points. An example of this is the unified memory API in CUDA.

src/gc-interface.h

gbaraldi · 2024-08-06T12:59:42Z

src/gc-stock.c

+{
+    jl_gcframe_t **pgcstack = jl_get_pgcstack();
+    jl_task_t *ct = jl_current_task;
+    if (pgcstack != NULL && ct->world_age) {


Why does world_age matter here? or is that just a proxy to see if it's a valid task? We do this in other places but now that I look at it it's odd.

Yeah this code looked weird to me as well, but I assume it's a valid task check, maybe an artifact before thread-adoption?

d-netto

The implementation seems mostly fine.

Still, adding these calls to inform the GC about manually managed memory makes me a bit nervous.

We face a similar problem at RAI where we need to make sure that a database buffer cache (the source of manually managed memory in our case) and the Julia heap cooperate. We have been moving in the direction of trying to make the GC heuristics code more aware of this external memory by, for instance, polling for RSS periodically.

I'm not familiar with memory management on GPUs, but is making the GC heuristics code more aware of GPU memory something feasible in that domain?

vchuravy · 2024-08-15T11:05:29Z

but is making the GC heuristics code more aware of GPU memory something feasible in that domain?

We need to use a different allocator cudaMalloc/cudaFree, but for unified memory the address-space between host and device is shared. The alternative would be to extend gcext to have a jl_new_foreign_memory where we provide the GC a custom alloc and free function.

But this doesn't solve how to inform the GC about memory pressure stemming from allocations in a C++ library. RSS is a poor proxy since that starts to get all kinds of wonky in the presence of swap and multi-process.

vchuravy added the GC Garbage collector label Aug 6, 2024

vchuravy requested review from d-netto and gbaraldi August 6, 2024 08:53

vchuravy commented Aug 6, 2024

View reviewed changes

src/gc-interface.h Outdated Show resolved Hide resolved

vchuravy commented Aug 6, 2024

View reviewed changes

src/gc-interface.h Outdated Show resolved Hide resolved

Apply suggestions from code review

07fa76b

gbaraldi reviewed Aug 6, 2024

View reviewed changes

d-netto reviewed Aug 6, 2024

View reviewed changes

vchuravy mentioned this pull request Aug 22, 2024

Problem with GC hpsc-lab/SecureArithmetic.jl#45

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add functions to manipulate the GC heap counters #55390

Add functions to manipulate the GC heap counters #55390

vchuravy commented Aug 6, 2024

gbaraldi Aug 6, 2024 •

edited

Loading

vchuravy Aug 15, 2024

d-netto left a comment

vchuravy commented Aug 15, 2024 •

edited

Loading

Add functions to manipulate the GC heap counters #55390

Are you sure you want to change the base?

Add functions to manipulate the GC heap counters #55390

Conversation

vchuravy commented Aug 6, 2024

gbaraldi Aug 6, 2024 • edited Loading

Choose a reason for hiding this comment

vchuravy Aug 15, 2024

Choose a reason for hiding this comment

d-netto left a comment

Choose a reason for hiding this comment

vchuravy commented Aug 15, 2024 • edited Loading

gbaraldi Aug 6, 2024 •

edited

Loading

vchuravy commented Aug 15, 2024 •

edited

Loading