
Parallel Sweep [DRAFT] #975

Draft

luke-gruber wants to merge 67 commits into master from
v4.1.0dev_parallel_sweep_concurrent_set

Conversation

@luke-gruber

No description provided.

nobu and others added 30 commits March 30, 2026 21:31
Previously we did a non-atomic read of malloc_increase to determine
whether or not we needed to gc on malloc. We also should only need to
check the value when we have actually flushed/committed a new increase.

Use relaxed load in atomic_sub_nounderflow

The previous code did a non-atomic load of val, which would work fine
(since the value would only be used for an atomic CAS) but resulted in
TSAN errors.

This also adjusts the CAS to use the relaxed memory model, though I'm not
sure that actually makes a difference anywhere.

Use relaxed atomics for malloc increase as well

Use atomics for loading deferred

Use atomics for final slot count

Use flag for is_lazy_sweeping

WIP: simpler background thread (no-op for now)

Adjustments

Allow one more thread

WIP: getting closer to checking end of sweep condition correctly

dequeue

usleep

Attempt pre-sweep

Fix accounting of free slots

Sweep anything that !needs_cleanup

Add TODO

Free some T_OBJECTs in background thread

get id2ref working

Finish background sweeping before compaction

gc_abort() waits for background sweeping to finish

Add page->page_lock and lock it when changing freelist

Right now only the mutator and the background thread need to lock it.
We should re-init all these locks on fork due to Ractors (TODO).

Make sure not background sweeping during Process.warmup

Less locking/unlocking in gc_sweep_step_worker

Better sweep_lock management

reinit sweep_lock,sweep_cond after fork

Allow sweep thread to acquire VM lock.

It never joins a VM barrier.

Add simple T_DATA freeing

Also finish background freeing in ruby_vm_destruct.

Allow taking VM lock in sweep thread

Add fiber_pool_lock for cont.c

Don't take VM lock in background sweep thread

Make id2ref_tbl_lock re-entrant

We can get the following:

1) RB_VM_LOCK() // need this to allow GC when inserting into tbl
2) id2ref_tbl_lock()
3) insert into id2ref_tbl, causes GC
4) free object id, which acquires id2ref_tbl_lock again

Therefore, the lock needs to be re-entrant.

freeing of id2ref object_id in background thread

Get gen fields freeing done in background sweep thread

First pass at making zombies in background sweep thread

add mutexes_lock lock for thread->keeping_mutexes

We need to lock this when manipulating this linked list, because freeing
a mutex, which can now be done in a background thread, manipulates it. I
made this lock global for now, but it should really be either per-ractor
or per-thread.

Add autoload_free_lock

We can't free autoload_data by 1 thread while also freeing an autoload_const
that's associated with it concurrently. This can happen currently if
they're on separate pages.

Add assertions after major GC that background thread is inactive.

I'm going to work on allowing background sweeping during a major (unless
explicitly requested via GC.start). This is probably even more important
than during minors.

Add more assertions and comments

whitelist making zombies in sweep thread

sweep some imemos in background sweep thread

tmp commit

stuck

tmp

Get 1 pass over pages mostly working

GC compaction tests are still broken. Not sure why.

TODO: when in background thread, never modify the page's freelist
directly in case user code is being run. Instead, each page should have
a deferred_freelist that the user thread will link in when the page is swept.

Merge freelist and deferred freelist when we process a page

some cleanup

Get GC compaction working, doesn't use background thread

Fix running GC in cleanup finalizers

stuck with GC compact

Fix GC.compact

Remove usage of page_lock mutex as we no longer need it.

Keep actual lock around, but I'll remove it in a separate commit.

GC: Remove unused page->page_lock mutex

cleanup

Remove unused code, add comments

Background thread only sweeps until ruby thread is done with that heap

There are some problems with the current approach:

1) The background thread can get ahead of the ruby thread on the current
heap and sweep more than is necessary instead of moving on to the next
heap. We should track `incremental_step_freed_objects` for each heap so
ruby thread and background thread are in sync, and background thread can
sweep next heap when necessary.

2) We need to restart background sweeping when we exit from GC. There
should be an `objspace->background_sweep_mode` that is entered after GC
exits and background sweeping begins.

checkin

Fix issues with parallel sweep

Issues were:

* post-fork issues
* gc_sweep_dequeue_page/heap_is_sweep_done/has_sweeping_pages trio
is tricky
* rb_ec_cleanup issues (aborting bg sweeping, stopping thread)

Fix issue with gc_sweep_rest() that could loop forever

It could happen when background sweeping got ahead of the ruby thread.

Fix more bugs

Attempt to make has_sweeping_pages() faster

We can make it even faster if we always let the ruby thread take the
last page. This is what it used to do, and I think it was the right
strategy in hindsight, just because of `has_sweeping_pages` and
`gc_sweep_finish`. Otherwise, the ruby thread could need to wait on the
background thread sometimes when it's called.

Simplify end conditions by ruby thread taking last sweeping_page

This is how it used to work, and I think it's a good idea to simplify
checks for when sweeping is finished.

Tracking down allocation bug

Fixed allocation bug

It had to do with adjacent bitfields sharing the same memory location
while being used concurrently. Changing them to bools fixed the issue.

Improve efficiency of requesting background sweep help

Keep track of heap->latest_swept_page

cleanup

unlink pages in sweep thread

Add to free_pages and pooled_pages in sweep thread

remove redundant work

Use atomic for heap->foreground_sweep_steps and separate swept_pages lock

Add heap->skip_sweep_continue

Add parallel sweep lock stats

Output sweep time at end of process

Add counts of sweep events

less conditionals

Change PSWEEP_LOCK_STATS to use per-callsite stats

Add wall clock psweep timings

first pass at rb_garbage_object_p with sweep thread

Fix WB issues with sweep thread

Use atomic operations for bitmaps that can be read/modified by both the
mutator and the sweep thread.

Avoid the tricky case of `gc_setup_mark_bits` by deferring it to sweep
finish. This way, it doesn't conflict with write barriers.

Make page->needs_setup_mark_bits its own memory object

tmp commit

before pairing

Bug fix for mark T_NONE

John found the fix.

Add this flag to all internal types. Internal extension types are
skipped for now. This is so that garbage_object_p() will work correctly.
…ally

The use here is not protected by the sweep lock, so it should be. Also, use
an atomic load for checking if an object is marked so that it's not
re-ordered past the next atomic load of page->before_sweep.

Make popcount_bits work for all platforms

Also, don't call GET_EC() for every call to free_vm_weak_references. We
know in advance when this is called from the sweep thread, so just call
the correct function in pre_sweep_plane.

We no longer need it because we changed how zombies are handled.

Since MMTK might also use this, I called it something more generic.
It's now `rb_gc_obj_free_concurrency_safe_vm_weak_references`.
@semgrep-pr-bot

semgrep-pr-bot bot commented Apr 13, 2026

⚠️ Deprecated LLM Models Detected

This PR contains references to 1 deprecated model(s). Consider migrating to the recommended replacements.

⚠️ o1 (openai)

  • Occurrences: 1
  • Severity: WARNING
  • Shopify Replacement: gpt-5.1
  • Hotswap Date: 2025-12-15
  • Provider Shutdown: 2025-09-26
  • Provider Suggested Replacements: gpt-5, gpt-4.1*

Locations:

  • set.c:934

Recommended Action: Replace with gpt-5.1 before 2025-12-15


📚 Resources

💡 This is an informational warning - it will not block your PR. However, please plan to migrate away from deprecated models before their shutdown dates.
💡 Please reach out to #help-proxy-shopify-ai if you have any questions.

🤖 Posted by Deprecated Models GitHub App



4 participants