Parallelize in_place_bit_reverse_permute by jotabulacios · Pull Request #562 · yetanotherco/lambda_vm

jotabulacios · 2026-04-24T19:47:45Z

No description provided.

jotabulacios · 2026-04-24T19:48:05Z

/bench

github-actions · 2026-04-24T19:49:26Z

Codex Code Review

Found one issue:

Low / Potential bug: crypto/math/src/fft/cpu/bit_reversing.rs:2 adds E: Send to the public in_place_bit_reverse_permute API even when the parallel feature is disabled, and even for small inputs that use the sequential fallback. The old function worked for any &mut [E]; this now breaks valid callers with non-Send element types such as Rc<_>. Keep the public sequential API unconstrained, or at least gate the bound with #[cfg(feature = "parallel")] and preserve the unconstrained signature for non-parallel builds.

No security issues found in the unsafe parallel swap logic given the documented power-of-two precondition.

I could not run cargo check because rustup tried to write under /home/runner/.rustup, which is read-only in this environment.

claude · 2026-04-24T19:49:43Z

+                if br > i {
+                    // SAFETY: (i, br) uniquely identifies this pair (smaller index is owner),
+                    // so no two threads race on the same `ptr.0.add(k)` slot. Both indices
+                    // are in-bounds since i < n and br < n.


Low: redundant alias

SendPtr<E> is Copy, so ptr is already freely copyable inside the closure — the let p = ptr; binding serves no purpose.

Suggested change

// are in-bounds since i < n and br < n.

unsafe {

core::ptr::swap(ptr.0.add(i), ptr.0.add(br));

}

claude · 2026-04-24T19:49:47Z

-    for i in 0..input.len() {
-        let bit_reversed_index = reverse_index(i, input.len() as u64);
+pub fn in_place_bit_reverse_permute<E: Send>(input: &mut [E]) {
+    let n = input.len();


Low: missing invariant assertion before unsafe code

The entire function — including the doc comment — requires n to be a power of two. Violating that causes reverse_index to produce out-of-bounds indices, which is a silent panic in the serial path but undefined behaviour in the parallel path (raw pointer dereference out of bounds).

All call-sites in polynomial.rs and bowers_fft.rs already guard with is_power_of_two() before calling this function, but a debug_assert here makes the safety invariant explicit and catches future misuse:

Suggested change

let n = input.len();

let n = input.len();

debug_assert!(n == 0 || n.is_power_of_two(), "input length must be a power of two");

claude · 2026-04-24T19:49:59Z

Review: Parallelize `in_place_bit_reverse_permute`

Overview: Adds a Rayon-based parallel fast path under #[cfg(feature = \"parallel\")] when the input exceeds 16 384 elements, using a SendPtr wrapper to share a raw pointer across threads.

Safety correctness ✓

The unsafe is sound. Bit-reverse permutation is a bijection, so each pair (i, br(i)) with i < br(i) appears exactly once in the iteration. Because br(br(i)) = i, the thread for index j = br(i) sees br(j) = i < j and skips the swap — so every memory location is written by at most one thread. The invariant is correctly stated in the SAFETY comment.

Issues

Medium — parallel path is never exercised by the test suite

PARALLEL_BITREV_THRESHOLD is 1 << 14 = 16 384. Every existing test (bit_reverse_permutation_works uses size 16; test_parallel_matches_sequential in bowers_fft_tests.rs uses size 1 024) stays well below the threshold. The new unsafe parallel branch has zero test coverage.

Suggested addition to the #[cfg(all(test, feature = "alloc"))] block (or a new #[cfg(all(test, feature = "parallel"))] block):

#[cfg(all(test, feature = "parallel"))]
#[test]
fn bit_reverse_permutation_parallel_matches_serial() {
    let n = 1 << 15; // above PARALLEL_BITREV_THRESHOLD
    let mut serial: Vec<usize> = (0..n).collect();
    let mut parallel: Vec<usize> = (0..n).collect();

    // serial baseline
    for i in 0..n {
        let br = reverse_index(i, n as u64);
        if br > i { serial.swap(i, br); }
    }

    in_place_bit_reverse_permute(&mut parallel);
    assert_eq!(serial, parallel);
}

Low — redundant alias and missing invariant assertion

See inline comments.

github-actions · 2026-04-24T19:55:30Z

Benchmark — fib_iterative_8M (median of 5)

_{Table parallelism: auto (cores / 3)}

Metric	main	PR	Δ
Peak heap	56567 MB	58186 MB	+1619 MB (+2.9%) ⚪
Prove time	26.636s	27.642s	+1.006s (+3.8%) ⚪

✅ No significant change.

⚠️ Baseline heap spread: 7.6% (60341 MB / 57943 MB / 56039 MB / 56567 MB / 56400 MB) — comparison may be less reliable

_{Commit: 3f8beac · Baseline: built from main · Runner: self-hosted bench}

jotabulacios · 2026-04-24T20:18:03Z

/bench 10

jotabulacios · 2026-04-27T12:39:01Z

/bench 10

jotabulacios · 2026-04-27T19:26:35Z

/bench k=1

jotabulacios · 2026-04-28T13:18:17Z

/bench 5

Parallelize in_place_bit_reverse_permute

239eb42

claude Bot reviewed Apr 24, 2026

View reviewed changes

Merge branch 'main' into perf/parallel-bit-reverse

547ced7

Merge branch 'main' into perf/parallel-bit-reverse

3f8beac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallelize in_place_bit_reverse_permute#562

Parallelize in_place_bit_reverse_permute#562
jotabulacios wants to merge 3 commits into
mainfrom
perf/parallel-bit-reverse

jotabulacios commented Apr 24, 2026

Uh oh!

jotabulacios commented Apr 24, 2026

Uh oh!

github-actions Bot commented Apr 24, 2026

Uh oh!

claude Bot Apr 24, 2026

Uh oh!

claude Bot Apr 24, 2026

Uh oh!

claude Bot commented Apr 24, 2026

Uh oh!

github-actions Bot commented Apr 24, 2026 •

edited

Loading

Uh oh!

jotabulacios commented Apr 24, 2026

Uh oh!

jotabulacios commented Apr 27, 2026

Uh oh!

jotabulacios commented Apr 27, 2026

Uh oh!

jotabulacios commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

-                    // are in-bounds since i < n and br < n.
+                    unsafe {
+                        core::ptr::swap(ptr.0.add(i), ptr.0.add(br));
+                    }

	let n = input.len();
	let n = input.len();
	debug_assert!(n == 0 \|\| n.is_power_of_two(), "input length must be a power of two");

Conversation

jotabulacios commented Apr 24, 2026

Uh oh!

jotabulacios commented Apr 24, 2026

Uh oh!

github-actions Bot commented Apr 24, 2026

Codex Code Review

Uh oh!

claude Bot Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot commented Apr 24, 2026

Review: Parallelize in_place_bit_reverse_permute

Safety correctness ✓

Issues

Medium — parallel path is never exercised by the test suite

Low — redundant alias and missing invariant assertion

Uh oh!

github-actions Bot commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark — fib_iterative_8M (median of 5)

Uh oh!

jotabulacios commented Apr 24, 2026

Uh oh!

jotabulacios commented Apr 27, 2026

Uh oh!

jotabulacios commented Apr 27, 2026

Uh oh!

jotabulacios commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Review: Parallelize `in_place_bit_reverse_permute`

github-actions Bot commented Apr 24, 2026 •

edited

Loading