
[codex] Enable contiguous mul range path without parallel #132

Merged: shinaoka merged 1 commit into main from codex/enable-contiguous-mul-single-thread on Jun 6, 2026
@shinaoka shinaoka commented Jun 6, 2026

Member

Summary

  • Compile the contiguous-range multiply planner and runner even when the `parallel` feature is disabled.
  • Keep the Rayon split path behind `parallel`, and share the single-thread runner between both builds.
  • Add a regression test checking that the high-rank contiguous broadcast-mul plan is available in the default non-parallel build.
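
The gating described above can be sketched roughly as follows. This is a minimal illustration, not the actual strided-kernel code: `ContiguousMulPlan`, `plan_contiguous_mul`, `run_single_thread`, `run_split`, `run_contiguous_mul`, and `PAR_THRESHOLD` are all hypothetical names, and `std::thread::scope` stands in for the Rayon split so the sketch has no dependencies.

```rust
/// Hypothetical plan for an elementwise multiply over one contiguous range.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct ContiguousMulPlan {
    pub len: usize,
}

/// Planner: no cfg gate, so it is compiled in every build.
/// Assumption: a plan exists only when all three buffers have the same length.
pub fn plan_contiguous_mul(dst_len: usize, a_len: usize, b_len: usize) -> Option<ContiguousMulPlan> {
    if dst_len == a_len && dst_len == b_len {
        Some(ContiguousMulPlan { len: dst_len })
    } else {
        None
    }
}

/// Single-thread runner, shared by the default and `parallel` builds.
pub fn run_single_thread(dst: &mut [f64], a: &[f64], b: &[f64]) {
    for ((d, x), y) in dst.iter_mut().zip(a).zip(b) {
        *d = x * y;
    }
}

/// Split path: only compiled with the `parallel` feature.
/// Each half is handed to the same single-thread runner.
#[cfg(feature = "parallel")]
fn run_split(dst: &mut [f64], a: &[f64], b: &[f64]) {
    let mid = dst.len() / 2;
    let (d0, d1) = dst.split_at_mut(mid);
    let (a0, a1) = a.split_at(mid);
    let (b0, b1) = b.split_at(mid);
    std::thread::scope(|s| {
        s.spawn(move || run_single_thread(d0, a0, b0));
        run_single_thread(d1, a1, b1);
    });
}

/// Entry point available in both builds; only the split is feature-gated.
pub fn run_contiguous_mul(plan: ContiguousMulPlan, dst: &mut [f64], a: &[f64], b: &[f64]) {
    debug_assert_eq!(plan.len, dst.len());
    #[cfg(feature = "parallel")]
    {
        // Hypothetical cutoff below which splitting is not worthwhile.
        const PAR_THRESHOLD: usize = 1 << 16;
        if plan.len >= PAR_THRESHOLD {
            run_split(dst, a, b);
            return;
        }
    }
    run_single_thread(dst, a, b);
}
```

With this layout, a regression test in the default build only needs to assert that `plan_contiguous_mul` returns `Some(..)` for a shape that should qualify, since the planner is no longer hidden behind the feature flag.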

Root cause

Tenferro uses `strided-kernel` without the `parallel` feature in its benchmark path. The large contiguous-range broadcast-multiply path was gated entirely behind `parallel`, so single-threaded (1T) runs fell back to the generic strided loop for large intermediates.
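
To illustrate why the fallback matters, here is a rough sketch contrasting a generic strided loop with a contiguous-range loop for a 2-D broadcast multiply `dst[i][j] = a[i][j] * b[j]`. Both functions (`mul_strided_2d`, `mul_contiguous_rows`) are hypothetical and only assume row-major storage; they are not the strided-kernel implementation.

```rust
/// Generic strided loop: every element computes three offsets from
/// per-dimension strides. A stride of 0 expresses broadcasting.
pub fn mul_strided_2d(
    dst: &mut [f64],
    a: &[f64],
    b: &[f64],
    shape: [usize; 2],
    dst_strides: [usize; 2],
    a_strides: [usize; 2],
    b_strides: [usize; 2],
) {
    for i in 0..shape[0] {
        for j in 0..shape[1] {
            let d = i * dst_strides[0] + j * dst_strides[1];
            let x = i * a_strides[0] + j * a_strides[1];
            let y = i * b_strides[0] + j * b_strides[1];
            dst[d] = a[x] * b[y];
        }
    }
}

/// Contiguous-range loop: assumes `dst` and `a` are row-major with
/// `ncols` elements per row and `b_row` is broadcast across rows.
/// The inner loop walks plain slices with no offset arithmetic.
pub fn mul_contiguous_rows(dst: &mut [f64], a: &[f64], b_row: &[f64], ncols: usize) {
    if ncols == 0 {
        return; // chunks_exact_mut panics on a chunk size of 0
    }
    for (drow, arow) in dst.chunks_exact_mut(ncols).zip(a.chunks_exact(ncols)) {
        for ((d, x), y) in drow.iter_mut().zip(arow).zip(b_row) {
            *d = x * y;
        }
    }
}
```

Both loops produce the same result; the second simply does less index arithmetic per element and exposes a simpler inner loop over slices.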

Validation

  • cargo test -p strided-kernel -- --nocapture
  • cargo test -p strided-kernel --features parallel -- --nocapture
  • cargo fmt --check && git diff --check

@shinaoka shinaoka force-pushed the codex/enable-contiguous-mul-single-thread branch from 7d6070c to f6be58e June 6, 2026 22:39
@shinaoka shinaoka marked this pull request as ready for review June 6, 2026 22:40
@shinaoka shinaoka merged commit f60f290 into main Jun 6, 2026
5 checks passed
@shinaoka shinaoka deleted the codex/enable-contiguous-mul-single-thread branch June 6, 2026 22:40