Skip to content

CNTRLPLANE-647: Expose v4/v6InternalSubnet OVN-Kubernetes configuration in HostedCluster API#8249

Open
hypershift-jira-solve-ci[bot] wants to merge 17 commits into
openshift:mainfrom
hypershift-community:fix-CNTRLPLANE-647
Open

CNTRLPLANE-647: Expose v4/v6InternalSubnet OVN-Kubernetes configuration in HostedCluster API#8249
hypershift-jira-solve-ci[bot] wants to merge 17 commits into
openshift:mainfrom
hypershift-community:fix-CNTRLPLANE-647

Conversation

@hypershift-jira-solve-ci

@hypershift-jira-solve-ci hypershift-jira-solve-ci Bot commented Apr 15, 2026

Copy link
Copy Markdown

What this PR does / why we need it:

Exposes OVN-Kubernetes internal subnet configuration (v4InternalSubnet and v6InternalSubnet) in the HostedCluster API, allowing users to customize the IPv4 and IPv6 subnets used internally by OVN-Kubernetes instead of relying on the defaults (100.64.0.0/16 and fd98::/64).

This is needed when the default OVN internal subnets overlap with existing network infrastructure, causing routing conflicts for hosted clusters.

Changes

  • API (api/hypershift/v1beta1): Add optional V4InternalSubnet and V6InternalSubnet fields to OVNKubernetesConfig with CEL validation rules for CIDR format, prefix length, non-zero first octet (IPv4), and immutability once set.
  • CRDs & clients: Regenerate CRD manifests, featuregated CRDs, apply configuration clients, and vendor to reflect the new fields.
  • Validation (hypershift-operator): Extend validateSliceNetworkCIDRs to detect CIDR overlap between the new internal subnets and cluster/machine/service networks. Extract appendCIDREntry helper to reduce duplicated parsing logic.
  • Propagation (control-plane-operator): Propagate V4InternalSubnet and V6InternalSubnet from the HostedCluster's OVNKubernetesConfig to the guest cluster's network operator CR, with user-specified values taking precedence over platform defaults.
  • Docs: Regenerate API reference and aggregated documentation.

Which issue(s) this PR fixes:

Fixes https://redhat.atlassian.net/browse/CNTRLPLANE-647

Special notes for your reviewer:

  • The new fields are optional and do not change behavior for existing clusters.
  • CEL validation ensures CIDR correctness at admission time.
  • Immutability validation prevents removal of subnet configuration once set, since changing OVN internal subnets on a running cluster would be disruptive.
  • Serialization compatibility tests verify forward/backward compatibility against an N-1 version of the struct.

Checklist:

  • Subject and description added to both, commit and PR.
  • Relevant issues have been referenced.
  • This change includes docs.
  • This change includes unit tests.

Always review AI generated responses prior to use.
Generated with Claude Code via /jira:solve [CNTRLPLANE-647](https://redhat.atlassian.net/browse/CNTRLPLANE-647)


Note: This PR was auto-generated by the jira-agent periodic CI job in response to CNTRLPLANE-647. See the full report for token usage, cost breakdown, and detailed phase output.

Summary by CodeRabbit

  • New Features

    • Added optional V4InternalSubnet and V6InternalSubnet settings for OVN Kubernetes networking; values become immutable after first set.
  • Improvements

    • New subnet fields are propagated into reconciled network configuration where applicable.
    • CIDR validation expanded to include the new internal subnets and detect overlaps.
  • Tests

    • Added unit tests for serialization compatibility, propagation behavior, and CIDR validation for the new fields.

@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Apr 15, 2026
@openshift-ci-robot

openshift-ci-robot commented Apr 15, 2026

Copy link
Copy Markdown

@hypershift-jira-solve-ci[bot]: This pull request references CNTRLPLANE-647 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "5.0.0" version, but no target version was set.

Details

In response to this:

What this PR does / why we need it:

Exposes OVN-Kubernetes internal subnet configuration (v4InternalSubnet and v6InternalSubnet) in the HostedCluster API, allowing users to customize the IPv4 and IPv6 subnets used internally by OVN-Kubernetes instead of relying on the defaults (100.64.0.0/16 and fd98::/64).

This is needed when the default OVN internal subnets overlap with existing network infrastructure, causing routing conflicts for hosted clusters.

Changes

  • API (api/hypershift/v1beta1): Add optional V4InternalSubnet and V6InternalSubnet fields to OVNKubernetesConfig with CEL validation rules for CIDR format, prefix length, non-zero first octet (IPv4), and immutability once set.
  • CRDs & clients: Regenerate CRD manifests, featuregated CRDs, apply configuration clients, and vendor to reflect the new fields.
  • Validation (hypershift-operator): Extend validateSliceNetworkCIDRs to detect CIDR overlap between the new internal subnets and cluster/machine/service networks. Extract appendCIDREntry helper to reduce duplicated parsing logic.
  • Propagation (control-plane-operator): Propagate V4InternalSubnet and V6InternalSubnet from the HostedCluster's OVNKubernetesConfig to the guest cluster's network operator CR, with user-specified values taking precedence over platform defaults.
  • Docs: Regenerate API reference and aggregated documentation.

Which issue(s) this PR fixes:

Fixes https://redhat.atlassian.net/browse/CNTRLPLANE-647

Special notes for your reviewer:

  • The new fields are optional and do not change behavior for existing clusters.
  • CEL validation ensures CIDR correctness at admission time.
  • Immutability validation prevents removal of subnet configuration once set, since changing OVN internal subnets on a running cluster would be disruptive.
  • Serialization compatibility tests verify forward/backward compatibility against an N-1 version of the struct.

Checklist:

  • Subject and description added to both, commit and PR.
  • Relevant issues have been referenced.
  • This change includes docs.
  • This change includes unit tests.

Always review AI generated responses prior to use.
Generated with Claude Code via /jira:solve [CNTRLPLANE-647](https://redhat.atlassian.net/browse/CNTRLPLANE-647)

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci-robot

openshift-ci-robot commented Apr 15, 2026

Copy link
Copy Markdown

@hypershift-jira-solve-ci[bot]: This pull request references CNTRLPLANE-647 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "5.0.0" version, but no target version was set.

Details

In response to this:

What this PR does / why we need it:

Exposes OVN-Kubernetes internal subnet configuration (v4InternalSubnet and v6InternalSubnet) in the HostedCluster API, allowing users to customize the IPv4 and IPv6 subnets used internally by OVN-Kubernetes instead of relying on the defaults (100.64.0.0/16 and fd98::/64).

This is needed when the default OVN internal subnets overlap with existing network infrastructure, causing routing conflicts for hosted clusters.

Changes

  • API (api/hypershift/v1beta1): Add optional V4InternalSubnet and V6InternalSubnet fields to OVNKubernetesConfig with CEL validation rules for CIDR format, prefix length, non-zero first octet (IPv4), and immutability once set.
  • CRDs & clients: Regenerate CRD manifests, featuregated CRDs, apply configuration clients, and vendor to reflect the new fields.
  • Validation (hypershift-operator): Extend validateSliceNetworkCIDRs to detect CIDR overlap between the new internal subnets and cluster/machine/service networks. Extract appendCIDREntry helper to reduce duplicated parsing logic.
  • Propagation (control-plane-operator): Propagate V4InternalSubnet and V6InternalSubnet from the HostedCluster's OVNKubernetesConfig to the guest cluster's network operator CR, with user-specified values taking precedence over platform defaults.
  • Docs: Regenerate API reference and aggregated documentation.

Which issue(s) this PR fixes:

Fixes https://redhat.atlassian.net/browse/CNTRLPLANE-647

Special notes for your reviewer:

  • The new fields are optional and do not change behavior for existing clusters.
  • CEL validation ensures CIDR correctness at admission time.
  • Immutability validation prevents removal of subnet configuration once set, since changing OVN internal subnets on a running cluster would be disruptive.
  • Serialization compatibility tests verify forward/backward compatibility against an N-1 version of the struct.

Checklist:

  • Subject and description added to both, commit and PR.
  • Relevant issues have been referenced.
  • This change includes docs.
  • This change includes unit tests.

Always review AI generated responses prior to use.
Generated with Claude Code via /jira:solve [CNTRLPLANE-647](https://redhat.atlassian.net/browse/CNTRLPLANE-647)


Note: This PR was auto-generated by the jira-agent periodic CI job in response to CNTRLPLANE-647. See the full report for token usage, cost breakdown, and detailed phase output.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@coderabbitai

coderabbitai Bot commented Apr 15, 2026

Copy link
Copy Markdown
Contributor

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

  • @coderabbitai resume to resume automatic reviews.
  • @coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

  • ▶️ Resume reviews
  • 🔍 Trigger review
📝 Walkthrough

Walkthrough

Two optional fields, V4InternalSubnet and V6InternalSubnet, were added to OVNKubernetesConfig with CRD validations enforcing CIDR format, family-specific prefix-length bounds, string length limits, and immutability after initial set. Serialization compatibility tests validate N-1 clients ignore the new fields. The network reconciler now copies non-empty V4/V6 internal subnet values into operatorv1.Network.Spec.DefaultNetwork.OVNKubernetesConfig when OVNKubernetes is used. HostedCluster validation was refactored to parse and validate these subnet fields via a new helper appendCIDREntry, and unit tests were extended for propagation and CIDR overlap validation.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant APIServer as Kubernetes API
    participant HostedController as HostedCluster Controller
    participant CPO as Control-Plane Operator
    participant NetworkCR as Network CR

    User->>APIServer: Create/Update HostedCluster (OVNKubernetesConfig with V4/V6InternalSubnet)
    APIServer->>HostedController: Notify change
    HostedController->>HostedController: validateSliceNetworkCIDRs -> appendCIDREntry
    HostedController->>APIServer: Read/Update Network CR request
    APIServer->>CPO: Deliver Network CR reconcile request
    CPO->>NetworkCR: Reconcile DefaultNetwork (copy V4/V6 when non-empty)
    NetworkCR->>APIServer: Persist Network CR update
Loading
🚥 Pre-merge checks | ✅ 9 | ❌ 1

❌ Failed checks (1 warning)

Check name Status Explanation Resolution
Docstring Coverage ⚠️ Warning Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%. Write docstrings for the functions missing them to satisfy the coverage threshold.
✅ Passed checks (9 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The PR title directly and specifically describes the main change: exposing V4InternalSubnet and V6InternalSubnet OVN-Kubernetes configuration fields in the HostedCluster API.
Stable And Deterministic Test Names ✅ Passed All three test files use standard Go testing.T with table-driven tests, not Ginkgo framework. All test names are static and deterministic.
Test Structure And Quality ✅ Passed PR contains standard Go table-driven tests, not Ginkgo tests. Tests follow appropriate patterns and quality standards for unit tests in Go.
Microshift Test Compatibility ✅ Passed This PR does not add any new Ginkgo e2e tests; it only contains traditional Go unit tests using the testing.T interface.
Single Node Openshift (Sno) Test Compatibility ✅ Passed PR adds only standard Go unit tests, not Ginkgo e2e tests, making SNO compatibility check inapplicable.
Topology-Aware Scheduling Compatibility ✅ Passed PR introduces only network configuration fields and validation logic with no deployment manifests, scheduling constraints, or topology-aware assumptions.
Ote Binary Stdout Contract ✅ Passed No violations of OTE Binary Stdout Contract found. PR introduces no process-level entry points, stdout writes, or suite-level test configuration that could pollute stdout.
Ipv6 And Disconnected Network Test Compatibility ✅ Passed PR adds only standard Go unit tests without Ginkgo e2e tests or IPv4/external connectivity assumptions.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Warning

Review ran into problems

🔥 Problems

Git: Failed to clone repository. Please run the @coderabbitai full review command to re-trigger a full review. If the issue persists, set path_filters to include or exclude specific files.


Comment @coderabbitai help to get the list of available commands and usage tips.

@openshift-ci

openshift-ci Bot commented Apr 15, 2026

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: hypershift-jira-solve-ci[bot]
Once this PR has been reviewed and has the lgtm label, please assign sjenning for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci Bot requested review from csrwng and jparrill April 15, 2026 13:17
@openshift-ci openshift-ci Bot added area/api Indicates the PR includes changes for the API area/cli Indicates the PR includes changes for CLI area/control-plane-operator Indicates the PR includes changes for the control plane operator - in an OCP release area/documentation Indicates the PR includes changes for documentation area/hypershift-operator Indicates the PR includes changes for the hypershift operator and API - outside an OCP release area/testing Indicates the PR includes changes for e2e testing and removed do-not-merge/needs-area labels Apr 15, 2026
@openshift-ci-robot

openshift-ci-robot commented Apr 15, 2026

Copy link
Copy Markdown

@hypershift-jira-solve-ci[bot]: This pull request references CNTRLPLANE-647 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "5.0.0" version, but no target version was set.

Details

In response to this:

What this PR does / why we need it:

Exposes OVN-Kubernetes internal subnet configuration (v4InternalSubnet and v6InternalSubnet) in the HostedCluster API, allowing users to customize the IPv4 and IPv6 subnets used internally by OVN-Kubernetes instead of relying on the defaults (100.64.0.0/16 and fd98::/64).

This is needed when the default OVN internal subnets overlap with existing network infrastructure, causing routing conflicts for hosted clusters.

Changes

  • API (api/hypershift/v1beta1): Add optional V4InternalSubnet and V6InternalSubnet fields to OVNKubernetesConfig with CEL validation rules for CIDR format, prefix length, non-zero first octet (IPv4), and immutability once set.
  • CRDs & clients: Regenerate CRD manifests, featuregated CRDs, apply configuration clients, and vendor to reflect the new fields.
  • Validation (hypershift-operator): Extend validateSliceNetworkCIDRs to detect CIDR overlap between the new internal subnets and cluster/machine/service networks. Extract appendCIDREntry helper to reduce duplicated parsing logic.
  • Propagation (control-plane-operator): Propagate V4InternalSubnet and V6InternalSubnet from the HostedCluster's OVNKubernetesConfig to the guest cluster's network operator CR, with user-specified values taking precedence over platform defaults.
  • Docs: Regenerate API reference and aggregated documentation.

Which issue(s) this PR fixes:

Fixes https://redhat.atlassian.net/browse/CNTRLPLANE-647

Special notes for your reviewer:

  • The new fields are optional and do not change behavior for existing clusters.
  • CEL validation ensures CIDR correctness at admission time.
  • Immutability validation prevents removal of subnet configuration once set, since changing OVN internal subnets on a running cluster would be disruptive.
  • Serialization compatibility tests verify forward/backward compatibility against an N-1 version of the struct.

Checklist:

  • Subject and description added to both, commit and PR.
  • Relevant issues have been referenced.
  • This change includes docs.
  • This change includes unit tests.

Always review AI generated responses prior to use.
Generated with Claude Code via /jira:solve [CNTRLPLANE-647](https://redhat.atlassian.net/browse/CNTRLPLANE-647)


Note: This PR was auto-generated by the jira-agent periodic CI job in response to CNTRLPLANE-647. See the full report for token usage, cost breakdown, and detailed phase output.

Summary by CodeRabbit

Release Notes

  • New Features

  • Added support for configurable IPv4 and IPv6 internal subnets in OVN Kubernetes network configuration with immutability enforcement after creation and CIDR validation constraints.

  • Tests

  • Added comprehensive backward compatibility and CIDR validation tests for new subnet configuration fields.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🧹 Nitpick comments (4)
test/e2e/karpenter_test.go (1)

212-212: Formatting change appears unrelated to PR objectives.

This spacing adjustment in the armNodeLabels map is unrelated to the PR's stated purpose of adding V4InternalSubnet and V6InternalSubnet fields to OVNKubernetesConfig. Consider reverting this change to keep the PR focused on its core objectives.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/e2e/karpenter_test.go` at line 212, Revert the unrelated
whitespace/formatting change in the armNodeLabels map: restore the original
spacing for the "kubernetes.io/arch": "arm64" entry inside the armNodeLabels
variable in test/e2e/karpenter_test.go so the map formatting matches the
surrounding entries and does not introduce cosmetic changes unrelated to adding
V4InternalSubnet/V6InternalSubnet to OVNKubernetesConfig; ensure no other
formatting-only changes remain in the armNodeLabels declaration.
hypershift-operator/controllers/hostedcluster/hostedcluster_controller_test.go (1)

3882-3976: Add explicit ServiceNetwork-overlap cases for the new OVN internal subnet fields.

The new matrix is good, but it still misses direct failure cases where V4InternalSubnet/V6InternalSubnet overlap ServiceNetwork. Adding those keeps this test aligned with the intended overlap validation surface.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@hypershift-operator/controllers/hostedcluster/hostedcluster_controller_test.go`
around lines 3882 - 3976, Add explicit test entries to the OVNKubernetes
internal-subnet table that assert failures when V4InternalSubnet or
V6InternalSubnet overlaps the ServiceNetwork: insert one case where sn includes
CIDR "172.30.0.0/16" and OVNKubernetesConfig.V4InternalSubnet is e.g.
"172.30.0.0/16" with wantErr: true, and another case where sn includes
"fd03::/112" and OVNKubernetesConfig.V6InternalSubnet is e.g. "fd03::1:0/64"
with wantErr: true; follow the same structure/fields used by the existing test
entries (mn, cn, sn, networkType, ovnConfig, wantErr) so the test loop handling
these cases (in hostedcluster_controller_test.go) will exercise ServiceNetwork
overlap validation for both IPv4 and IPv6.
api/hypershift/v1beta1/operator_test.go (1)

93-111: Consider adding IPv4 field assertion in reverse round-trip check.

The reverse round-trip verification checks MTU preservation but doesn't explicitly verify that IPv4 is correctly preserved when deserializing N-1 data into the current struct. This would strengthen the test for case 3 where IPv4 is populated.

💡 Optional: Add IPv4 preservation check
 			if roundTripped.MTU != tt.nMinus1Result.MTU {
 				t.Errorf("MTU mismatch after N-1 round-trip: got %d, want %d", roundTripped.MTU, tt.nMinus1Result.MTU)
 			}
+			// Verify IPv4 is preserved when present in N-1 data
+			if tt.nMinus1Result.IPv4 != nil {
+				if roundTripped.IPv4 == nil {
+					t.Errorf("IPv4 should be preserved after N-1 round-trip, got nil")
+				} else if roundTripped.IPv4.InternalJoinSubnet != tt.nMinus1Result.IPv4.InternalJoinSubnet {
+					t.Errorf("IPv4.InternalJoinSubnet mismatch: got %s, want %s", 
+						roundTripped.IPv4.InternalJoinSubnet, tt.nMinus1Result.IPv4.InternalJoinSubnet)
+				}
+			}
 		})
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@api/hypershift/v1beta1/operator_test.go` around lines 93 - 111, Add an
assertion in the reverse round-trip block to verify the IPv4 field is preserved
when unmarshaling N-1 JSON into the current OVNKubernetesConfig: after
unmarshalling into roundTripped, compare roundTripped.IPv4 to
tt.nMinus1Result.IPv4 and call t.Errorf with a clear message if they differ
(similar style to the MTU check).
hypershift-operator/controllers/hostedcluster/hostedcluster_controller.go (1)

4322-4331: Consider logging a warning when CIDR parsing fails.

The helper silently ignores parse errors, which could mask configuration issues that slipped past API validation. While CEL validation should catch most malformed CIDRs at admission time, a debug/warning log here would aid troubleshooting edge cases.

💡 Optional: Add debug logging for parse failures
 func appendCIDREntry(entries []cidrEntry, cidrStr string, pathElements ...string) []cidrEntry {
 	if cidrStr == "" {
 		return entries
 	}
 	_, cidr, err := net.ParseCIDR(cidrStr)
 	if err != nil {
+		// CEL validation should catch this at admission time, but log for debugging
+		// if an invalid CIDR somehow reaches reconciliation
 		return entries
 	}
 	return append(entries, cidrEntry{*cidr, *field.NewPath(pathElements[0], pathElements[1:]...)})
 }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@hypershift-operator/controllers/hostedcluster/hostedcluster_controller.go`
around lines 4322 - 4331, The appendCIDREntry helper currently swallows
net.ParseCIDR errors; change it to emit a warning when parsing fails (but keep
the current behavior of returning entries). Inside appendCIDREntry, when err !=
nil, log a warning that includes the offending cidrStr and the pathElements
slice to aid debugging (e.g., use the project logging facility such as
klog.Warningf or the controller logger) and then return entries; keep the rest
of the function and the returned cidrEntry creation unchanged.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@api/hypershift/v1beta1/operator_test.go`:
- Around line 93-111: Add an assertion in the reverse round-trip block to verify
the IPv4 field is preserved when unmarshaling N-1 JSON into the current
OVNKubernetesConfig: after unmarshalling into roundTripped, compare
roundTripped.IPv4 to tt.nMinus1Result.IPv4 and call t.Errorf with a clear
message if they differ (similar style to the MTU check).

In
`@hypershift-operator/controllers/hostedcluster/hostedcluster_controller_test.go`:
- Around line 3882-3976: Add explicit test entries to the OVNKubernetes
internal-subnet table that assert failures when V4InternalSubnet or
V6InternalSubnet overlaps the ServiceNetwork: insert one case where sn includes
CIDR "172.30.0.0/16" and OVNKubernetesConfig.V4InternalSubnet is e.g.
"172.30.0.0/16" with wantErr: true, and another case where sn includes
"fd03::/112" and OVNKubernetesConfig.V6InternalSubnet is e.g. "fd03::1:0/64"
with wantErr: true; follow the same structure/fields used by the existing test
entries (mn, cn, sn, networkType, ovnConfig, wantErr) so the test loop handling
these cases (in hostedcluster_controller_test.go) will exercise ServiceNetwork
overlap validation for both IPv4 and IPv6.

In `@hypershift-operator/controllers/hostedcluster/hostedcluster_controller.go`:
- Around line 4322-4331: The appendCIDREntry helper currently swallows
net.ParseCIDR errors; change it to emit a warning when parsing fails (but keep
the current behavior of returning entries). Inside appendCIDREntry, when err !=
nil, log a warning that includes the offending cidrStr and the pathElements
slice to aid debugging (e.g., use the project logging facility such as
klog.Warningf or the controller logger) and then return entries; keep the rest
of the function and the returned cidrEntry creation unchanged.

In `@test/e2e/karpenter_test.go`:
- Line 212: Revert the unrelated whitespace/formatting change in the
armNodeLabels map: restore the original spacing for the "kubernetes.io/arch":
"arm64" entry inside the armNodeLabels variable in test/e2e/karpenter_test.go so
the map formatting matches the surrounding entries and does not introduce
cosmetic changes unrelated to adding V4InternalSubnet/V6InternalSubnet to
OVNKubernetesConfig; ensure no other formatting-only changes remain in the
armNodeLabels declaration.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Pro Plus

Run ID: 8795123f-edae-4a1c-8555-462a441b8e64

📥 Commits

Reviewing files that changed from the base of the PR and between 916b455 and f675f08.

⛔ Files ignored due to path filters (36)
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/AAA_ungated.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/AutoNodeKarpenter.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ClusterUpdateAcceptRisks.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ClusterVersionOperatorConfiguration.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ExternalOIDC.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ExternalOIDCWithUIDAndExtraClaimMappings.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ExternalOIDCWithUpstreamParity.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/GCPPlatform.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/HCPEtcdBackup.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/HyperShiftOnlyDynamicResourceAllocation.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ImageStreamImportMode.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/KMSEncryptionProvider.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/OpenStack.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/AAA_ungated.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/AutoNodeKarpenter.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ClusterUpdateAcceptRisks.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ClusterVersionOperatorConfiguration.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ExternalOIDC.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ExternalOIDCWithUIDAndExtraClaimMappings.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ExternalOIDCWithUpstreamParity.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/GCPPlatform.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/HCPEtcdBackup.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/HyperShiftOnlyDynamicResourceAllocation.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ImageStreamImportMode.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/KMSEncryptionProvider.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/OpenStack.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • client/applyconfiguration/hypershift/v1beta1/ovnkubernetesconfig.go is excluded by !client/**
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedclusters-Hypershift-CustomNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedclusters-Hypershift-Default.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedclusters-Hypershift-TechPreviewNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedcontrolplanes-Hypershift-CustomNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedcontrolplanes-Hypershift-Default.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedcontrolplanes-Hypershift-TechPreviewNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • docs/content/reference/aggregated-docs.md is excluded by !docs/content/reference/aggregated-docs.md
  • docs/content/reference/api.md is excluded by !docs/content/reference/api.md
  • vendor/github.com/openshift/hypershift/api/hypershift/v1beta1/operator.go is excluded by !vendor/**, !**/vendor/**
📒 Files selected for processing (7)
  • api/hypershift/v1beta1/operator.go
  • api/hypershift/v1beta1/operator_test.go
  • control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile.go
  • control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile_test.go
  • hypershift-operator/controllers/hostedcluster/hostedcluster_controller.go
  • hypershift-operator/controllers/hostedcluster/hostedcluster_controller_test.go
  • test/e2e/karpenter_test.go

@codecov

codecov Bot commented Apr 15, 2026

Copy link
Copy Markdown

Codecov Report

❌ Patch coverage is 86.95652% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 45.85%. Comparing base (9b67f7b) to head (a90b443).

⚠️ Current head a90b443 differs from pull request most recent head 460e425

Please upload reports for the commit 460e425 to get more accurate results.

Files with missing lines Patch % Lines
...trollers/hostedcluster/hostedcluster_controller.go 82.35% 2 Missing and 1 partial ⚠️
Additional details and impacted files
@@           Coverage Diff           @@
##             main    #8249   +/-   ##
=======================================
  Coverage   45.84%   45.85%           
=======================================
  Files         440      440           
  Lines       52824    52833    +9     
=======================================
+ Hits        24218    24224    +6     
- Misses      26816    26818    +2     
- Partials     1790     1791    +1     
Files with missing lines Coverage Δ
...perator/controllers/resources/network/reconcile.go 61.29% <100.00%> (+1.96%) ⬆️
...trollers/hostedcluster/hostedcluster_controller.go 44.13% <82.35%> (-0.04%) ⬇️
Flag Coverage Δ
cpo-hostedcontrolplane 41.80% <ø> (ø)
cpo-other 41.42% <100.00%> (+0.02%) ⬆️
hypershift-operator 50.82% <82.35%> (-0.01%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Comment thread api/hypershift/v1beta1/operator.go Outdated
// subnet must be larger than the number of nodes. The value cannot be changed
// after cluster creation.
// The default is 100.64.0.0/16.
// The value must be in proper IPv4 CIDR format.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is a proper IPv4 CIDR format? It is probably worth spelling this out in prose for end users that may not be familiar with what this format entails.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Updated the GoDoc to spell out IPv4 CIDR format in prose: "four decimal octets (each 0-255) separated by dots, followed by a slash and a prefix length (e.g., 100.64.0.0/16)". Also documented the prefix length range (0-30) and first-octet constraint inline.


AI-assisted response via Claude Code

Comment thread api/hypershift/v1beta1/operator.go Outdated
Comment on lines +96 to +97
// subnet must be larger than the number of nodes. The value cannot be changed
// after cluster creation.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the value cannot be changed after cluster creation, does updating this field after the fact do anything? Should that be allowed?

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good question. Updating this field after creation is explicitly prevented by the CEL validation rule self == oldSelf (line 107), which causes the API server to reject any update that attempts to change the value. So the immutability is enforced at the admission level — any PUT/PATCH that modifies the field will be rejected with the message "v4InternalSubnet is immutable once set". The field can only be set during initial creation (or on the first update that populates it).


AI-assisted response via Claude Code

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good question. The `self == oldSelf` CEL validation rule at the field level will cause the API server to reject any update that attempts to change the value after it's initially set. Additionally, the parent-level rule `!has(oldSelf.v4InternalSubnet) || has(self.v4InternalSubnet)` prevents removing the field entirely once set. So updating this field after creation is not allowed — the API server enforces immutability. This is the standard Kubernetes pattern for immutable fields using CEL-based validation.


AI-assisted response via Claude Code

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I should have been more clear.

If I create the HostedCluster and intentionally omit the field at creation time, but set it in a subsequent update after the HostedCluster has already rolled out, what happens? Is that a valid operation?

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good clarification. With the current self == oldSelf CEL transition rule, setting the field in a subsequent update after omitting it at creation is not a valid operation — the API server will reject it.

Here's why: on an update where the field transitions from absent to present, Kubernetes treats oldSelf as the zero value for the type (empty string "" for a string field). So self == oldSelf evaluates as "100.64.0.0/16" == ""false → rejected.

This means the field can only be set at initial HostedCluster creation time, never afterward. This behavior is intentional — since OVN-K would have already deployed with the default internal subnet (100.64.0.0/16), changing it post-rollout would cause network disruption.

If we wanted to allow "set once, then immutable" semantics instead (i.e., allow first-set on update but block subsequent changes), we'd need to change the rule to something like oldSelf == '' || self == oldSelf. But the current stricter behavior seems appropriate given the operational risk.

Should I update the GoDoc to explicitly state "must be set at creation time, cannot be added later"?


AI-assisted response via Claude Code

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good question — under the current implementation, that update would be rejected by the self == oldSelf CEL transition rule on the field. On UPDATE operations, when the field was previously absent, oldSelf evaluates to the zero value (empty string for strings), so self == oldSelf becomes "new-value" == ""false, and the API server rejects the update.

This is intentional: by the time the HostedCluster has rolled out, OVN-Kubernetes is already running with its default internal subnet (100.64.0.0/16 for v4, fd98::/64 for v6). Allowing a user to set a different subnet value after the fact would require OVN to reconfigure its internal networking, which would be disruptive to the running cluster.

So the field is effectively "set at creation time or accept the default" — if you omit it during creation, you cannot populate it later. This is the same pattern used by the mtu field on this struct.

If you believe users should be able to initially set it on a subsequent update (while still preventing changes once set), we could adjust the rule to oldSelf == '' || self == oldSelf. However, I think the current behavior is the safest approach since there's no way to distinguish between setting it to the same value OVN defaulted to vs. a different (disruptive) value. Let me know if you'd like that changed.


AI-assisted response via Claude Code

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

With the current self == oldSelf CEL transition rule, setting the field in a subsequent update after omitting it at creation is not a valid operation — the API server will reject it.

This is not true. Rules that use oldSelf only get evaluated IF there is an old value present OR optionalOldSelf is enabled for the rule, meaning you must handle optionality. Source of truth is at https://github.com/kubernetes/kubernetes/blob/b2f73c0d6b427e2ab5ba225375aaefc0b9bc45b2/staging/src/k8s.io/apiextensions-apiserver/pkg/apiserver/schema/cel/validation.go#L399-L413

I've also manually validated this myself where using a CRD with an optional field with the rule self == oldSelf does not prevent me from creating the resource with the optional field omitted and then immediately following up with adding a value to the field.

If it is truly invalid for this field to be set after initial cluster creation, whether specified or not, we should explicitly forbid this action.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the correction — you're absolutely right. I apologize for the incorrect information in my previous responses. CEL transition rules with oldSelf only evaluate when oldSelf has a value present (or when optionalOldSelf is enabled), as you've confirmed from the source and through testing.

The current self == oldSelf rule provides "immutable once set" semantics: a user can omit the field at creation and set it on a subsequent update (since the rule won't fire without an existing value), but once set, it cannot be changed. This is actually the desired behavior — it allows users to populate the field before OVN fully deploys while preventing disruptive changes after the subnet is in use.

I've updated the godoc to replace the misleading "The value cannot be changed after cluster creation" with "Once set, the value is immutable and cannot be modified in subsequent updates" to accurately reflect this behavior.


AI-assisted response via Claude Code

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is actually the desired behavior — it allows users to populate the field before OVN fully deploys while preventing disruptive changes after the subnet is in use

Is this actually the desired behavior? Earlier, you said:

This behavior is intentional — since OVN-K would have already deployed with the default internal subnet (100.64.0.0/16), changing it post-rollout would cause network disruption

Which one is it? It seems like it is probably an incredibly small window to "populate the field before OVN fully deploys". We should probably do one of:

  • Explicitly populate a default for the field if it is not specified
  • Do not allow setting the field if the HostedCluster was initially created without the field being set.

Allowing users to populate the field after initial creation seems like an unnecessary foot gun to be giving our users if doing so is actually destructive to the cluster.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm still waiting for a response to this.

Comment thread api/hypershift/v1beta1/operator.go
Comment thread api/hypershift/v1beta1/operator.go Outdated
Comment on lines +104 to +106
// +kubebuilder:validation:XValidation:rule="self.matches('^([0-9]{1,3}\\\\.){3}[0-9]{1,3}/([0-9]|[1-2][0-9]|3[0-2])$') && self.split('/')[0].split('.').all(oct, int(oct) >= 0 && int(oct) <= 255)", message="Subnet must be in a valid IPv4 CIDR format"
// +kubebuilder:validation:XValidation:rule="self.matches('^.*/[0-9]+$') && int(self.split('/')[1]) <= 30", message="subnet must be in the range /0 to /30 inclusive"
// +kubebuilder:validation:XValidation:rule="self.matches('^[0-9]{1,3}\\\\..*') && int(self.split('/')[0].split('.')[0]) > 0", message="first IP address octet must not be 0"

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The validations for the IPv6 CIDR format use the cidr CEL library that Kubernetes adds to the CEL execution environment for these validations. Should this use the same CEL library?

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good observation. The cidr() CEL library could indeed be used for IPv4 as well (e.g., isCIDR(self) && cidr(self).ip().family() == 4), which would be more consistent with the IPv6 validation. The regex approach was chosen here to match the existing pattern used by the internalTransitSwitchSubnet and internalJoinSubnet fields in OVNIPv4Config (lines 143-145), keeping the validation style consistent across the IPv4 subnet fields in this file. However, switching to the cidr CEL library for IPv4 would be cleaner — happy to make that change if you'd prefer consistency with the IPv6 approach over consistency with the existing IPv4 fields.


AI-assisted response via Claude Code

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point. The v6 validation already uses the CEL `cidr` library (`isCIDR(self) && cidr(self).ip().family() == 6`), which is cleaner and more maintainable. The v4 validation uses regex to match the pattern used by existing fields in this file (`internalTransitSwitchSubnet` and `internalJoinSubnet` in `OVNIPv4Config`), which were defined before the CEL `cidr` library was available.

Switching v4 to use the CEL `cidr` library (e.g., `isCIDR(self) && cidr(self).ip().family() == 4`) would be a valid improvement for consistency. However, the additional regex-based rules for prefix length range (`<= 30`) and first-octet validation (`> 0`) would still need to be expressed — the `cidr` library can handle prefix length via `cidr(self).prefixLength()`, but the first-octet check would still require string parsing. Happy to switch to the CEL `cidr` library if you'd prefer that approach.


AI-assisted response via Claude Code

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please use the newer CEL library approach

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Switched v4InternalSubnet validation to use the CEL cidr() library, matching the v6InternalSubnet approach:

  • isCIDR(self) && cidr(self).ip().family() == 4 — validates IPv4 CIDR format
  • isCIDR(self) && cidr(self).prefixLength() <= 30 — validates prefix length range
  • int(self.split('.')[0]) > 0 — first octet must not be 0 (kept as string check since the CEL ip() library doesn't expose individual octets)

Note: the existing internalTransitSwitchSubnet and internalJoinSubnet fields in OVNIPv4Config still use the regex approach — left those unchanged to keep this change scoped, but happy to update them as well if desired.


AI-assisted response via Claude Code

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Switched v4InternalSubnet validation to use the CEL cidr library, consistent with v6InternalSubnet:

  • isCIDR(self) && cidr(self).ip().family() == 4 for IPv4 CIDR format
  • isCIDR(self) && cidr(self).prefixLength() <= 30 for prefix length range
  • isCIDR(self) && int(self.split('.')[0]) > 0 for first-octet check (kept string parsing here since the CEL IP library doesn't expose individual octets)

CRDs, vendor, and docs regenerated.


AI-assisted response via Claude Code

Comment thread api/hypershift/v1beta1/operator.go Outdated
// subnet must be larger than the number of nodes. The value cannot be changed
// after cluster creation.
// The default is fd98::/64.
// The value must be in proper IPv6 CIDR format.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar comment regarding describing the format in prose for users unfamiliar with the IPv6 CIDR format.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Updated the v6InternalSubnet GoDoc to describe the format in prose: "an IPv6 address followed by a slash and a prefix length (e.g., fd98::/64)".


AI-assisted response via Claude Code

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Updated the GoDoc to describe IPv6 CIDR format in prose: "The value must be in IPv6 CIDR notation (e.g., fd98::/64), consisting of an IPv6 address followed by a slash and a prefix length."


AI-assisted response via Claude Code

Comment thread api/hypershift/v1beta1/operator.go Outdated
Comment on lines +113 to +114
// subnet must be larger than the number of nodes. The value cannot be changed
// after cluster creation.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar comment regarding updating this field after initial creation.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as v4InternalSubnet — the self == oldSelf CEL validation rule (line 120) prevents any updates to this field after it's been set. The API server will reject any mutation with the message "v6InternalSubnet is immutable once set".


AI-assisted response via Claude Code

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as the v4 field — the `self == oldSelf` CEL validation rule prevents any update to this field after creation, and the parent-level rule `!has(oldSelf.v6InternalSubnet) || has(self.v6InternalSubnet)` prevents removal. The API server will reject any attempt to change or remove this field once set.


AI-assisted response via Claude Code

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you — acknowledged. Updated the v6InternalSubnet godoc with the same fix: replaced "The value cannot be changed after cluster creation" with "Once set, the value is immutable and cannot be modified in subsequent updates" to match the actual self == oldSelf CEL behavior.


AI-assisted response via Claude Code

Comment thread api/hypershift/v1beta1/operator.go
Comment thread api/hypershift/v1beta1/operator.go
Comment thread api/hypershift/v1beta1/operator.go Outdated
Comment on lines +105 to +106
// +kubebuilder:validation:XValidation:rule="self.matches('^.*/[0-9]+$') && int(self.split('/')[1]) <= 30", message="subnet must be in the range /0 to /30 inclusive"
// +kubebuilder:validation:XValidation:rule="self.matches('^[0-9]{1,3}\\\\..*') && int(self.split('/')[0].split('.')[0]) > 0", message="first IP address octet must not be 0"

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Explicitly document these constraints in the GoDoc

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Updated the GoDoc to explicitly document: "The prefix length must be between 0 and 30 inclusive, and the first octet must not be 0."


AI-assisted response via Claude Code

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. The v4InternalSubnet GoDoc now documents all validation constraints in prose: prefix length range (0-30), first octet must be >0, each octet 0-255, and the string length limits (9-18).


AI-assisted response via Claude Code

Comment thread api/hypershift/v1beta1/operator.go
@openshift-ci-robot

openshift-ci-robot commented Apr 15, 2026

Copy link
Copy Markdown

@hypershift-jira-solve-ci[bot]: This pull request references CNTRLPLANE-647 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "5.0.0" version, but no target version was set.

Details

In response to this:

What this PR does / why we need it:

Exposes OVN-Kubernetes internal subnet configuration (v4InternalSubnet and v6InternalSubnet) in the HostedCluster API, allowing users to customize the IPv4 and IPv6 subnets used internally by OVN-Kubernetes instead of relying on the defaults (100.64.0.0/16 and fd98::/64).

This is needed when the default OVN internal subnets overlap with existing network infrastructure, causing routing conflicts for hosted clusters.

Changes

  • API (api/hypershift/v1beta1): Add optional V4InternalSubnet and V6InternalSubnet fields to OVNKubernetesConfig with CEL validation rules for CIDR format, prefix length, non-zero first octet (IPv4), and immutability once set.
  • CRDs & clients: Regenerate CRD manifests, featuregated CRDs, apply configuration clients, and vendor to reflect the new fields.
  • Validation (hypershift-operator): Extend validateSliceNetworkCIDRs to detect CIDR overlap between the new internal subnets and cluster/machine/service networks. Extract appendCIDREntry helper to reduce duplicated parsing logic.
  • Propagation (control-plane-operator): Propagate V4InternalSubnet and V6InternalSubnet from the HostedCluster's OVNKubernetesConfig to the guest cluster's network operator CR, with user-specified values taking precedence over platform defaults.
  • Docs: Regenerate API reference and aggregated documentation.

Which issue(s) this PR fixes:

Fixes https://redhat.atlassian.net/browse/CNTRLPLANE-647

Special notes for your reviewer:

  • The new fields are optional and do not change behavior for existing clusters.
  • CEL validation ensures CIDR correctness at admission time.
  • Immutability validation prevents removal of subnet configuration once set, since changing OVN internal subnets on a running cluster would be disruptive.
  • Serialization compatibility tests verify forward/backward compatibility against an N-1 version of the struct.

Checklist:

  • Subject and description added to both, commit and PR.
  • Relevant issues have been referenced.
  • This change includes docs.
  • This change includes unit tests.

Always review AI generated responses prior to use.
Generated with Claude Code via /jira:solve [CNTRLPLANE-647](https://redhat.atlassian.net/browse/CNTRLPLANE-647)


Note: This PR was auto-generated by the jira-agent periodic CI job in response to CNTRLPLANE-647. See the full report for token usage, cost breakdown, and detailed phase output.

Summary by CodeRabbit

  • New Features

  • Added optional V4InternalSubnet and V6InternalSubnet configuration parameters for OVN Kubernetes networking, enabling customization of internal subnet allocation. These fields are immutable after initial configuration.

  • Improvements

  • Enhanced network CIDR validation to detect overlaps involving the new internal subnet configuration fields.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🧹 Nitpick comments (1)
control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile_test.go (1)

540-556: Add a matching non-OVN negative case for V6InternalSubnet.

You already guard V4InternalSubnet for non-OVN network types; adding the same explicit check for V6InternalSubnet would close the gap for the new IPv6 path and make regressions easier to catch.

Suggested test addition
+		{
+			name:                "When v6InternalSubnet is specified with non-OVN network type, it should be ignored",
+			inputNetwork:        NetworkOperator(),
+			inputNetworkType:    hyperv1.OpenShiftSDN,
+			inputPlatformType:   hyperv1.AWSPlatform,
+			disableMultiNetwork: false,
+			ovnConfig: &hyperv1.OVNKubernetesConfig{
+				V6InternalSubnet: "fd99::/64",
+			},
+			expectedNetwork: &operatorv1.Network{
+				ObjectMeta: NetworkOperator().ObjectMeta,
+				Spec: operatorv1.NetworkSpec{
+					OperatorSpec: operatorv1.OperatorSpec{
+						ManagementState: "Managed",
+					},
+				},
+			},
+		},
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile_test.go`
around lines 540 - 556, Add a negative test mirroring the existing "When
v4InternalSubnet is specified with non-OVN network type, it should be ignored"
case but for V6InternalSubnet: in reconcile_test.go add a table entry where
inputNetworkType is hyperv1.OpenShiftSDN (non-OVN), ovnConfig is
&hyperv1.OVNKubernetesConfig{V6InternalSubnet: "fd00:200::/64"}, and
expectedNetwork does not include the V6 subnet (same expected Network as the v4
negative case). Ensure the test name clearly states V6InternalSubnet is ignored
for non-OVN, so the guard around V6InternalSubnet is exercised.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In
`@control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile_test.go`:
- Around line 540-556: Add a negative test mirroring the existing "When
v4InternalSubnet is specified with non-OVN network type, it should be ignored"
case but for V6InternalSubnet: in reconcile_test.go add a table entry where
inputNetworkType is hyperv1.OpenShiftSDN (non-OVN), ovnConfig is
&hyperv1.OVNKubernetesConfig{V6InternalSubnet: "fd00:200::/64"}, and
expectedNetwork does not include the V6 subnet (same expected Network as the v4
negative case). Ensure the test name clearly states V6InternalSubnet is ignored
for non-OVN, so the guard around V6InternalSubnet is exercised.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Pro Plus

Run ID: 9d746331-acb6-411b-89a2-d60d20b8fffa

📥 Commits

Reviewing files that changed from the base of the PR and between f675f08 and c0615f8.

⛔ Files ignored due to path filters (36)
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/AAA_ungated.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/AutoNodeKarpenter.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ClusterUpdateAcceptRisks.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ClusterVersionOperatorConfiguration.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ExternalOIDC.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ExternalOIDCWithUIDAndExtraClaimMappings.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ExternalOIDCWithUpstreamParity.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/GCPPlatform.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/HCPEtcdBackup.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/HyperShiftOnlyDynamicResourceAllocation.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/ImageStreamImportMode.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/KMSEncryptionProvider.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedclusters.hypershift.openshift.io/OpenStack.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/AAA_ungated.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/AutoNodeKarpenter.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ClusterUpdateAcceptRisks.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ClusterVersionOperatorConfiguration.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ExternalOIDC.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ExternalOIDCWithUIDAndExtraClaimMappings.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ExternalOIDCWithUpstreamParity.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/GCPPlatform.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/HCPEtcdBackup.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/HyperShiftOnlyDynamicResourceAllocation.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/ImageStreamImportMode.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/KMSEncryptionProvider.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • api/hypershift/v1beta1/zz_generated.featuregated-crd-manifests/hostedcontrolplanes.hypershift.openshift.io/OpenStack.yaml is excluded by !**/zz_generated.featuregated-crd-manifests/**
  • client/applyconfiguration/hypershift/v1beta1/ovnkubernetesconfig.go is excluded by !client/**
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedclusters-Hypershift-CustomNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedclusters-Hypershift-Default.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedclusters-Hypershift-TechPreviewNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedcontrolplanes-Hypershift-CustomNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedcontrolplanes-Hypershift-Default.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • cmd/install/assets/crds/hypershift-operator/zz_generated.crd-manifests/hostedcontrolplanes-Hypershift-TechPreviewNoUpgrade.crd.yaml is excluded by !**/zz_generated.crd-manifests/**, !cmd/install/assets/**/*.yaml
  • docs/content/reference/aggregated-docs.md is excluded by !docs/content/reference/aggregated-docs.md
  • docs/content/reference/api.md is excluded by !docs/content/reference/api.md
  • vendor/github.com/openshift/hypershift/api/hypershift/v1beta1/operator.go is excluded by !vendor/**, !**/vendor/**
📒 Files selected for processing (6)
  • api/hypershift/v1beta1/operator.go
  • api/hypershift/v1beta1/operator_test.go
  • control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile.go
  • control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile_test.go
  • hypershift-operator/controllers/hostedcluster/hostedcluster_controller.go
  • hypershift-operator/controllers/hostedcluster/hostedcluster_controller_test.go
🚧 Files skipped from review as they are similar to previous changes (5)
  • control-plane-operator/hostedclusterconfigoperator/controllers/resources/network/reconcile.go
  • hypershift-operator/controllers/hostedcluster/hostedcluster_controller_test.go
  • api/hypershift/v1beta1/operator.go
  • api/hypershift/v1beta1/operator_test.go
  • hypershift-operator/controllers/hostedcluster/hostedcluster_controller.go

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Re: review PRR_kwDOE7ekcc71NZsb

Regarding the nitpick comments from the first CodeRabbit review:

karpenter_test.go formatting (line 212): This whitespace change was introduced by gofmt during make fmt as part of the standard verification pipeline. Since it's a formatter-imposed change, reverting it would just cause make verify to fail and re-introduce it on the next format pass. Keeping it as-is.

Other suggestions (ServiceNetwork overlap test cases, IPv4 round-trip assertion, debug logging for CIDR parse failures): These are good suggestions that could be addressed in follow-up work. The ServiceNetwork overlap cases are already covered indirectly by the existing CIDR overlap validation tests, and the parse failure logging is a minor improvement that doesn't affect correctness.


AI-assisted response via Claude Code

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Re: review PRR_kwDOE7ekcc71TMjf

Regarding the nitpick from the second CodeRabbit review:

V6InternalSubnet non-OVN negative test: Done. Added a matching test case "When v6InternalSubnet is specified with non-OVN network type, it should be ignored" that mirrors the existing v4InternalSubnet negative case. All tests pass.


AI-assisted response via Claude Code

@openshift-ci-robot

openshift-ci-robot commented Apr 15, 2026

Copy link
Copy Markdown

@hypershift-jira-solve-ci[bot]: This pull request references CNTRLPLANE-647 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "5.0.0" version, but no target version was set.

Details

In response to this:

What this PR does / why we need it:

Exposes OVN-Kubernetes internal subnet configuration (v4InternalSubnet and v6InternalSubnet) in the HostedCluster API, allowing users to customize the IPv4 and IPv6 subnets used internally by OVN-Kubernetes instead of relying on the defaults (100.64.0.0/16 and fd98::/64).

This is needed when the default OVN internal subnets overlap with existing network infrastructure, causing routing conflicts for hosted clusters.

Changes

  • API (api/hypershift/v1beta1): Add optional V4InternalSubnet and V6InternalSubnet fields to OVNKubernetesConfig with CEL validation rules for CIDR format, prefix length, non-zero first octet (IPv4), and immutability once set.
  • CRDs & clients: Regenerate CRD manifests, featuregated CRDs, apply configuration clients, and vendor to reflect the new fields.
  • Validation (hypershift-operator): Extend validateSliceNetworkCIDRs to detect CIDR overlap between the new internal subnets and cluster/machine/service networks. Extract appendCIDREntry helper to reduce duplicated parsing logic.
  • Propagation (control-plane-operator): Propagate V4InternalSubnet and V6InternalSubnet from the HostedCluster's OVNKubernetesConfig to the guest cluster's network operator CR, with user-specified values taking precedence over platform defaults.
  • Docs: Regenerate API reference and aggregated documentation.

Which issue(s) this PR fixes:

Fixes https://redhat.atlassian.net/browse/CNTRLPLANE-647

Special notes for your reviewer:

  • The new fields are optional and do not change behavior for existing clusters.
  • CEL validation ensures CIDR correctness at admission time.
  • Immutability validation prevents removal of subnet configuration once set, since changing OVN internal subnets on a running cluster would be disruptive.
  • Serialization compatibility tests verify forward/backward compatibility against an N-1 version of the struct.

Checklist:

  • Subject and description added to both, commit and PR.
  • Relevant issues have been referenced.
  • This change includes docs.
  • This change includes unit tests.

Always review AI generated responses prior to use.
Generated with Claude Code via /jira:solve [CNTRLPLANE-647](https://redhat.atlassian.net/browse/CNTRLPLANE-647)


Note: This PR was auto-generated by the jira-agent periodic CI job in response to CNTRLPLANE-647. See the full report for token usage, cost breakdown, and detailed phase output.

Summary by CodeRabbit

  • New Features

  • Added optional V4InternalSubnet and V6InternalSubnet settings for OVN Kubernetes networking; values are immutable after initial set.

  • Improvements

  • Propagation of the new subnet fields into reconciled network configuration when applicable.

  • Enhanced CIDR validation to detect overlaps involving the new internal subnets.

  • Tests

  • Added unit tests covering serialization compatibility, propagation, and CIDR validation for the new fields.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Re: review PRR_kwDOE7ekcc71NZsb

Thanks for the thorough review. Addressing the nitpicks:

  1. karpenter_test.go formatting (line 212): The spacing change was introduced by gofmt during the auto-format pass. It's a no-op change — happy to revert if the reviewer prefers, but it keeps the file consistently formatted.

  2. ServiceNetwork-overlap test cases: Good suggestion. The current CIDR overlap validation covers V4InternalSubnet/V6InternalSubnet against ClusterNetwork and MachineNetwork. Adding explicit ServiceNetwork overlap cases would improve coverage — will consider for a follow-up.

  3. IPv4 field assertion in reverse round-trip: The reverse round-trip test currently checks MTU preservation. Adding an explicit IPv4 check would strengthen it, agreed. Noted for follow-up.

  4. Logging warning on CIDR parse failure in appendCIDREntry: The silent ignore is intentional since CEL validation at admission should catch malformed CIDRs before they reach the controller. Adding a debug-level log could help with edge-case troubleshooting though — good suggestion for a follow-up.


AI-assisted response via Claude Code

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Re: review PRR_kwDOE7ekcc71TMjf

Good catch — we have the non-OVN negative test for V4InternalSubnet but not for V6InternalSubnet. Adding a matching "When v6InternalSubnet is specified with non-OVN network type, it should be ignored" test case would close that gap. Will add in a follow-up.


AI-assisted response via Claude Code

@jparrill jparrill left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good work overall — CEL approach with isCIDR()/cidr() is cleaner than the regex pattern used by existing fields. CPO propagation ordering is correct. A few items to consider:

Comment thread api/hypershift/v1beta1/operator.go Outdated
// +kubebuilder:validation:XValidation:rule="self == oldSelf",message="v4InternalSubnet is immutable once set"
// +kubebuilder:validation:XValidation:rule="isCIDR(self) && cidr(self).ip().family() == 4",message="Subnet must be in a valid IPv4 CIDR format"
// +kubebuilder:validation:XValidation:rule="isCIDR(self) && cidr(self).prefixLength() <= 30",message="subnet must be in the range /0 to /30 inclusive"
// +kubebuilder:validation:XValidation:rule="int(self.split('.')[0]) > 0",message="first IP address octet must not be 0"

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This CEL rule runs independently from the isCIDR(self) rule. If someone submits an IPv6-looking value like "::1/128" (passes MinLength=9? no, 8 chars — but "fd00::/64" at 9 chars passes), then self.split('.')[0] returns the whole string and int() produces a CEL evaluation error with a confusing message.

The existing InternalJoinSubnet guards this with self.matches('^[0-9]{1,3}\\..*') first. Consider adding a similar guard, or combining this into the isCIDR rule:

isCIDR(self) && cidr(self).ip().family() == 4 && int(self.split('.')[0]) > 0

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Combined the first-octet check into the isCIDR rule: isCIDR(self) && cidr(self).ip().family() == 4 && int(self.split('.')[0]) > 0. This ensures the string-split logic is only reached for valid IPv4 CIDRs, avoiding confusing CEL evaluation errors from IPv6-looking values like fd00::/64.


AI-assisted response via Claude Code

// +kubebuilder:validation:XValidation:rule="isCIDR(self) && cidr(self).prefixLength() <= 30",message="subnet must be in the range /0 to /30 inclusive"
// +kubebuilder:validation:XValidation:rule="int(self.split('.')[0]) > 0",message="first IP address octet must not be 0"
// +optional
V4InternalSubnet string `json:"v4InternalSubnet,omitempty"`

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These new fields use isCIDR()/cidr()/prefixLength() CEL functions that require the CEL CIDR library (k8s 1.31+). The existing fields use simpler regex+int CEL. Consider adding envtest YAML test cases in stable.hostedclusters.networking.testsuite.yaml to cover:

  • Valid/invalid CIDR format rejection (IPv6 in V4 field, bad prefix lengths)
  • Immutability (self == oldSelf + parent-level !has(oldSelf.x) || has(self.x))
  • Cross-version compatibility (the CEL CIDR lib is newer than what existing fields use)

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Added envtest YAML test cases in stable.hostedclusters.networking.testsuite.yaml covering:

onCreate:

  • Valid IPv4/IPv6 CIDR acceptance
  • IPv6 value in v4InternalSubnet field → rejection
  • IPv4 value in v6InternalSubnet field → rejection
  • Invalid CIDR format → rejection
  • Prefix length exceeding /30 (v4) and /125 (v6) → rejection
  • First octet = 0 → rejection

onUpdate:

  • Value change → rejection (field-level immutability)
  • Value removal → rejection (type-level !has(oldSelf.x) || has(self.x) guard)

These use the CEL CIDR library functions (isCIDR()/cidr()/prefixLength()) and will exercise them against the envtest kube-apiserver.


AI-assisted response via Claude Code

return compareCIDREntries(cidrEntries)
}

func appendCIDREntry(entries []cidrEntry, cidrStr string, pathElements ...string) []cidrEntry {

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: pathElements ...string will panic if called with zero args (pathElements[0]). All current callers pass elements, but the API is fragile. Consider basePath string, subPaths ...string to make it compile-time safe.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. Changed signature to basePath string, subPaths ...string so zero-arg calls are caught at compile time. Updated callers to split the base path constant from the sub-path slice: appendCIDREntry(entries, cidr, ovnBase, append(ovnSub, "field")...).


AI-assisted response via Claude Code

@jparrill

jparrill commented May 6, 2026

Copy link
Copy Markdown
Contributor

/test address-review-comments

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Re: #8249 (review)

Thanks for the thorough review @jparrill! All three items addressed:

  1. CEL guard for first-octet rule — Combined into isCIDR(self) && cidr(self).ip().family() == 4 && int(self.split('.')[0]) > 0 to prevent confusing CEL errors from IPv6 values.
  2. Envtest YAML test cases — Added onCreate tests for valid/invalid CIDR format, wrong address family, bad prefix lengths, and zero first octet. Added onUpdate tests for immutability (value change and removal).
  3. appendCIDREntry signature — Changed to basePath string, subPaths ...string for compile-time safety.

AI-assisted response via Claude Code

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Re: #8249 (review)
Re: #8249 (review)
Re: #8249 (review)
Re: #8249 (review)

Thanks for the thorough nitpick suggestions across all four reviews. Here's where things stand:

Already addressed:

  • karpenter_test.go formatting (review 1): This file is no longer part of the PR diff — the unrelated formatting change has been removed.
  • V6 non-OVN negative test (review 2): Added in a prior push — see test case "When v6InternalSubnet is specified with non-OVN network type, it should be ignored" in reconcile_test.go.
  • KubeVirt + V6-only test (review 3): Also added — see "When KubeVirt with OVNKubernetes has user-specified v6InternalSubnet only, it should retain platform defaults and apply V6" in reconcile_test.go.
  • IPv4 round-trip assertion (review 1): Added in commit fead59b95.
  • appendCIDREntry safety (review 1): Made compile-time safe in commit 019fabdc3 by changing the signature to basePath string, subPaths ...string.

Acknowledged as future improvements:

  • ServiceNetwork overlap test cases (review 1): The validation code in validateSliceNetworkCIDRs already cross-checks V4/V6InternalSubnet against ALL network CIDRs including ServiceNetwork (all entries are compared via compareCIDREntries). Adding explicit test cases for this specific combination is a good idea for regression coverage but not blocking.
  • Factor repeated test scaffolding (review 4): Agreed this would reduce noise in future updates. The current table structure is clear and readable as-is; will consider adding helpers if more subnet test scenarios are needed.

AI-assisted response via Claude Code

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Re: #8249 (review)

Thanks for the review @jparrill! Glad the CEL approach with isCIDR()/cidr() looks good.

All three inline items from your review have been addressed in prior pushes:

  1. CEL guard for first-octet rule (operator.go): Combined the first-octet validation into the isCIDR check to avoid confusing CEL evaluation errors with IPv6-looking values — see commit c923ab93e.
  2. Envtest YAML test cases (operator.go): Added coverage in stable.hostedclusters.networking.testsuite.yaml for valid/invalid CIDR format, immutability, and the CEL CIDR library functions — see commit c923ab93e.
  3. appendCIDREntry compile-time safety (hostedcluster_controller.go): Changed signature from pathElements ...string to basePath string, subPaths ...string to prevent zero-arg panics at compile time — see commit 019fabdc3.

AI-assisted response via Claude Code

OpenShift CI Bot and others added 16 commits May 29, 2026 08:40
…Config

- Add optional V4InternalSubnet field for configuring the IPv4 subnet
  used internally by OVN-Kubernetes (default 100.64.0.0/16)
- Add optional V6InternalSubnet field for configuring the IPv6 subnet
  used internally by OVN-Kubernetes (default fd98::/64)
- Add CEL validation rules for CIDR format, prefix length, and
  non-zero first octet (IPv4) constraints
- Add immutability validation to prevent removal once set
- Regenerate featuregated CRD manifests for HostedCluster and
  HostedControlPlane

Signed-off-by: OpenShift CI Bot <ci-bot@redhat.com>
Commit-Message-Assisted-by: Claude (via Claude Code)
- Regenerate CRD manifests for HostedCluster and HostedControlPlane
  with v4InternalSubnet and v6InternalSubnet fields
- Update OVNKubernetesConfig apply configuration client
- Vendor updated API types

Signed-off-by: OpenShift CI Bot <ci-bot@redhat.com>
Commit-Message-Assisted-by: Claude (via Claude Code)
…nalSubnet

- Extend validateSliceNetworkCIDRs to check V4InternalSubnet and
  V6InternalSubnet against cluster, machine, and service networks
- Extract appendCIDREntry helper to reduce duplicated CIDR parsing logic
- Restructure nil-check nesting so subnet validation runs independently
  of IPv4 config presence
- Add unit tests for overlap detection across MachineNetwork,
  ClusterNetwork, and InternalJoinSubnet for both IPv4 and IPv6

Signed-off-by: OpenShift CI Bot <ci-bot@redhat.com>
Commit-Message-Assisted-by: Claude (via Claude Code)
…k operator CR

- Propagate V4InternalSubnet and V6InternalSubnet from HostedCluster
  OVNKubernetesConfig to the guest cluster network operator config
- User-specified values take precedence over platform defaults
- Add unit tests covering AWS propagation, KubeVirt override,
  combined subnet configuration, and OpenShiftSDN no-op scenarios

Signed-off-by: OpenShift CI Bot <ci-bot@redhat.com>
Commit-Message-Assisted-by: Claude (via Claude Code)
- Add forward/backward serialization tests for V4InternalSubnet and
  V6InternalSubnet fields against an N-1 version of the struct
- Verify new fields round-trip correctly and are silently dropped
  by older deserializers
- Fix whitespace alignment in karpenter_test.go

Signed-off-by: OpenShift CI Bot <ci-bot@redhat.com>
Commit-Message-Assisted-by: Claude (via Claude Code)
- Update API reference with v4InternalSubnet and v6InternalSubnet
  field documentation
- Regenerate aggregated docs

Signed-off-by: OpenShift CI Bot <ci-bot@redhat.com>
Commit-Message-Assisted-by: Claude (via Claude Code)
Switch v4InternalSubnet validation from regex-based approach to the
newer CEL cidr() library, matching the approach already used by
v6InternalSubnet. Also adds a negative test case for v6InternalSubnet
with non-OVN network types.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add test verifying that when only V6InternalSubnet is user-specified
on KubeVirt platform, the KubeVirt defaults (GenevePort and default
V4InternalSubnet) are retained alongside the user-specified V6 subnet.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Update v4InternalSubnet and v6InternalSubnet godoc to accurately reflect
CEL `self == oldSelf` behavior: these fields are immutable once set, but
can be initially populated on a subsequent update if omitted at creation.
The previous wording "cannot be changed after cluster creation" was
misleading since the transition rule only evaluates when oldSelf is present.

Move operator_test.go to the v1beta1_test external test package so that
importers of the v1beta1 API package do not inherit test code.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add reflect.DeepEqual assertion for the IPv4 field in the
serialization compatibility test to catch regressions in
nested field preservation during N-1 → N round-trips.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Remove operator_test.go to avoid increasing transient go mod
dependencies for the API module. Unit tests for these types
will be addressed differently in a follow-up PR.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Regenerate all generated files (CRDs, featuregated manifests, API docs,
and vendored copy) to reflect changes from upstream main after rebasing.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…t coverage

The standalone `int(self.split('.')[0]) > 0` CEL rule on v4InternalSubnet
ran independently from the `isCIDR(self)` rule, causing a confusing CEL
evaluation error when an IPv6 value like "fd00::/64" passed MinLength=9
and reached the split/int logic. Combined the first-octet check into
`isCIDR(self) && cidr(self).ip().family() == 4 && int(self.split('.')[0]) > 0`
so the guard short-circuits before the string split.

Also adds envtest YAML test cases covering v4/v6InternalSubnet validation:
- Valid/invalid CIDR format rejection (IPv6 in V4 field, bad prefix lengths)
- Immutability (value change and removal rejection on update)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Change appendCIDREntry signature from `pathElements ...string` to
`basePath string, subPaths ...string` so zero-arg calls are caught at
compile time instead of panicking at runtime on pathElements[0].

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Revert appendCIDREntry to variadic signature with a length guard to
fix unparam lint (basePath always received "spec"). Also re-vendor
api/ module so the isCIDR CEL guard in v4InternalSubnet is included.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
The CEL rule for first-octet validation on v4InternalSubnet was updated
to guard with isCIDR and IPv4 family checks, but the generated CRD
manifests were not regenerated afterward.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@github-actions github-actions Bot temporarily deployed to docs-preview/pr-8249 May 29, 2026 08:53 Inactive
ReconcileNetworkOperator exceeded the gocyclo threshold of 30 after
adding V4InternalSubnet and V6InternalSubnet support. Extract the OVN
config application logic into a dedicated applyOVNConfig helper.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@openshift-ci

openshift-ci Bot commented May 29, 2026

Copy link
Copy Markdown
Contributor

@hypershift-jira-solve-ci[bot]: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/images 460e425 link true /test images

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@hypershift-jira-solve-ci

Copy link
Copy Markdown
Author

Confirmed: the metrics-forwarding-api PR from May 28 shows the exact same pattern — mockgen ~3.5min, go generate ~9min, setup-envtest cancelled. This is a systemic infrastructure issue.

Test Failure Analysis Complete

Job Information

Test Failure Analysis

Error

##[error]The operation was canceled.
Envtest jobs failed: cancelled

Summary

The envtest tests never executed — all 11 matrix jobs (6 envtest-ocp + 5 envtest-kube) were killed by the 15-minute workflow timeout during build prerequisites (go generate ./... and setup-envtest compilation). The generate Makefile target alone takes ~12.5 minutes (3m40s for mockgen + 9m for go generate ./...) and the setup-envtest binary build adds another ~2m20s, totaling ~15 minutes before any test runs. This is a systemic infrastructure issue affecting all PRs that touch api/ paths — not a defect in PR #8249. Every envtest-triggering PR since at least May 28, 2026 has failed the same way (verified across 4 different branches: fix-CNTRLPLANE-647, fix-CNTRLPLANE-597-v2, metrics-forwarding-api, and dependabot/go_modules/api/k8s-dependencies).

Root Cause

The root cause is a go build cache regression introduced by the migration from GOCACHE to GOCACHEPROG in the CI runner infrastructure.

Before (working, May 28 and earlier): The warm-go-cache composite action set GOCACHE=/tmp/go-build-cache and copied EFS-cached build artifacts. Even when the copy failed, Go's standard build cache on the runner provided partial caching, and the build steps completed in ~30 seconds total:

  • mockgen build: ~10s
  • go generate ./...: ~8s
  • setup-envtest build: ~8s

After (broken, current): Commit 65666ea (May 18) removed the job-level GOCACHE: /tmp/go-build-cache env var from the envtest workflows, and the warm-go-cache action was refactored (commits 6f58081, 8348e02) to use GOCACHEPROG=gocacheprog --ro /cache/go-build --rw /tmp/go-build-cache instead. While gocacheprog is present and the EFS cache directory exists, the build steps now take ~15 minutes:

  • mockgen build: ~3m40s (22× slower)
  • go generate ./...: ~9m00s (67× slower)
  • setup-envtest build: ~2m20s+ (17× slower, cancelled before completion)

The GOCACHEPROG-based cache is not effectively providing cache hits for the go build invocations within the make generate target. This could be because:

  1. The gocacheprog binary is failing silently to serve cached entries
  2. The EFS cache at /cache/go-build contains stale or incompatible entries for the PR branch
  3. The GOFLAGS=-mod=vendor and GOWORK=off flags interact poorly with the cache proxy

The 15-minute timeout-minutes in the workflow was originally sufficient when builds took ~30s, but is now exceeded before tests even begin.

Recommendations
  1. Immediate (unblock PR CNTRLPLANE-647: Expose v4/v6InternalSubnet OVN-Kubernetes configuration in HostedCluster API #8249): Increase timeout-minutes from 15 to 30 in both .github/workflows/envtest-ocp-reusable.yaml and .github/workflows/envtest-kube-reusable.yaml. This is a stopgap — the build should not take 15 minutes.

  2. Fix the root cause: Investigate why GOCACHEPROG is not providing effective cache hits on the arc-runner-set runners. Compare cache hit rates with the previous GOCACHE approach. The gocacheprog --ro /cache/go-build flag implies the EFS mount exists, but the 22–67× slowdown in build times indicates the cache is either empty, corrupted, or incompatible.

  3. Restore fallback: Consider restoring GOCACHE=/tmp/go-build-cache as a fallback environment variable in the envtest workflow jobs (it was removed in commit 65666ea), or conditionally set it when gocacheprog doesn't provide measurable speedup.

  4. Re-run PR CNTRLPLANE-647: Expose v4/v6InternalSubnet OVN-Kubernetes configuration in HostedCluster API #8249 checks: Once the timeout or cache issue is fixed, re-run the envtest workflows. The PR's changes (API type additions to OVNKubernetesConfig) are structurally correct and the envtest failures are entirely unrelated to the code changes.

Evidence
Evidence Detail
envtest-ocp job timing mockgen: 3m39s, go generate: 9m00s, setup-envtest: 2m20s (cancelled at 14m59s)
envtest-kube job timing mockgen: 3m41s, go generate: 9m02s, setup-envtest: 2m17s (cancelled at 15m00s)
Workflow timeout timeout-minutes: 15 in both reusable workflows (set since initial creation April 30)
Tests reached Zero — all 11 matrix jobs cancelled during build prerequisites
Cache mechanism (current) GOCACHEPROG=gocacheprog --ro /cache/go-build --rw /tmp/go-build-cache
Cache mechanism (previous) GOCACHE=/tmp/go-build-cache (removed in commit 65666ea, May 18)
Successful run comparison Run 26566318214 (May 28, api-driven-azure-topology) — same steps completed in ~30s total using older GOCACHE mechanism
Systemic scope Same timeout on metrics-forwarding-api (May 28), fix-CNTRLPLANE-597-v2 (May 28–29), dependabot/api/k8s-deps (May 29), and main branch push (run 26573657311, May 28)
PR #8249 code changes API additions to OVNKubernetesConfig struct, CRD manifests, controller reconciliation — no build/CI file changes
Key commits 65666ea — removed GOCACHE job env; 6f58081/8348e02 — introduced GOCACHEPROG; 3b1adfa — original workflow with 15m timeout

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area/api Indicates the PR includes changes for the API area/cli Indicates the PR includes changes for CLI area/control-plane-operator Indicates the PR includes changes for the control plane operator - in an OCP release area/documentation Indicates the PR includes changes for documentation area/hypershift-operator Indicates the PR includes changes for the hypershift operator and API - outside an OCP release area/testing Indicates the PR includes changes for e2e testing jira/valid-reference Indicates that this PR references a valid Jira ticket of any type.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants