NO-ISSUE: OTA-1605 Automate OCP-42543 by JianLi-RH · Pull Request #1309 · openshift/cluster-version-operator

JianLi-RH · 2026-01-30T10:02:51Z

Test case: https://polarion.engineering.redhat.com/polarion/#/project/OSE/workitem?id=OCP-42543

Test it locally:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true"
  Running Suite:  - /home/jianl/1_code/cluster-version-operator
  =============================================================
  Random Seed: 1769770387 - will randomize all specs

  Will run 1 of 1 specs
  ------------------------------
  [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
  /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:90
    STEP: Setup ocapi.OC @ 01/30/26 18:53:07.673
  cluster-version-operator-tests "level"=0 "msg"="will use the environment timeout variable to run command: 90s"
  cluster-version-operator-tests "level"=0 "msg"="timeout is: 1m30s"
    STEP: Extract manifests @ 01/30/26 18:53:07.674
  cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest"
  cluster-version-operator-tests "level"=0 "msg"="the output directory does not exist, will create it: /tmp/OTA-42543-manifest"
  cluster-version-operator-tests "level"=0 "msg"="the output directory has been created: /tmp/OTA-42543-manifest"
  cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest"
    STEP: Start to iterate all manifests @ 01/30/26 18:53:59.458
  cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get Deployment cluster-baremetal-operator-hostedcluster -n openshift-machine-api" "output"="Error from server (NotFound): deployments.apps \"cluster-baremetal-operator-hostedcluster\" not found\n"
  cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get Service controller-manager-service -n openshift-cloud-credential-operator" "output"="Error from server (NotFound): services \"controller-manager-service\" not found\n"
  cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get ClusterRoleBinding default-account-cluster-network-operator" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-cluster-network-operator\" not found\n"
  cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get PrometheusRule authentication-operator -n openshift-authentication-operator" "output"="Error from server (NotFound): prometheusrules.monitoring.coreos.com \"authentication-operator\" not found\n"
  cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get ClusterRoleBinding default-account-openshift-machine-config-operator" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\n"
  cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get CronJob machine-config-nodes-crd-cleanup -n openshift-machine-config-operator" "output"="Error from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\n"
  • [60.920 seconds]
  ------------------------------

  Ran 1 of 1 Specs in 60.921 seconds
  SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 0 Skipped
[
  {
    "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true",
    "lifecycle": "blocking",
    "duration": 60920,
    "startTime": "2026-01-30 10:53:07.670976 UTC",
    "endTime": "2026-01-30 10:54:08.591888 UTC",
    "result": "passed",
    "output": "  STEP: Setup ocapi.OC @ 01/30/26 18:53:07.673\n  STEP: Extract manifests @ 01/30/26 18:53:07.674\n  STEP: Start to iterate all manifests @ 01/30/26 18:53:59.458\n"
  }
]
[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

/cc @hongkailiu @DavidHurta

openshift-ci-robot · 2026-01-30T10:02:57Z

@JianLi-RH: This pull request explicitly references no jira issue.

Details

In response to this:

Test case: https://polarion.engineering.redhat.com/polarion/#/project/OSE/workitem?id=OCP-42543

Test it locally:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true"
 Running Suite:  - /home/jianl/1_code/cluster-version-operator
 =============================================================
 Random Seed: 1769767023 - will randomize all specs

 Will run 1 of 1 specs
 ------------------------------
 [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
 /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:90
   STEP: Setup ocapi.OC @ 01/30/26 17:57:03.144
 cluster-version-operator-tests "level"=0 "msg"="will use the environment timeout variable to run command: 120s"
 cluster-version-operator-tests "level"=0 "msg"="timeout is: 2m0s"
   STEP: Extract manifests @ 01/30/26 17:57:03.144
 cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest"
 cluster-version-operator-tests "level"=0 "msg"="the output directory does not exist, will create it: /tmp/OTA-42543-manifest"
 cluster-version-operator-tests "level"=0 "msg"="the output directory has been created: /tmp/OTA-42543-manifest"
 cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest"
   STEP: Start to iterate all manifests @ 01/30/26 17:57:33.498
 cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get Deployment cluster-baremetal-operator-hostedcluster -n openshift-machine-api" "output"="Error from server (NotFound): deployments.apps \"cluster-baremetal-operator-hostedcluster\" not found\n"
 cluster-version-operator-tests "msg"="running command failed" "error"="exit status 1" "output"="Error from server (NotFound): deployments.apps \"cluster-baremetal-operator-hostedcluster\" not found\n"
 cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get Service controller-manager-service -n openshift-cloud-credential-operator" "output"="Error from server (NotFound): services \"controller-manager-service\" not found\n"
 cluster-version-operator-tests "msg"="running command failed" "error"="exit status 1" "output"="Error from server (NotFound): services \"controller-manager-service\" not found\n"
 cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get ClusterRoleBinding default-account-cluster-network-operator" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-cluster-network-operator\" not found\n"
 cluster-version-operator-tests "msg"="running command failed" "error"="exit status 1" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-cluster-network-operator\" not found\n"
 cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get PrometheusRule authentication-operator -n openshift-authentication-operator" "output"="Error from server (NotFound): prometheusrules.monitoring.coreos.com \"authentication-operator\" not found\n"
 cluster-version-operator-tests "msg"="running command failed" "error"="exit status 1" "output"="Error from server (NotFound): prometheusrules.monitoring.coreos.com \"authentication-operator\" not found\n"
 cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get ClusterRoleBinding default-account-openshift-machine-config-operator" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\n"
 cluster-version-operator-tests "msg"="running command failed" "error"="exit status 1" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\n"
 cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get CronJob machine-config-nodes-crd-cleanup -n openshift-machine-config-operator" "output"="Error from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\n"
 cluster-version-operator-tests "msg"="running command failed" "error"="exit status 1" "output"="Error from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\n"
 • [36.390 seconds]
 ------------------------------

 Ran 1 of 1 Specs in 36.390 seconds
 SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 0 Skipped
[
 {
   "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true",
   "lifecycle": "blocking",
   "duration": 36390,
   "startTime": "2026-01-30 09:57:03.140591 UTC",
   "endTime": "2026-01-30 09:57:39.530736 UTC",
   "result": "passed",
   "output": "  STEP: Setup ocapi.OC @ 01/30/26 17:57:03.144\n  STEP: Extract manifests @ 01/30/26 17:57:03.144\n  STEP: Start to iterate all manifests @ 01/30/26 17:57:33.498\n"
 }
]
[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

/cc @hongkailiu @DavidHurta

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

coderabbitai · 2026-01-30T10:03:15Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

Walkthrough

Adds a CVO Ginkgo test and corresponding payload entry, new test utilities for client initialization and manifest absence checks, Prometheus types registration for manifest decoding, and dependency updates in go.mod.

Changes

Cohort / File(s)	Summary
Payload config `.openshift-tests-extension/openshift_payload_cluster-version-operator.json`	Inserted a new payload object for the CVO test named "[Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true" with labels `42543`, `Conformance`, `High`, `resources: isolation`, `source: openshift:payload:cluster-version-operator`, `lifecycle: blocking`, empty `environmentSelector`.
CVO test `test/cvo/cvo.go`	Added a Ginkgo test `should not install resources annotated with release.openshift.io/delete=true` that extracts release manifests, parses manifests via library-go manifest parser, and asserts manifests annotated `release.openshift.io/delete=true` result in not-found when applied; includes additional imports and skip checks.
Test connection helpers `test/util/connection.go`	New file with unexported helpers to read `KUBECONFIG`, build a rest.Config, and return typed clientsets (imagev1, securityv1, operatorsv1, monitoringv1, admissionregistrationv1, appsv1, batchv1, corev1, rbacv1, apiextensionsv1, etc.), propagating errors for missing/invalid kubeconfig.
Manifest verification utility `test/util/util.go`	Added exported `GetManifestExpectNotFoundError(manifest.Manifest) error` implementing a type switch that decodes manifests and calls the appropriate client Get for supported resource types (including ImageStream, SecurityContextConstraints, OperatorGroup, PrometheusRule, ServiceMonitor, webhook configs, DaemonSet/Deployment/CronJob/Job, core resources, RBAC, CRDs); returns the Get error or unrecognized-type error.
Dependency updates `go.mod`	Updated and bumped many module versions (k8s.io/, ginkgo/gomega, prometheus-operator, oauth2, protobuf/json-patch, go-openapi/, and numerous indirects); added/promoted prometheus operator modules and a replace for onsi/ginkgo/v2.
Prometheus mapping for codegen `hack/generate-lib-resources.py`	Added monitoring v1 group mapping for Prometheus Operator types `PrometheusRule` and `ServiceMonitor` to the scheme_group_versions mapping.
Resource decoding scheme `lib/resourceread/resourceread.go`	Imported Prometheus Operator monitoringv1 and registered it with the scheme via AddToScheme; added monitoringv1 to the universal decoder's scheme group versions.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

openshift-ci · 2026-01-30T10:06:30Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: JianLi-RH
Once this PR has been reviewed and has the lgtm label, please assign fao89 for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

coderabbitai

Actionable comments posted: 2

🤖 Fix all issues with AI agents

In `@test/cvo/cvo.go`:
- Around line 99-105: Update the assertion message to accurately describe the
expectation: change the misleading message on the
o.Expect(err).NotTo(o.HaveOccurred(), ...) call so it states that manifest
extraction should succeed (or that no error is expected) when invoking
ocClient.AdmReleaseExtract with manifestDir
(ocClient.AdmReleaseExtract(manifestDir)); reference the manifest extraction
operation and use clear text like "expected manifest extraction to succeed" or
"no error expected when extracting manifests" instead of "The NotFound error
should occur when extract manifests".
- Around line 117-119: The loop opens files with os.Open and uses defer
file.Close(), which leaks descriptors until the enclosing function returns;
replace the deferred close by closing each file immediately after its processing
(call file.Close() directly at the end of the loop iteration) or move the
per-file logic into a helper function (e.g., processManifestFile(filePath) that
opens the file and defers file.Close() inside that helper) so file handles are
released promptly; update references to the file variable and remove the in-loop
defer file.Close() in the code around os.Open and file usage.

🧹 Nitpick comments (1)

test/oc/cli/cli.go (1)
118-125: Duplicate logging in Run method.

The ocExecutor.run method (lines 38-42) already logs success/error with command details. This Run wrapper adds redundant logging, resulting in double log entries for each command execution.

Consider removing the duplicate logging here since the executor already handles it:
♻️ Proposed simplification
 func (c *client) Run(args ...string) ([]byte, error) {
-	b, err := c.executor.run(args...)
-	if err != nil {
-		c.logger.Error(err, "running command failed", "output", string(b))
-	} else {
-		c.logger.Info("running command succeeded.")
-	}
-	return b, err
+	return c.executor.run(args...)
 }

test/cvo/cvo.go

JianLi-RH · 2026-01-30T11:46:52Z

@coderabbitai review

coderabbitai · 2026-01-30T11:46:59Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 3

🤖 Fix all issues with AI agents

In `@test/cvo/cvo.go`:
- Around line 134-140: The YAML decode loop currently swallows decode errors
with a continue, but the test should fail instead; replace the silent continue
with an assertion that fails the test on decode errors by calling
o.Expect(err).NotTo(o.HaveOccurred(), "decode manifest %s failed", filePath)
immediately after decoder.Decode(&doc) (remove the unreachable continue) so any
decoder.Decode failure in the loop (the symbol decoder.Decode and variable doc)
causes the test to fail and reports the filePath context.
- Around line 141-145: The annotation check in the loop (variables meta, ann and
the key annotation in test/cvo/cvo.go) wrongly assumes the value is the string
"true"; change the check to handle both boolean true and the string "true" (and
treat other truthy forms if present) by retrieving v := ann[annotation] and
using a type switch or explicit type assertions to accept v == true or v ==
"true" (and skip if ann is nil or the value is absent), so resources with
unquoted YAML true are correctly detected for deletion.
- Around line 103-108: The current code uses a fixed manifestDir.To =
"/tmp/OTA-42543-manifest" which can collide in parallel runs; replace that with
a unique temp directory created via os.MkdirTemp and assign the returned path to
manifestDir.To (handle and return/log any error from MkdirTemp), keep the defer
to os.RemoveAll(manifestDir.To) for cleanup, and then call
ocClient.AdmReleaseExtract(manifestDir) as before; update the code around the
manifestDir variable, ocapi.ReleaseExtractOptions initialization, and the defer
cleanup to use the MkdirTemp-created path.

test/cvo/cvo.go

hongkailiu · 2026-02-01T13:43:19Z

test/cvo/cvo.go

+					args = append(args, "-n", namespace)
+				}
+				_, err := ocClient.Run(args...)
+				o.Expect(err).To(o.HaveOccurred(), "The deleted manifest should not be installed, but actually installed")


Ha. If I understand the code correctly, you are doing the following command with oc-cli:

$ oc get <kind> <name> -n <namespace>

Because KIND|GROUP|VERSION are dynamic, it is not easy to do it via client-go (See how CVO does it). Correct?
It is really nasty, and I really do not want it but I do not have a better way.

HOWEVER, I think this should work (which is much simpler if it does) for your case:

cluster-version-operator/vendor/github.com/openshift/library-go/pkg/manifest/manifest.go

Line 386 in e9762c6

func ParseManifests(r io.Reader) ([]Manifest, error) {

Parse manifests out of files in payload

check if a manifest.Raw contains string release.openshift.io/delete=true;
if yes, save it to a temp file and do oc get -f command with the temp file and expect not-found error(s).

You do not need to any yaml/json parsing here. And you will get Manifest for free. GVK is also difficult to use correctly and the way you do it now might not be accurate (for example, you are using Kind only, not Version nor Group).

Let me know what you think about it or you need more clarification.

Unlike other cases, a simple shell script would do the case. But we like Go code more. 🤷

let me give a try today

Done in recent commit and passed in my local machine:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true" Running Suite: - /home/jianl/1_code/cluster-version-operator ============================================================= Random Seed: 1770004261 - will randomize all specs Will run 1 of 1 specs ------------------------------ [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543] /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:90 STEP: Setting up oc @ 02/02/26 11:51:01.098 cluster-version-operator-tests "level"=0 "msg"="will use the environment timeout variable to run command: 90s" cluster-version-operator-tests "level"=0 "msg"="timeout is: 1m30s" STEP: Extracting manifests in the release @ 02/02/26 11:51:01.099 cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest" cluster-version-operator-tests "level"=0 "msg"="the output directory does not exist, will create it: /tmp/OTA-42543-manifest" cluster-version-operator-tests "level"=0 "msg"="the output directory has been created: /tmp/OTA-42543-manifest" cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest" STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/02/26 11:51:31.859 cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_31_cluster-baremetal-operator_06_deployment-hostedcluster-delete.yaml" "output"="Error from server (NotFound): deployments.apps \"cluster-baremetal-operator-hostedcluster\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_50_cloud-credential-operator_01-service-delete.yaml" "output"="Error from server (NotFound): services \"controller-manager-service\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_70_cluster-network-operator_02_rbac.yaml" "output"="NAME SECRETS AGE\nserviceaccount/cluster-network-operator 1 96m\n\nNAME ROLE AGE\nclusterrolebinding.rbac.authorization.k8s.io/cluster-network-operator ClusterRole/cluster-admin 96m\nError from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-cluster-network-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_cluster-authentication-operator_03_prometheusrule.yaml" "output"="Error from server (NotFound): prometheusrules.monitoring.coreos.com \"authentication-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="failed to parse manifest file: /tmp/OTA-42543-manifest/release-metadata" "error"="error parsing: Resource with fields Group: \"\" Kind: \"cincinnati-metadata-v0\" Name: \"\" must contain kubernetes required fields kind and name" • [46.763 seconds] ------------------------------ Ran 1 of 1 Specs in 46.764 seconds SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 0 Skipped [ { "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true", "lifecycle": "blocking", "duration": 46764, "startTime": "2026-02-02 03:51:01.095874 UTC", "endTime": "2026-02-02 03:51:47.859924 UTC", "result": "passed", "output": " STEP: Setting up oc @ 02/02/26 11:51:01.098\n STEP: Extracting manifests in the release @ 02/02/26 11:51:01.099\n STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/02/26 11:51:31.859\n" } ] [jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

Here are some errors from the new output:

cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="Running command failed" "error"="exit status 1" "cmd"="/usr/local/sbin/oc" "args"="get -f /tmp/OTA-42543-manifest/0000_90_machine-config_90_deletion.yaml" "output"="Error from server (NotFound): clusterrolebindings.rbac.authorization.k8s.io \"default-account-openshift-machine-config-operator\" not found\nError from server (NotFound): cronjobs.batch \"machine-config-nodes-crd-cleanup\" not found\nError from server (NotFound): services \"machine-config-operator\" not found\nError from server (NotFound): servicemonitors.monitoring.coreos.com \"machine-config-operator\" not found\n" cluster-version-operator-tests "msg"="failed to parse manifest file: /tmp/OTA-42543-manifest/release-metadata" "error"="error parsing: Resource with fields Group: \"\" Kind: \"cincinnati-metadata-v0\" Name: \"\" must contain kubernetes required fields kind and name"

This is the content of related manifest:

--- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRoleBinding metadata: annotations: include.release.openshift.io/ibm-cloud-managed: "true" include.release.openshift.io/self-managed-high-availability: "true" include.release.openshift.io/single-node-developer: "true" release.openshift.io/delete: "true" name: default-account-openshift-machine-config-operator roleRef: apiGroup: rbac.authorization.k8s.io kind: ClusterRole name: cluster-admin subjects: - kind: ServiceAccount name: default namespace: openshift-machine-config-operator --- apiVersion: batch/v1 kind: CronJob metadata: annotations: include.release.openshift.io/self-managed-high-availability: "true" include.release.openshift.io/single-node-developer: "true" release.openshift.io/delete: "true" release.openshift.io/feature-set: Default name: machine-config-nodes-crd-cleanup namespace: openshift-machine-config-operator

It also not found in previous code.

hongkailiu · 2026-02-01T13:46:10Z

test/oc/api/api.go

 type OC interface {
 	AdmReleaseExtract(o ReleaseExtractOptions) error
 	Version(o VersionOptions) (string, error)
+	Run(args ...string) ([]byte, error)


Get(args ...string) (string, error) should be enough for your case.

The method Run() is be for oc run command.

If the idea https://github.com/openshift/cluster-version-operator/pull/1309/changes#r2751255955 works out, I would just do

GetFileExpectNotFoundError(args ...string) (string, error) to avoid abuse of oc get.

Like I said before https://github.com/openshift/cluster-version-operator/pull/1267/changes#r2579341159
If we adding GetFileExpectNotFoundError(), Run to OC client, we have to adding them to the interface as well. This is really not a good practice for using interface.

I really don't like to implement interface for a single instance.

implement interface for a single instance

Do you mean the interface OC has only one struct that implements it at the moment?

We thought we have gone thro it in the last round.

Moreover, the interface tells us how many oc cmds we have to reply on.
Everyone of them is a compromise we made between go-library and os.exec.
It will save the time for us to make a Sheet like David did the last time.

If you want to have more discussions about it, we can certainly do it.
We can always revisit it in the future if it brings us more pains than gains.

My wild guess is that it will settle down after a while.
But now we just started to use it, the methods there might grow.
But I will try my best to keep the things there under control. 🤞
As we discussed, direct calls of an oc cmd is the last resort.

Sure we can have a discussion.
I agree works will be settle down after a while. But this is completely different from how I used to use interfaces.

If you want to convince me of making Run function to run any oc cmd, should your reason be more specific? 🙂

My reasons are given in the previous comment.

hongkailiu · 2026-02-01T14:33:50Z

test/oc/cli/cli.go

+	_, err := os.Stat(o.To)
+	if errors.Is(err, os.ErrNotExist) {
+		c.logger.Info(fmt.Sprintf("the output directory does not exist, will create it: %s", o.To))
+		if err = os.Mkdir(o.To, 0755); err != nil {


Let us make the directory in the case, instead of the function.
The method here just calls oc adm release extract (maybe include some logs for debugging), nothing else.

we can also discuss this.

My reason is to keep this pkg as thin as possible because this (spawning a process to call oc) is the last resort.
Pushing more logic into this is the other way around.

hongkailiu · 2026-02-01T14:55:54Z

test/oc/cli/cli.go

@@ -70,12 +70,13 @@ func NewOCCli(logger logr.Logger) (api.OC, error) {
 	timeout := 30 * time.Second
 	timeoutStr := os.Getenv("OC_CLI_TIMEOUT")


Is 30s too short for oc adm release extract? You want 90s for it?

In that case, let us do another function.

func NewOCCliWithTimeout(logger logr.Logger, timeout time.Duration) (api.OC, error)

and

func NewOCCli(logger logr.Logger) (api.OC, error) { return NewOCCli(logger, 30 * time.Second) (api.OC, error) }

We could remove the logic about OC_CLI_TIMEOUT (i think no one is using it at the moment). I have to admit that I did not understand your request here.

Please do it in another commit. I can do it too if you want.

ok, please go ahead, I will not update my code for now.
BTW, I really do not want to introduce a new function for a parameter.
Today we add NewOCCliWithTimeout for timeout, tomorrow we may add other functions.

Today we add NewOCCliWithTimeout for timeout, tomorrow we may add other functions.

Fair point. I will make an option for it.

There it goes:

#1311

test/cvo/cvo.go

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@test/cvo/cvo.go`:
- Around line 121-160: The deferred close inside the loop captures the loop
variable file by reference and is unnecessary because file.Close() is already
called explicitly later; remove the inline defer that references closeFilePass
and the closeFilePass variable declaration, and rely on the existing explicit
err = file.Close() error handling block (keep the o.Expect checks around
file.Close()); ensure no other code paths expect closeFilePass and that
file.Close() remains invoked for each opened file.

coderabbitai · 2026-02-02T03:08:49Z

test/cvo/cvo.go

+			file, err := os.Open(filePath)
+			o.Expect(err).NotTo(o.HaveOccurred())
+			defer func() {
+				if !closeFilePass {
+					// Close the file again
+					if err = file.Close(); err != nil {
+						o.Expect(err).NotTo(o.HaveOccurred(), "close file failed")
+					}
+				}
+			}()
+			decoder := yamlv3.NewDecoder(file)
+			for {
+				var doc map[string]interface{}
+				if err := decoder.Decode(&doc); err != nil {
+					if err == io.EOF {
+						break
+					}
+					continue
+				}
+				meta, _ := doc["metadata"].(map[string]interface{})
+				ann, _ := meta["annotations"].(map[string]interface{})
+				if ann == nil || ann[annotation] != "true" {
+					continue
+				}
+				kind, _ := doc["kind"].(string)
+				name, _ := meta["name"].(string)
+				namespace, _ := meta["namespace"].(string)
+				args := []string{"get", kind, name}
+				if namespace != "" {
+					args = append(args, "-n", namespace)
+				}
+				_, err := ocClient.Run(args...)
+				o.Expect(err).To(o.HaveOccurred(), "The deleted manifest should not be installed, but actually installed")
+			}
+			// close each file
+			err = file.Close()
+			if err != nil {
+				closeFilePass = false
+				o.Expect(err).NotTo(o.HaveOccurred(), "close file failed")
+			}


⚠️ Potential issue | 🟠 Major

File handle management is overly complex and still has a subtle bug.

The defer at line 123 captures file by reference in a loop—all accumulated defers will operate on the last assigned file when the function exits. The closeFilePass flag adds complexity without fixing this. Since you already call file.Close() explicitly at line 156, remove the inner defer entirely.

🐛 Proposed simplification

filePath := filepath.Join(manifestDir.To, manifest.Name()) file, err := os.Open(filePath) o.Expect(err).NotTo(o.HaveOccurred()) - defer func() { - if !closeFilePass { - // Close the file again - if err = file.Close(); err != nil { - o.Expect(err).NotTo(o.HaveOccurred(), "close file failed") - } - } - }() decoder := yamlv3.NewDecoder(file) for { // ... decode loop unchanged ... } - // close each file - err = file.Close() - if err != nil { - closeFilePass = false - o.Expect(err).NotTo(o.HaveOccurred(), "close file failed") - } + file.Close() }

Also remove the closeFilePass variable declaration at line 114.

🤖 Prompt for AI Agents

In `@test/cvo/cvo.go` around lines 121 - 160, The deferred close inside the loop captures the loop variable file by reference and is unnecessary because file.Close() is already called explicitly later; remove the inline defer that references closeFilePass and the closeFilePass variable declaration, and rely on the existing explicit err = file.Close() error handling block (keep the o.Expect checks around file.Close()); ensure no other code paths expect closeFilePass and that file.Close() remains invoked for each opened file.

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@test/cvo/cvo.go`:
- Around line 136-137: The test currently calls ocClient.Run("get", "-f",
filePath) which queries the whole manifest; instead parse the manifest
referenced by filePath to extract the specific resource's kind, name, and
namespace (or default namespace) and call ocClient.Run("get", <kind>, <name>,
"-n", <namespace>) asserting the returned error is a NotFound (use the same
error check helper used elsewhere or assert the error string contains
"NotFound"); update the assertion around ocClient.Run and replace the file-based
get with the per-resource get so the deleted resource is verified precisely
(refer to ocClient.Run and filePath variables to locate the code to change).

test/cvo/cvo.go

JianLi-RH · 2026-02-02T08:48:16Z

/hold

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@test/oc/cli/cli.go`:
- Around line 92-100: os.Stat call result is only checked for os.ErrNotExist, so
other errors (e.g., permission denied) are ignored; modify the block around
os.Stat(o.To) to handle non-nil errors that are not os.ErrNotExist by returning
or logging the error (include context about o.To) before proceeding to os.Mkdir;
preserve the existing os.ErrNotExist path that uses os.Mkdir and c.logger.Info,
but ensure any other err from os.Stat is propagated (or wrapped) so downstream
code doesn't run on invalid assumptions.

🧹 Nitpick comments (1)

test/cvo/cvo.go (1)
121-129: Simplify error handling and consider stricter parse-error handling.

The if err != nil check at line 122 is redundant since o.Expect() handles nil errors correctly. For parse errors (line 126-129), silently continuing could mask issues with actual manifest files. Consider distinguishing between expected non-manifest files (like release-metadata) and unexpected parse failures.
♻️ Suggested simplification
 			raw, err := os.ReadFile(filePath)
-			if err != nil {
-				o.Expect(err).NotTo(o.HaveOccurred(), "failed to read manifest file")
-			}
+			o.Expect(err).NotTo(o.HaveOccurred(), "failed to read manifest file: %s", filePath)
 			manifests, err := manifest.ParseManifests(bytes.NewReader(raw))
 			if err != nil {
-				// files like release-metadata are not manifest file, so skip them
-				logger.Error(err, "failed to parse manifest file: "+filePath)
+				// Non-manifest files (e.g., release-metadata) are expected to fail parsing
+				logger.Info(fmt.Sprintf("skipping non-manifest file %s: %v", filePath, err))
+				continue
 			}

coderabbitai · 2026-02-02T10:17:42Z

test/oc/cli/cli.go

+	_, err := os.Stat(o.To)
+	if errors.Is(err, os.ErrNotExist) {
+		c.logger.Info(fmt.Sprintf("the output directory does not exist, will create it: %s", o.To))
+		if err = os.Mkdir(o.To, 0755); err != nil {
+			err = fmt.Errorf("failed to create directory: %v", err)
+			return err
+		}
+		c.logger.Info(fmt.Sprintf("the output directory has been created: %s", o.To))
+	}


⚠️ Potential issue | 🟡 Minor

Non-ErrNotExist errors from os.Stat are silently ignored.

If os.Stat fails with an error other than os.ErrNotExist (e.g., permission denied), the code continues execution without reporting it, which could lead to confusing failures downstream.

Proposed fix

func (c *client) AdmReleaseExtract(o api.ReleaseExtractOptions) error { _, err := os.Stat(o.To) - if errors.Is(err, os.ErrNotExist) { + if err != nil && !errors.Is(err, os.ErrNotExist) { + return fmt.Errorf("failed to stat output directory %s: %w", o.To, err) + } else if errors.Is(err, os.ErrNotExist) { c.logger.Info(fmt.Sprintf("the output directory does not exist, will create it: %s", o.To)) if err = os.Mkdir(o.To, 0755); err != nil { - err = fmt.Errorf("failed to create directory: %v", err) + err = fmt.Errorf("failed to create directory: %w", err) return err } c.logger.Info(fmt.Sprintf("the output directory has been created: %s", o.To)) }

🤖 Prompt for AI Agents

In `@test/oc/cli/cli.go` around lines 92 - 100, os.Stat call result is only checked for os.ErrNotExist, so other errors (e.g., permission denied) are ignored; modify the block around os.Stat(o.To) to handle non-nil errors that are not os.ErrNotExist by returning or logging the error (include context about o.To) before proceeding to os.Mkdir; preserve the existing os.ErrNotExist path that uses os.Mkdir and c.logger.Info, but ensure any other err from os.Stat is propagated (or wrapped) so downstream code doesn't run on invalid assumptions.

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@test/util/util.go`:
- Around line 31-44: Update the test call site that uses
GetManifestExpectNotFoundError to explicitly assert the error is a NotFound
error; replace the current loose check (err != nil) with an assertion using
apierrors.IsNotFound(err) (for example:
o.Expect(apierrors.IsNotFound(err)).To(o.BeTrue(), "...")). Ensure the test
imports k8s.io/apimachinery/pkg/api/errors as apierrors and uses
GetManifestExpectNotFoundError(ms) as the source of the error being checked.

test/util/util.go

JianLi-RH · 2026-02-05T04:37:35Z

Test passed on my local machine:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true"
  Running Suite:  - /home/jianl/1_code/cluster-version-operator
  =============================================================
  Random Seed: 1770259064 - will randomize all specs

  Will run 1 of 1 specs
  ------------------------------
  [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
  /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:94
  "level"=0 "msg"="microshift-version configmap not found"
    STEP: Setting up oc @ 02/05/26 10:37:45.301
    STEP: Extracting manifests in the release @ 02/05/26 10:37:45.301
  cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest-2160632442"
  cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest-2160632442"
    STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/05/26 10:38:11.733
  cluster-version-operator-tests "msg"="failed to parse manifest file: /tmp/OTA-42543-manifest-2160632442/release-metadata" "error"="error parsing: Resource with fields Group: \"\" Kind: \"cincinnati-metadata-v0\" Name: \"\" must contain kubernetes required fields kind and name"
  • [32.839 seconds]
  ------------------------------

  Ran 1 of 1 Specs in 32.839 seconds
  SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 0 Skipped
[
  {
    "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true",
    "lifecycle": "blocking",
    "duration": 32839,
    "startTime": "2026-02-05 02:37:44.078718 UTC",
    "endTime": "2026-02-05 02:38:16.918404 UTC",
    "result": "passed",
    "output": "  STEP: Setting up oc @ 02/05/26 10:37:45.301\n  STEP: Extracting manifests in the release @ 02/05/26 10:37:45.301\n  STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/05/26 10:38:11.733\n"
  }
]
[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

JianLi-RH · 2026-02-05T10:22:00Z

hi @hongkailiu @DavidHurta Please help review this PR.

Here is the test result on my local machine:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true"
  Running Suite:  - /home/jianl/1_code/cluster-version-operator
  =============================================================
  Random Seed: 1770286547 - will randomize all specs

  Will run 1 of 1 specs
  ------------------------------
  [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
  /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:94
  "level"=0 "msg"="microshift-version configmap not found"
    STEP: Setting up oc @ 02/05/26 18:15:49.091
    STEP: Extracting manifests in the release @ 02/05/26 18:15:49.091
  cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest-1413905999"
  cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest-1413905999"
    STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/05/26 18:16:54.17
  cluster-version-operator-tests "msg"="failed to parse manifest file: /tmp/OTA-42543-manifest-1413905999/release-metadata" "error"="error parsing: Resource with fields Group: \"\" Kind: \"cincinnati-metadata-v0\" Name: \"\" must contain kubernetes required fields kind and name"
  • [72.927 seconds]
  ------------------------------

  Ran 1 of 1 Specs in 72.927 seconds
  SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 0 Skipped
[
  {
    "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true",
    "lifecycle": "blocking",
    "duration": 72927,
    "startTime": "2026-02-05 10:15:47.714002 UTC",
    "endTime": "2026-02-05 10:17:00.641615 UTC",
    "result": "passed",
    "output": "  STEP: Setting up oc @ 02/05/26 18:15:49.091\n  STEP: Extracting manifests in the release @ 02/05/26 18:15:49.091\n  STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/05/26 18:16:54.17\n"
  }
]
[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

hongkailiu

@JianLi-RH

Nice work.
It proves that without changing much code we could implement this purely with go-lib.

While I was reviewing this change, I found a better idea.

We just need to expose an internal function like this: See the package name: dynamicclient.

https://github.com/openshift/cluster-version-operator/pull/1325/changes#diff-09ae066aa346caa4fd838c692cbdf5a6a6ef72bfc8d88cb26dd895b7fb5195d3

And see how simple the pull could go.
https://github.com/openshift/cluster-version-operator/pull/1325/changes

After that, my comments for the review are here.

66c1664

I am sorry that I did not find this earlier and I am new to this area of code.

Do you want to give it a try?

hongkailiu · 2026-02-18T22:50:54Z

/hold

JianLi-RH · 2026-02-25T02:04:49Z

Still work with dynamicclient:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true"
  Running Suite:  - /home/jianl/1_code/cluster-version-operator
  =============================================================
  Random Seed: 1771984989 - will randomize all specs

  Will run 1 of 1 specs
  ------------------------------
  [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
  /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:95
  "level"=0 "msg"="microshift-version configmap not found"
    STEP: Setting up oc @ 02/25/26 10:03:11.071
    STEP: Extracting manifests in the release @ 02/25/26 10:03:11.071
  cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest-4153762548"
  cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest-4153762548"
    STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/25/26 10:03:37.051
  • [33.301 seconds]
  ------------------------------

  Ran 1 of 1 Specs in 33.301 seconds
  SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 0 Skipped
[
  {
    "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true",
    "lifecycle": "blocking",
    "duration": 33301,
    "startTime": "2026-02-25 02:03:09.854563 UTC",
    "endTime": "2026-02-25 02:03:43.156308 UTC",
    "result": "passed",
    "output": "  STEP: Setting up oc @ 02/25/26 10:03:11.071\n  STEP: Extracting manifests in the release @ 02/25/26 10:03:11.071\n  STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/25/26 10:03:37.051\n"
  }
]
[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

…se.openshift.io/delete=true

openshift-ci · 2026-02-27T00:53:24Z

@JianLi-RH: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/e2e-agnostic-ovn-upgrade-into-change	`4a84e7e`	link	true	`/test e2e-agnostic-ovn-upgrade-into-change`
ci/prow/unit	`4a84e7e`	link	true	`/test unit`
ci/prow/okd-scos-images	`4a84e7e`	link	true	`/test okd-scos-images`
ci/prow/verify-deps	`4a84e7e`	link	true	`/test verify-deps`
ci/prow/e2e-agnostic-ovn-upgrade-out-of-change	`4a84e7e`	link	true	`/test e2e-agnostic-ovn-upgrade-out-of-change`
ci/prow/e2e-agnostic-operator	`4a84e7e`	link	true	`/test e2e-agnostic-operator`
ci/prow/lint	`4a84e7e`	link	true	`/test lint`
ci/prow/e2e-agnostic-ovn	`4a84e7e`	link	true	`/test e2e-agnostic-ovn`
ci/prow/verify-update	`4a84e7e`	link	true	`/test verify-update`
ci/prow/gofmt	`4a84e7e`	link	true	`/test gofmt`
ci/prow/e2e-hypershift	`4a84e7e`	link	true	`/test e2e-hypershift`
ci/prow/e2e-hypershift-conformance	`4a84e7e`	link	true	`/test e2e-hypershift-conformance`

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

JianLi-RH · 2026-02-27T01:30:23Z

Test again, it works fine:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true"
  Running Suite:  - /home/jianl/1_code/cluster-version-operator
  =============================================================
  Random Seed: 1772155719 - will randomize all specs

  Will run 1 of 1 specs
  ------------------------------
  [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
  /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:96
  "level"=0 "msg"="microshift-version configmap not found"
    STEP: Setting up oc @ 02/27/26 09:28:40.188
    STEP: Extracting manifests in the release @ 02/27/26 09:28:40.188
  cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest-3126446433"
  cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest-3126446433"
    STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/27/26 09:29:16.613
  • [43.036 seconds]
  ------------------------------

  Ran 1 of 1 Specs in 43.036 seconds
  SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 0 Skipped
[
  {
    "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true",
    "lifecycle": "blocking",
    "duration": 43036,
    "startTime": "2026-02-27 01:28:39.296085 UTC",
    "endTime": "2026-02-27 01:29:22.332900 UTC",
    "result": "passed",
    "output": "  STEP: Setting up oc @ 02/27/26 09:28:40.188\n  STEP: Extracting manifests in the release @ 02/27/26 09:28:40.188\n  STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/27/26 09:29:16.613\n"
  }
]
[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

JianLi-RH · 2026-02-27T01:36:08Z

The case can fail when a resource with release.openshift.io/delete unexpected installed:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ cat <<EOF | oc apply -f -
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  annotations:
    include.release.openshift.io/ibm-cloud-managed: "true"
    include.release.openshift.io/self-managed-high-availability: "true"
    include.release.openshift.io/single-node-developer: "true"
  name: default-account-cluster-network-operator
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: cluster-admin
subjects:
- kind: ServiceAccount
  name: default
  namespace: openshift-network-operator
EOF
clusterrolebinding.rbac.authorization.k8s.io/default-account-cluster-network-operator created

Test failed:

[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$ _output/linux/amd64/cluster-version-operator-tests run-test "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true"
  Running Suite:  - /home/jianl/1_code/cluster-version-operator
  =============================================================
  Random Seed: 1772155927 - will randomize all specs

  Will run 1 of 1 specs
  ------------------------------
  [Jira:"Cluster Version Operator"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
  /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:96
  "level"=0 "msg"="microshift-version configmap not found"
    STEP: Setting up oc @ 02/27/26 09:32:08.467
    STEP: Extracting manifests in the release @ 02/27/26 09:32:08.468
  cluster-version-operator-tests "level"=0 "msg"="Extract manifests to: /tmp/OTA-42543-manifest-373617607"
  cluster-version-operator-tests "level"=0 "msg"="Running command succeeded." "cmd"="/usr/local/sbin/oc" "args"="adm release extract --to=/tmp/OTA-42543-manifest-373617607"
    STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/27/26 09:32:40.324
    [FAILED] in [It] - /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:139 @ 02/27/26 09:32:44.859
  • [FAILED] [37.239 seconds]
  [Jira:"Cluster Version Operator"] cluster-version-operator [It] should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
  /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:96

    [FAILED] The deleted manifest should not be installed, but actually installed: manifest: rbac.authorization.k8s.io/v1, Kind=ClusterRoleBinding default-account-cluster-network-operator in namespace  from file "0000_70_cluster-network-operator_02_rbac.yaml", error: <nil>
    Expected
        <bool>: false
    to be true
    In [It] at: /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:139 @ 02/27/26 09:32:44.859
  ------------------------------

  Summarizing 1 Failure:
    [FAIL] [Jira:"Cluster Version Operator"] cluster-version-operator [It] should not install resources annotated with release.openshift.io/delete=true [Conformance, High, 42543]
    /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:139

  Ran 1 of 1 Specs in 37.240 seconds
  FAIL! -- 0 Passed | 1 Failed | 0 Pending | 0 Skipped
[
  {
    "name": "[Jira:\"Cluster Version Operator\"] cluster-version-operator should not install resources annotated with release.openshift.io/delete=true",
    "lifecycle": "blocking",
    "duration": 37240,
    "startTime": "2026-02-27 01:32:07.619648 UTC",
    "endTime": "2026-02-27 01:32:44.859683 UTC",
    "result": "failed",
    "output": "  STEP: Setting up oc @ 02/27/26 09:32:08.467\n  STEP: Extracting manifests in the release @ 02/27/26 09:32:08.468\n  STEP: Checking if getting manifests with release.openshift.io/delete on the cluster led to not-found error @ 02/27/26 09:32:40.324\n  [FAILED] in [It] - /home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:139 @ 02/27/26 09:32:44.859\n",
    "error": "fail [/home/jianl/1_code/cluster-version-operator/test/cvo/cvo.go:139]: The deleted manifest should not be installed, but actually installed: manifest: rbac.authorization.k8s.io/v1, Kind=ClusterRoleBinding default-account-cluster-network-operator in namespace  from file \"0000_70_cluster-network-operator_02_rbac.yaml\", error: \u003cnil\u003e\nExpected\n    \u003cbool\u003e: false\nto be true"
  }
]
Error: 1 tests failed
[jianl@jianl-thinkpadt14gen4 cluster-version-operator]$

hongkailiu

Only a couple of import re-ordering.
Otherwise, it looks good to me.

There are some tooling for ordering the imports but we do not use them for CVO.
The general conventions are: Each has its own section.

GoLang build-in packages
3rd party packages, like k8s.io etc
packages from its own repo

hongkailiu · 2026-02-27T02:36:45Z

pkg/cvo/external/dynamicclient/client.go

+package dynamicclient
+
+import (
+	internaldynamicclient "github.com/openshift/cluster-version-operator/pkg/cvo/internal/dynamicclient"


nit:
I know this is from my suggestion.
But let us move the local pkg to the last section of import.

hongkailiu · 2026-02-27T02:38:36Z

test/cvo/cvo.go

 	g "github.com/onsi/ginkgo/v2"
 	o "github.com/onsi/gomega"
-
+	"github.com/openshift-eng/openshift-tests-extension/pkg/util/sets"


Could we use "k8s.io/apimachinery/pkg/util/sets" instead?

hongkailiu · 2026-02-27T02:39:35Z

test/cvo/cvo.go

 	ocapi "github.com/openshift/cluster-version-operator/test/oc/api"
 	"github.com/openshift/cluster-version-operator/test/util"
+	"github.com/openshift/library-go/pkg/manifest"
+	apierrors "k8s.io/apimachinery/pkg/api/errors"


Those 2 should go above.

openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Jan 30, 2026

openshift-ci bot requested review from DavidHurta and hongkailiu January 30, 2026 10:03

coderabbitai bot reviewed Jan 30, 2026

View reviewed changes

test/cvo/cvo.go Outdated Show resolved Hide resolved

test/cvo/cvo.go Outdated Show resolved Hide resolved

JianLi-RH changed the title ~~NO-ISSUE: OTA-1605 Automate OCP-42543~~ WIP NO-ISSUE: OTA-1605 Automate OCP-42543 Jan 30, 2026

openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 30, 2026

JianLi-RH force-pushed the automate_42543 branch 4 times, most recently from 8d09bb9 to f175131 Compare January 30, 2026 10:57

JianLi-RH changed the title ~~WIP NO-ISSUE: OTA-1605 Automate OCP-42543~~ NO-ISSUE: OTA-1605 Automate OCP-42543 Jan 30, 2026

openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 30, 2026

coderabbitai bot reviewed Jan 30, 2026

View reviewed changes

test/cvo/cvo.go Outdated Show resolved Hide resolved

test/cvo/cvo.go Outdated Show resolved Hide resolved

test/cvo/cvo.go Outdated Show resolved Hide resolved

hongkailiu reviewed Feb 1, 2026

View reviewed changes

JianLi-RH force-pushed the automate_42543 branch from f175131 to 788dc98 Compare February 2, 2026 03:04

coderabbitai bot reviewed Feb 2, 2026

View reviewed changes

JianLi-RH force-pushed the automate_42543 branch 2 times, most recently from 159290d to 875c19b Compare February 2, 2026 04:02

coderabbitai bot reviewed Feb 2, 2026

View reviewed changes

test/cvo/cvo.go Outdated Show resolved Hide resolved

hongkailiu mentioned this pull request Feb 2, 2026

NO-JIRA: More flexible timeout option for oc-cli #1311

Merged

openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Feb 2, 2026

JianLi-RH force-pushed the automate_42543 branch from 875c19b to c43b487 Compare February 2, 2026 10:13

coderabbitai bot reviewed Feb 2, 2026

View reviewed changes

JianLi-RH force-pushed the automate_42543 branch from c43b487 to df9c6b0 Compare February 2, 2026 10:35

JianLi-RH force-pushed the automate_42543 branch from 40e229f to a0d7f94 Compare February 5, 2026 02:40

coderabbitai bot reviewed Feb 5, 2026

View reviewed changes

test/util/util.go Outdated Show resolved Hide resolved

JianLi-RH force-pushed the automate_42543 branch 2 times, most recently from ec97a81 to a1fe89e Compare February 5, 2026 10:19

JianLi-RH force-pushed the automate_42543 branch 5 times, most recently from 2a3061a to 93a03c3 Compare February 14, 2026 02:47

hongkailiu reviewed Feb 18, 2026

View reviewed changes

JianLi-RH force-pushed the automate_42543 branch 3 times, most recently from e3db48f to 80fd0c6 Compare February 25, 2026 02:00

JianLi-RH force-pushed the automate_42543 branch 5 times, most recently from 043809b to 4a84e7e Compare February 27, 2026 00:48

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 27, 2026

Automate OCP-42543: should not install resources annotated with relea…

76015af

…se.openshift.io/delete=true

JianLi-RH force-pushed the automate_42543 branch from 4a84e7e to 76015af Compare February 27, 2026 00:53

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 27, 2026

hongkailiu reviewed Feb 27, 2026

View reviewed changes

		@@ -70,12 +70,13 @@ func NewOCCli(logger logr.Logger) (api.OC, error) {
		timeout := 30 * time.Second
		timeoutStr := os.Getenv("OC_CLI_TIMEOUT")

Conversation

JianLi-RH commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

openshift-ci-robot commented Jan 30, 2026

Uh oh!

coderabbitai bot commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Uh oh!

openshift-ci bot commented Jan 30, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

JianLi-RH commented Jan 30, 2026

Uh oh!

coderabbitai bot commented Jan 30, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

JianLi-RH commented Jan 30, 2026 •

edited

Loading

coderabbitai bot commented Jan 30, 2026 •

edited

Loading

hongkailiu left a comment •

edited

Loading