feat(storage): integrate ACO tracing into public client and add system tests by chandra-siri · Pull Request #17223 · googleapis/google-cloud-python

chandra-siri · 2026-05-21T15:23:15Z

Description

The final piece of the App-centric Observability (ACO) tracing implementation. Wires up the metadata cache and tracing context managers to the public client interfaces and establishes a 100% sleep-free live system test suite.

Note

This PR is stacked on top of PR 3 (#17222). It should be reviewed/merged after PR 3.

Changes

Updated google/cloud/storage/client.py, bucket.py, blob.py, and transfer_manager.py to wrap critical operations (e.g. uploads, downloads, retrievals, deletes) in active trace spans and update the cache state on api calls/deletes.
Updated tests/unit/test_client.py and test_bucket.py with caching fallback tests.
Created tests/system/test_aco_observability.py with fully-deterministic concurrent system tests executing against a live GCS backend.

This is PR 4 of a 4-part series to break down the ACO tracing compatibility feature into reviewable slices.

…m tests

gemini-code-assist

Code Review

This pull request introduces an in-memory LRU cache for GCS bucket metadata to support App-centric Observability (ACO) by including destination attributes in OpenTelemetry spans. It adds a new BucketMetadataCache and a generic LRUCache implementation, updates the Client, Bucket, and Blob classes to manage cache state during operations, and modifies tracing logic to include cached metadata. Feedback focuses on improving performance and reliability, including suggestions to use a thread pool for background fetches, pre-compile regular expressions for bucket name extraction, fix incorrect log messages, and eliminate redundant URI parsing in download_blob_to_file.

gemini-code-assist · 2026-05-21T15:27:33Z

+                threading.Thread(
+                    target=self._fetch_background, args=(bucket_name,), daemon=True
+                ).start()


Spawning a new threading.Thread for every unique bucket cache miss could lead to excessive thread creation in applications that access many buckets. Consider using a concurrent.futures.ThreadPoolExecutor with a fixed number of workers to manage these background metadata fetches more efficiently.

gemini-code-assist · 2026-05-21T15:27:33Z

+            if name in (
+                "Storage.Client.getBucket",
+                "Storage.Client.lookupBucket",
+                "Storage.Bucket.reload",
+                "Storage.Bucket.exists",
+            ):


The exclusion list for background metadata fetches should include bucket creation and deletion operations. Triggering a background fetch during create is unnecessary as the bucket may not exist yet, and during delete it is redundant as the resource is being removed.

Suggested change

if name in (

"Storage.Client.getBucket",

"Storage.Client.lookupBucket",

"Storage.Bucket.reload",

"Storage.Bucket.exists",

):

if name in (

"Storage.Client.getBucket",

"Storage.Client.lookupBucket",

"Storage.Bucket.reload",

"Storage.Bucket.exists",

"Storage.Bucket.create",

"Storage.Bucket.delete",

):

gemini-code-assist · 2026-05-21T15:27:33Z

+    HAS_OPENTELEMETRY,
+)
+
+logger = logging.getLogger(__name__)


To improve performance in the api_request hot path, pre-compile the bucket name extraction regex at the module level.

Suggested change

logger = logging.getLogger(__name__)

logger = logging.getLogger(__name__)

_BUCKET_NAME_RE = re.compile(r"/b/([^/?#]+)")

gemini-code-assist · 2026-05-21T15:27:33Z

+            and hasattr(client, "_bucket_metadata_cache")
+            and client._bucket_metadata_cache
+        ):
+            match = re.search(r"/b/([^/?#]+)", kwargs.get("path", ""))


Use the pre-compiled regex for bucket name extraction.

Suggested change

match = re.search(r"/b/([^/?#]+)", kwargs.get("path", ""))

match = _BUCKET_NAME_RE.search(kwargs.get("path", ""))

gemini-code-assist · 2026-05-21T15:27:33Z

+                        span_attributes["gcp.resource.destination.id"] = dest_id
+                        span_attributes["gcp.resource.destination.location"] = loc
+                except Exception as e:
+                    logger.debug(f"Failed cache.get_or_queue_fetch in api_request: {e}")


The log message incorrectly references get_or_queue_fetch, but the code actually calls get on line 94. This could be confusing during debugging.

Suggested change

logger.debug(f"Failed cache.get_or_queue_fetch in api_request: {e}")

logger.debug(f"Failed cache.get in api_request: {e}")

gemini-code-assist · 2026-05-21T15:27:34Z

+                    and hasattr(client, "_bucket_metadata_cache")
+                    and client._bucket_metadata_cache
+                ):
+                    match = re.search(r"/b/([^/?#]+)", kwargs.get("path", ""))


Use the pre-compiled regex for bucket name extraction in the error handler.

Suggested change

match = re.search(r"/b/([^/?#]+)", kwargs.get("path", ""))

match = _BUCKET_NAME_RE.search(kwargs.get("path", ""))

gemini-code-assist · 2026-05-21T15:27:34Z

+        bucket_name = None
+        if isinstance(blob_or_uri, Blob):
+            bucket_name = blob_or_uri.bucket.name if blob_or_uri.bucket else None
+        elif isinstance(blob_or_uri, str) and blob_or_uri.startswith("gs://"):
+            try:
+                temp_blob = Blob.from_uri(blob_or_uri)
+                bucket_name = temp_blob.bucket.name
+            except Exception:
+                pass
+
+        with create_trace_span_helper(
+            self,
+            bucket_name,
+            name="Storage.Client.downloadBlobToFile",
+        ):
            if not isinstance(blob_or_uri, Blob):
                blob_or_uri = Blob.from_uri(blob_or_uri)


In download_blob_to_file, Blob.from_uri is called twice when a gs:// URI string is provided. You can optimize this by reusing the blob object created during the bucket name extraction. Additionally, ensure that the exception block logs the error instead of silently passing to aid in debugging.

Suggested change

bucket_name = None

if isinstance(blob_or_uri, Blob):

bucket_name = blob_or_uri.bucket.name if blob_or_uri.bucket else None

elif isinstance(blob_or_uri, str) and blob_or_uri.startswith("gs://"):

try:

temp_blob = Blob.from_uri(blob_or_uri)

bucket_name = temp_blob.bucket.name

except Exception:

pass

with create_trace_span_helper(

self,

bucket_name,

name="Storage.Client.downloadBlobToFile",

):

if not isinstance(blob_or_uri, Blob):

blob_or_uri = Blob.from_uri(blob_or_uri)

bucket_name = None

if isinstance(blob_or_uri, Blob):

bucket_name = blob_or_uri.bucket.name if blob_or_uri.bucket else None

elif isinstance(blob_or_uri, str) and blob_or_uri.startswith("gs://"):

try:

blob_or_uri = Blob.from_uri(blob_or_uri)

bucket_name = blob_or_uri.bucket.name

except Exception as e:

logger.warning(f"Failed to parse URI {blob_or_uri}: {e}")

with create_trace_span_helper(

self,

bucket_name,

name="Storage.Client.downloadBlobToFile",

):

if not isinstance(blob_or_uri, Blob):

blob_or_uri = Blob.from_uri(blob_or_uri)

References

Avoid broad except Exception: blocks that silently return None or pass. Instead, log the exception to aid in debugging and prevent masking underlying issues.

Remove duplicate lines of code to keep the codebase clean and avoid redundancy.

chandra-siri added 4 commits May 21, 2026 15:19

feat(storage): implement bounded LRU cache utility

5bde588

feat(storage): implement domain bucket metadata cache

d745c6f

feat(storage): implement trace span helpers and HTTP integration

d8466f0

feat(storage): integrate ACO tracing into public client and add syste…

33fce2f

…m tests

chandra-siri requested a review from a team as a code owner May 21, 2026 15:23

gemini-code-assist Bot reviewed May 21, 2026

View reviewed changes

parthea marked this pull request as draft May 21, 2026 15:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(storage): integrate ACO tracing into public client and add system tests#17223

feat(storage): integrate ACO tracing into public client and add system tests#17223
chandra-siri wants to merge 4 commits into
googleapis:mainfrom
chandra-siri:feat/gcs-aco-sdk-integration

chandra-siri commented May 21, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	match = re.search(r"/b/([^/?#]+)", kwargs.get("path", ""))
	match = _BUCKET_NAME_RE.search(kwargs.get("path", ""))

	logger.debug(f"Failed cache.get_or_queue_fetch in api_request: {e}")
	logger.debug(f"Failed cache.get in api_request: {e}")

Conversation

chandra-siri commented May 21, 2026

Description

Changes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant