feat(bigframes): Defer unnamed @udf deployment until needed by TrevorBergeron · Pull Request #17217 · googleapis/google-cloud-python

TrevorBergeron · 2026-05-20T22:41:50Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

gemini-code-assist

Code Review

This pull request implements deferred deployment for unnamed User Defined Functions (UDFs) in BigFrames. Instead of provisioning UDFs immediately during registration, they are now represented as PythonUdf definitions and deployed only when the execution plan is prepared for BigQuery execution. The changes include adding tracking for deployed routines in the function session, a new plan-rewriting step in the caching executor to handle on-demand deployment, and updated data structures for UDF requirements. Feedback suggests parallelizing these deployments using asyncio.gather to improve performance when multiple UDFs are present in a single plan.

gemini-code-assist · 2026-05-20T22:44:28Z

+        for udf in unique_undeployed_udfs:
+            deployed_udf = await asyncio.to_thread(
+                session._function_session.deploy_undeployed_udf,
+                session,
+                udf,
+            )
+            deployed_mapping[udf] = deployed_udf


UDFs are currently deployed sequentially. Since each deployment involves network calls to BigQuery and resource provisioning, this can significantly delay query execution when multiple UDFs are used in a single plan. Parallelizing these deployments using asyncio.gather would improve performance.

# Deploy UDFs in parallel to improve performance tasks = [ asyncio.to_thread( session._function_session.deploy_undeployed_udf, session, udf, ) for udf in unique_undeployed_udfs ] results = await asyncio.gather(*tasks) deployed_mapping = dict(zip(unique_undeployed_udfs, results))

feat(bigframes): Defer unnamed @udf deployment until needed

a58279a

gemini-code-assist Bot reviewed May 20, 2026

View reviewed changes

fixes

50370dc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(bigframes): Defer unnamed @udf deployment until needed#17217

feat(bigframes): Defer unnamed @udf deployment until needed#17217
TrevorBergeron wants to merge 2 commits into
mainfrom
tbergeron_defer_udf_create

TrevorBergeron commented May 20, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

TrevorBergeron commented May 20, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant