fix(integrations): google-genai: reworked `gen_ai.request.messages` extraction from parameters #5275

constantinius · 2026-01-05T11:16:46Z

Description

Previously we only extracted only text parts were extracted. Now the full range of possibilities are covered.

Issues

Closes https://linear.app/getsentry/issue/TET-1638/redact-images-google-genai

…xtraction from parameters

linear · 2026-01-05T11:16:49Z

TET-1638 Redact images: Google GenAI

sentry_sdk/integrations/google_genai/utils.py

… if mime_type and file_uri are present (Cursor comment)

sentry_sdk/integrations/google_genai/utils.py

…i-report-image-inputs

github-actions · 2026-01-13T12:44:18Z

Semver Impact of This PR

🟢 Patch (bug fixes)

📋 Changelog Preview

This is how your changes will appear in the changelog.
Entries from this PR are highlighted with a left border (blockquote style).

New Features ✨

feat(ai): add parse_data_uri function to parse a data URI by constantinius in #5311
feat(asyncio): Add on-demand way to enable AsyncioIntegration by sentrivana in #5288
feat(openai-agents): Inject propagation headers for HostedMCPTool by alexander-alderman-webb in #5297
feat: Support array types for logs and metrics attributes by alexander-alderman-webb in #5314

Bug Fixes 🐛

Integrations

fix(integrations): google-genai: reworked gen_ai.request.messages extraction from parameters by constantinius in #5275

fix(integrations): pydantic-ai: properly format binary input message parts to be conformant with the gen_ai.request.messages structure by constantinius in #5251
fix(integrations): Anthropic: add content transformation for images and documents by constantinius in #5276
fix(integrations): langchain add multimodal content transformation functions for images, audio, and files by constantinius in #5278

Litellm

fix(litellm): fix gen_ai.request.messages to be as expected by constantinius in #5255
fix(litellm): Guard against module shadowing by alexander-alderman-webb in #5249

Other

fix(ai): redact message parts content of type blob by constantinius in #5243
fix(clickhouse): Guard against module shadowing by alexander-alderman-webb in #5250
fix(gql): Revert signature change of patched gql.Client.execute by alexander-alderman-webb in #5289
fix(grpc): Derive interception state from channel fields by alexander-alderman-webb in #5302
fix(pure-eval): Guard against module shadowing by alexander-alderman-webb in #5252
fix(ray): Guard against module shadowing by alexander-alderman-webb in #5254
fix(threading): Handle channels shadowing by sentrivana in #5299
fix(typer): Guard against module shadowing by alexander-alderman-webb in #5253
fix: Stop suppressing exception chains in AI integrations by alexander-alderman-webb in #5309
fix: Send client reports for span recorder overflow by sentrivana in #5310

Documentation 📚

docs(metrics): Remove experimental notice by alexander-alderman-webb in #5304
docs: Update Python versions banner in README by sentrivana in #5287

Internal Changes 🔧

Release

ci(release): Bump Craft version to fix issues by BYK in #5305
ci(release): Switch from action-prepare-release to Craft by BYK in #5290

Other

chore(gen_ai): add auto-enablement for google genai by shellmayr in #5295
chore: add unlabeled trigger to changelog-preview by BYK in #5315
chore: Add type for metric units by sentrivana in #5312
ci: Update tox and handle generic classifiers by sentrivana in #5306

_{🤖 This preview updates automatically when you update the PR.}

sentry_sdk/integrations/google_genai/utils.py

…i-report-image-inputs

sentrivana · 2026-01-14T14:40:18Z

sentry_sdk/integrations/google_genai/utils.py

+    if isinstance(function_response, dict):
+        tool_call_id = function_response.get("id")
+        tool_name = function_response.get("name")
+        response_dict = function_response.get("response") or {}
+        # Prefer "output" key if present, otherwise use entire response
+        output = response_dict.get("output", response_dict)
+    else:
+        # FunctionResponse object
+        tool_call_id = getattr(function_response, "id", None)
+        tool_name = getattr(function_response, "name", None)
+        response_obj = getattr(function_response, "response", None) or {}


I've seen this .get() vs getattr pattern a lot in our AI integrations. Feels like introducing a helper function that would try both at once would potentially deduplicate a lot of code.

Not something that needs to be done in this PR, mostly thinking out loud.

sentry_sdk/integrations/google_genai/utils.py

…edacted later

tests/integrations/google_genai/test_google_genai.py

…AI messages Add transform_content_part() and transform_message_content() functions to standardize content part handling across all AI integrations. These functions transform various SDK-specific formats (OpenAI, Anthropic, Google, LangChain) into a unified format: - blob: base64-encoded binary data - uri: URL references (including file URIs) - file: file ID references Also adds get_modality_from_mime_type() helper to infer content modality (image/audio/video/document) from MIME types.

…rmats Replace inline_data and file_data dict handling with the shared transform_content_part function. Keep Google SDK object handling and PIL.Image support local since those are Google-specific.

sentry_sdk/integrations/google_genai/utils.py

Add dedicated transform functions for each AI SDK: - transform_openai_content_part() for OpenAI/LiteLLM image_url format - transform_anthropic_content_part() for Anthropic image/document format - transform_google_content_part() for Google GenAI inline_data/file_data - transform_generic_content_part() for LangChain-style generic format Refactor transform_content_part() to be a heuristic dispatcher that detects the format and delegates to the appropriate specific function. This allows integrations to use the specific function directly for better performance and clarity, while maintaining backward compatibility through the dispatcher for frameworks that can receive any format. Added 38 new unit tests for the SDK-specific functions.

Replace generic transform_content_part with the Google-specific transform_google_content_part function for better performance and clarity since we know Google GenAI uses inline_data and file_data formats.

tests/integrations/google_genai/test_google_genai.py

…nds of inputs

sentry · 2026-01-16T10:20:50Z

sentry_sdk/integrations/google_genai/utils.py

+    if hasattr(part, "file_data") and part.file_data:
+        file_data = part.file_data
+        file_uri = getattr(file_data, "file_uri", None)
+        mime_type = getattr(file_data, "mime_type", None)
+        if file_uri and mime_type:
+            return {
+                "type": "blob",
+                "mime_type": mime_type,
+                "file_uri": file_uri,
+            }


Bug: The function _extract_part_content creates a non-standard format for file_data parts, using {"type": "blob", "file_uri": ...} instead of the expected {"type": "uri", "uri": ...}.
_{Severity: HIGH}

Suggested Fix

Update _extract_part_content to produce the standardized format for file_data. It should return a dictionary with {"type": "uri", "uri": file_data.file_uri, "modality": ..., "mime_type": ...}. This can be achieved by reusing or replicating the logic from transform_google_content_part. The associated tests should also be updated to assert the correct format.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid. Location: sentry_sdk/integrations/google_genai/utils.py#L323-L332 Potential issue: When processing a `genai_types.Part` object containing `file_data`, the `_extract_part_content` function generates a dictionary with `{"type": "blob", "file_uri": ...}`. This is inconsistent with the standardized format `{"type": "uri", "uri": ...}` used for dictionary-based parts. This inconsistency causes downstream issues, specifically preventing the `redact_blob_message_parts` function from redacting the URI, as it expects a `content` key for parts of type `blob`. The `modality` field is also missing from the generated dictionary.

_{Did we get this right? 👍 / 👎 to inform future reviews.}

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

cursor · 2026-01-16T10:28:18Z

sentry_sdk/integrations/google_genai/utils.py

+                "type": "blob",
+                "mime_type": mime_type,
+                "file_uri": file_uri,
+            }


Inconsistent output format between dict and Part object handling

Medium Severity

When handling file_data from Part objects, the code returns {"type": "blob", "file_uri": ...} but when handling the same data in dict format via transform_google_content_part, it returns {"type": "uri", "uri": ...}. Additionally, the Part object handling is missing the modality field that the standardized format includes. This inconsistency affects file_data handling (using "type": "blob" instead of "type": "uri", and "file_uri" instead of "uri"), inline_data handling (missing modality), and File object handling (same issues as file_data).

Additional Locations (2)

sentry_sdk/integrations/google_genai/utils.py#L258-L269

sentry_sdk/integrations/google_genai/utils.py#L339-L345

constantinius added 5 commits December 17, 2025 10:45

fix(ai): redact message parts content of type blob

1f32952

fix(ai): skip non dict messages

795bcea

fix(ai): typing

a623e13

fix(ai): content items may not be dicts

3d3ce5b

fix(integrations): google-genai: reworked gen_ai.request.messages e…

433bc88

…xtraction from parameters

constantinius requested a review from a team as a code owner January 5, 2026 11:16

cursor bot reviewed Jan 5, 2026

View reviewed changes

sentry_sdk/integrations/google_genai/utils.py Outdated Show resolved Hide resolved

sentry_sdk/integrations/google_genai/utils.py Show resolved Hide resolved

sentry_sdk/integrations/google_genai/utils.py Show resolved Hide resolved

fix(integrations): address cursor review comments

4244319

cursor bot reviewed Jan 8, 2026

View reviewed changes

sentry_sdk/integrations/google_genai/utils.py Outdated Show resolved Hide resolved

constantinius added 2 commits January 8, 2026 15:21

fix(integrations): ensure file_data returns valid blob structure only…

f72aa45

… if mime_type and file_uri are present (Cursor comment)

fix(integrations): add type ignore for missing PIL.Image import

2be0419

cursor bot reviewed Jan 8, 2026

View reviewed changes

sentry_sdk/integrations/google_genai/utils.py Outdated Show resolved Hide resolved

Base automatically changed from constantinius/fix/redact-message-parts-type-blob to master January 13, 2026 09:56

Merge branch 'master' into constantinius/fix/integrations/google-gena…

4abdcf8

…i-report-image-inputs

fix: linting issue and review comment

86f6ecb

alexander-alderman-webb reviewed Jan 14, 2026

View reviewed changes

sentry_sdk/integrations/google_genai/utils.py Outdated Show resolved Hide resolved

Merge branch 'master' into constantinius/fix/integrations/google-gena…

7e9335e

…i-report-image-inputs

sentrivana reviewed Jan 14, 2026

View reviewed changes

sentry bot reviewed Jan 14, 2026

View reviewed changes

sentry_sdk/integrations/google_genai/utils.py Outdated Show resolved Hide resolved

constantinius added 2 commits January 14, 2026 15:42

fix(integrations): google-genai do not encode binary data that gets r…

0355c63

…edacted later

fix(integrations): Use explicit None checks instead of or {} pattern

910c679

constantinius requested review from alexander-alderman-webb and sentrivana January 14, 2026 15:52

cursor bot reviewed Jan 14, 2026

View reviewed changes

tests/integrations/google_genai/test_google_genai.py Outdated Show resolved Hide resolved

constantinius added 3 commits January 15, 2026 14:01

Merge shared content transformation functions

e7eb226

refactor(google-genai): Use shared transform_content_part for dict fo…

fc6bbfe

…rmats Replace inline_data and file_data dict handling with the shared transform_content_part function. Keep Google SDK object handling and PIL.Image support local since those are Google-specific.

cursor bot reviewed Jan 15, 2026

View reviewed changes

sentry_sdk/integrations/google_genai/utils.py Show resolved Hide resolved

sentry_sdk/integrations/google_genai/utils.py Show resolved Hide resolved

constantinius added 3 commits January 15, 2026 15:39

Merge SDK-specific transform functions

ff7247b

refactor(google-genai): use transform_google_content_part directly

b9b629e

Replace generic transform_content_part with the Google-specific transform_google_content_part function for better performance and clarity since we know Google GenAI uses inline_data and file_data formats.

alexander-alderman-webb reviewed Jan 16, 2026

View reviewed changes

tests/integrations/google_genai/test_google_genai.py Show resolved Hide resolved

test: added comprehensive tests for direct API access with various ki…

b80f6e9

…nds of inputs

constantinius requested a review from alexander-alderman-webb January 16, 2026 10:17

sentry bot reviewed Jan 16, 2026

View reviewed changes

cursor bot reviewed Jan 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(integrations): google-genai: reworked `gen_ai.request.messages` extraction from parameters #5275

fix(integrations): google-genai: reworked `gen_ai.request.messages` extraction from parameters #5275

constantinius commented Jan 5, 2026

Uh oh!

linear bot commented Jan 5, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jan 13, 2026 •

edited

Loading

Uh oh!

Uh oh!

sentrivana Jan 14, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sentry bot Jan 16, 2026

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix(integrations): google-genai: reworked gen_ai.request.messages extraction from parameters #5275

Are you sure you want to change the base?

fix(integrations): google-genai: reworked gen_ai.request.messages extraction from parameters #5275

Conversation

constantinius commented Jan 5, 2026

Description

Issues

Uh oh!

linear bot commented Jan 5, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Semver Impact of This PR

📋 Changelog Preview

New Features ✨

Bug Fixes 🐛

Integrations

Litellm

Other

Documentation 📚

Internal Changes 🔧

Release

Other

Uh oh!

Uh oh!

sentrivana Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sentry bot Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Jan 16, 2026

Choose a reason for hiding this comment

Inconsistent output format between dict and Part object handling

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix(integrations): google-genai: reworked `gen_ai.request.messages` extraction from parameters #5275

fix(integrations): google-genai: reworked `gen_ai.request.messages` extraction from parameters #5275

github-actions bot commented Jan 13, 2026 •

edited

Loading