UN-3215 [FIX] Add LLMCompat bridge class to fix retriever LLM compatibility with llama-index by pk-zipstack · Pull Request #1788 · Zipstack/unstract

pk-zipstack · 2026-02-12T19:55:05Z

What

Added LLMCompat class in SDK1 that emulates the llama-index LLM interface without requiring llama-index as a dependency
Added RetrieverLLM class in prompt-service that inherits from llama-index's LLM base class and delegates calls to LLMCompat
Updated all affected prompt-service retrievers to use RetrieverLLM when passing LLM to llama-index components
Fixed SubQuestionQueryEngine failing due to missing llama-index-question-gen-openai package

Why

After the SDK1 migration, unstract.sdk1.llm.LLM uses litellm.completion() directly and does not inherit from llama_index.core.llms.llm.LLM
Prompt-service retrievers (KeywordTable, Subquestion, Fusion, Router) pass this LLM to llama-index components that expect the llama-index base class
This causes 'LLM' object has no attribute 'predict' errors at runtime for all retrieval strategies except Simple
Additionally, llama-index's resolve_llm() asserts isinstance(llm, LLM) which fails for plain classes — breaking Fusion, Router, and Subquestion retrievers
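
Both failure modes can be reproduced without llama-index installed. The following is a minimal sketch under stated assumptions: `FrameworkLLM`, `PlainLLM`, and `resolve` are hypothetical stand-ins for illustration, not the actual llama-index or SDK1 classes.

```python
class FrameworkLLM:
    """Stand-in for a framework base class that provides predict()."""

    def complete(self, prompt: str) -> str:
        raise NotImplementedError

    def predict(self, template: str, **kwargs: str) -> str:
        # The base class builds the prompt, then calls complete().
        return self.complete(template.format(**kwargs))


class PlainLLM:
    """Stand-in for a class that calls the provider directly."""

    def complete(self, prompt: str) -> str:
        return f"echo: {prompt}"


def resolve(llm: object) -> FrameworkLLM:
    """Stand-in for a resolver that insists on the framework base class."""
    assert isinstance(llm, FrameworkLLM), "llm must be a FrameworkLLM"
    return llm


plain = PlainLLM()

# Failure 1: predict() is never inherited, so the attribute is missing.
has_predict = hasattr(plain, "predict")

# Failure 2: the isinstance assertion rejects the plain class.
try:
    resolve(plain)
    resolved = True
except AssertionError:
    resolved = False
```

Duck typing alone would fix the first failure but not the second, which is why the fix needs a class that actually inherits from the framework base.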

How

Two-layer architecture:

LLMCompat (unstract/sdk1/src/unstract/sdk1/llm.py): A plain Python class (no llama-index dependency) that emulates the llama-index LLM interface. Uses local dataclass emulations of llama-index types (ChatMessage, ChatResponse, CompletionResponse, LLMMetadata, MessageRole). Follows the existing EmbeddingCompat initialization pattern — takes raw adapter params and creates an SDK1 LLM internally. Implements chat(), complete(), predict(), achat(), acomplete(), apredict().
RetrieverLLM (prompt-service/.../core/retrievers/retriever_llm.py): Inherits from llama_index.core.llms.llm.LLM (passing resolve_llm() isinstance checks) and delegates all LLM calls to an internal LLMCompat instance. Converts emulated return types back to real llama-index types.
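
The two layers can be pictured roughly as follows. This is a simplified sketch, not the PR's code: `FakeSdkLLM`, `CompatLayer`, `FrameworkBase`, and `FrameworkAdapter` are hypothetical stand-ins, and the emulated types carry only the fields needed for the illustration.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum


class MessageRole(str, Enum):
    USER = "user"
    ASSISTANT = "assistant"


@dataclass
class ChatMessage:
    role: MessageRole
    content: str | None = None


@dataclass
class ChatResponse:
    message: ChatMessage


@dataclass
class CompletionResponse:
    text: str


class FakeSdkLLM:
    """Hypothetical stand-in for the SDK1 LLM."""

    def complete(self, prompt: str) -> str:
        return prompt.upper()


class CompatLayer:
    """Layer 1: framework-shaped interface with no framework import."""

    def __init__(self, llm: FakeSdkLLM) -> None:
        self._llm = llm

    def complete(self, prompt: str) -> CompletionResponse:
        return CompletionResponse(text=self._llm.complete(prompt))

    def chat(self, messages: list[ChatMessage]) -> ChatResponse:
        # Flatten the conversation into one prompt for the inner LLM.
        prompt = "\n".join(m.content or "" for m in messages)
        reply = ChatMessage(MessageRole.ASSISTANT, self._llm.complete(prompt))
        return ChatResponse(message=reply)


class FrameworkBase:
    """Hypothetical stand-in for the llama-index LLM base class."""


class FrameworkAdapter(FrameworkBase):
    """Layer 2: inherits the framework base and delegates to layer 1."""

    def __init__(self, compat: CompatLayer) -> None:
        self._compat = compat

    def complete(self, prompt: str) -> str:
        # A real adapter would convert to the framework's own response type.
        return self._compat.complete(prompt).text


adapter = FrameworkAdapter(CompatLayer(FakeSdkLLM()))
```

Splitting the bridge this way keeps the framework import in the service that already depends on it, while the SDK side stays dependency-free.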

Supporting changes:

llm_helper.py: Helper to convert SDK1 LLM → RetrieverLLM instance
base_retriever.py: Added llama_index_llm property that lazily creates and caches the RetrieverLLM wrapper
Retriever updates: Replaced self.llm with self.llama_index_llm where passed to llama-index components (keyword_table, subquestion, fusion, router)
SubQuestion fix: Explicitly pass LLMQuestionGenerator to SubQuestionQueryEngine.from_defaults() to avoid import of llama-index-question-gen-openai

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)

No. LLMCompat and RetrieverLLM are only used when retrievers pass LLM to llama-index components. The Simple, Automerging, and Recursive retrievers don't pass LLM to llama-index and remain unchanged. The bridge delegates all calls to the existing SDK1 LLM, so no behavior changes.

Database Migrations

None

Env Config

None

Relevant Docs

N/A

Related Issues or PRs

[FEAT] Rewrite LLMCompat to emulate llama-index interface without dependency #1793

Dependencies Versions

No new dependencies added. LLMCompat uses emulated types (no llama-index imports in SDK1). RetrieverLLM uses llama-index core classes already present in the prompt-service.

Notes on Testing

Set retrieval strategy to Keyword Table, Subquestion, Fusion, or Router in Prompt Studio with chunk_size > 0 and run extraction — should no longer error with 'LLM' object has no attribute 'predict'
Verify Simple, Automerging, Recursive strategies still work (regression check)

Screenshots

N/A

Checklist

I have read and understood the Contribution Guidelines.

coderabbitai · 2026-02-12T19:55:17Z

Walkthrough

The changes introduce a compatibility layer to adapt the SDK1 LLM interface for use with llama-index retriever components. This includes new emulated llama-index types (MessageRole, ChatMessage, ChatResponse, CompletionResponse, LLMMetadata) and a wrapper class LLMCompat in SDK1, along with a RetrieverLLM adapter in the prompt service that bridges the two interfaces.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| SDK1 Compatibility Types `unstract/sdk1/src/unstract/sdk1/llm.py` | Introduces emulated llama-index types (MessageRole, ChatMessage, ChatResponse, CompletionResponse, LLMMetadata) and LLMCompat wrapper class providing sync/async chat and completion interfaces that delegate to the underlying LLM instance. |
| Retriever LLM Adapter `prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py` | New file containing RetrieverLLM class that adapts an SDK1 LLMCompat instance to conform to llama-index's LLM interface with chat, complete, and async methods; streaming methods raise NotImplementedError. |
| Base Retriever Integration `prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py` | Adds RetrieverLLM import and introduces static `_get_llm()` method to convert SDK1 LLM instances to RetrieverLLM for llama-index compatibility. |
| Retriever Implementation Updates `prompt-service/src/unstract/prompt_service/core/retrievers/fusion.py`, `prompt-service/src/unstract/prompt_service/core/retrievers/keyword_table.py`, `prompt-service/src/unstract/prompt_service/core/retrievers/subquestion.py` | Minor updates including comment removal and addition of LLMQuestionGenerator import and integration in SubQuestionQueryEngine initialization. |

Sequence Diagram

sequenceDiagram
    participant Client
    participant Retriever as BaseRetriever
    participant Converter as _get_llm()
    participant LLMCompat
    participant RetrieverLLM
    participant LlamaIndex as llama-index<br/>Components

    Client->>Retriever: Initialize with LLM
    Retriever->>Converter: _get_llm(llm)
    Converter->>LLMCompat: Wrap LLM instance
    LLMCompat-->>Converter: Return LLMCompat
    Converter->>RetrieverLLM: Initialize with LLMCompat
    RetrieverLLM-->>Converter: Return RetrieverLLM instance
    Converter-->>Retriever: Return RetrieverLLM
    Retriever->>LlamaIndex: Pass RetrieverLLM to retriever ops
    LlamaIndex->>RetrieverLLM: Call chat/complete methods
    RetrieverLLM->>LLMCompat: Delegate to internal LLMCompat
    LLMCompat->>LLMCompat: Route to chat/complete/predict
    LLMCompat-->>RetrieverLLM: Return ChatResponse/CompletionResponse
    RetrieverLLM-->>LlamaIndex: Return response

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 72.50%, which is insufficient. The required threshold is 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Title check | ✅ Passed | The title clearly describes the main change: adding LLMCompat bridge class to fix retriever LLM compatibility with llama-index. It is concise, specific, and directly related to the primary objective of the PR. |
| Description check | ✅ Passed | The description is comprehensive and well-structured, covering all required template sections including What, Why, How, backward compatibility, testing notes, and related issues. All critical information is provided. |

for more information, see https://pre-commit.ci

…endency (#1793)

* [FEAT] Rewrite LLMCompat to emulate llama-index interface without dependency

- Add emulated llama-index types (MessageRole, ChatMessage, ChatResponse, CompletionResponse, LLMMetadata) as dataclasses
- Rewrite LLMCompat to match llama-index LLM interface without inheritance
- Implement chat(), complete(), achat(), acomplete() for retriever compatibility
- Follow EmbeddingCompat initialization pattern (takes adapter params directly)
- Remove llama-index imports from llm.py
- Raise NotImplementedError for streaming methods not needed by retrievers

This allows SubQuestionQueryEngine, QueryFusionRetriever, and other llama-index components to use LLMCompat without requiring llama-index as a dependency.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

* Added retrieverLLM class to handle issues with retrievers

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: pk-zipstack <praveen@zipstack.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

coderabbitai

Actionable comments posted: 4

🧹 Nitpick comments (6)

unstract/sdk1/src/unstract/sdk1/llm.py (4)
36-43: Prefer enum.StrEnum over (str, Enum) dual inheritance.

Python 3.11+ provides StrEnum, which is the idiomatic replacement. Note that the codebase's use of | union types only establishes Python 3.10+, so confirm that the project's minimum supported version is 3.11 or later before switching.
♻️ Suggested change
-from enum import Enum
+from enum import StrEnum

-class MessageRole(str, Enum):
+class MessageRole(StrEnum):
     """Emulates llama_index.core.base.llms.types.MessageRole."""
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@unstract/sdk1/src/unstract/sdk1/llm.py` around lines 36 - 43, Replace the
MessageRole class to inherit from enum.StrEnum instead of (str, Enum): import
StrEnum from enum and change the class definition for MessageRole to subclass
StrEnum; keep the same member names and values (SYSTEM, USER, ASSISTANT,
FUNCTION, TOOL) so behavior and comparisons remain identical.
659-676: predict and apredict assume a PromptTemplate-like interface on prompt.

These methods call prompt.format_messages(llm=self, ...) and prompt.format(llm=self, ...), expecting a llama-index PromptTemplate object. The type is annotated as Any, which hides this contract. Since llama-index's LLM.predict receives a BasePromptTemplate, this should work — but if ever called with a plain string, it'll raise AttributeError.

This is acceptable for the bridge use case, but documenting the expected type (even as a comment) would help maintainability.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@unstract/sdk1/src/unstract/sdk1/llm.py` around lines 659 - 676, predict and
apredict assume a PromptTemplate-like object (they call
prompt.format_messages(...) and prompt.format(...)) but are annotated as Any;
update the contract by changing the type annotation from Any to the appropriate
llama-index type (e.g. BasePromptTemplate or PromptTemplate) or at minimum add a
clear inline comment/docstring stating the expected type, and add a simple
runtime check in predict and apredict that raises a descriptive TypeError if the
prompt lacks format/format_messages (e.g. if not hasattr(prompt, "format") and
not hasattr(prompt, "format_messages")), referencing the predict/apredict
methods and self.metadata.is_chat_model, so callers get a clear error instead of
an AttributeError.
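
One way to make that contract explicit is a small guard called at the top of predict() and apredict(). The sketch below is illustrative only: `ensure_prompt_template` and `FakeTemplate` are hypothetical names. It requires both methods rather than either one, because a plain `str` already has a `format` method and would slip through a check that accepts either attribute.

```python
from typing import Any


def ensure_prompt_template(prompt: Any) -> None:
    """Raise TypeError unless `prompt` exposes the methods predict() relies on."""
    required = ("format", "format_messages")
    missing = [name for name in required if not callable(getattr(prompt, name, None))]
    if missing:
        raise TypeError(
            f"prompt must be a prompt-template object; {type(prompt).__name__} "
            f"is missing: {', '.join(missing)}"
        )


class FakeTemplate:
    """Hypothetical stand-in for a llama-index prompt template."""

    def format(self, **kwargs: Any) -> str:
        return "formatted prompt"

    def format_messages(self, **kwargs: Any) -> list:
        return []


ensure_prompt_template(FakeTemplate())  # passes silently

error_text = ""
try:
    ensure_prompt_template("a plain string")
except TypeError as exc:
    error_text = str(exc)
```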
574-628: LLMCompat creates a second LLM instance — consider accepting an existing one.

RetrieverLLM.__init__ (in retriever_llm.py, Line 33-43) reads private attributes (llm._adapter_id, llm._adapter_metadata, etc.) from the SDK1 LLM to construct LLMCompat, which then creates another LLM internally (Line 613). This means every retriever bridge creates a duplicate LLM with duplicate adapter validation, platform config fetching, and callback setup.

Consider allowing LLMCompat to accept an existing LLM instance directly rather than always constructing a new one, to avoid this duplication and the reliance on private attributes.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@unstract/sdk1/src/unstract/sdk1/llm.py` around lines 574 - 628, LLMCompat
currently always constructs a new LLM (in LLMCompat.__init__) causing duplicate
LLMs; change the constructor to accept an optional existing LLM instance
parameter (e.g., llm_instance: LLM | None = None) and if provided set
self._llm_instance = llm_instance and skip creating a new LLM, while still
setting self._tool, self._adapter_instance_id, self.model_name =
self._llm_instance.get_model_name(), and self.callback_manager =
self._llm_instance.callback_manager (or None) to preserve behavior; retain the
original parameter list and only construct a new LLM when llm_instance is None
so RetrieverLLM can pass the SDK1 LLM directly and avoid duplicated adapter
validation and config fetching.
804-815: Add comment documenting the duck-typing contract for ChatMessage compatibility.

_to_litellm_messages relies on duck typing to accept both the emulated ChatMessage (defined in this file) and real llama_index.core.base.llms.types.ChatMessage objects, since achat() and acomplete() may receive either. Both provide .role and .content attributes, with the getattr(m.role, "value", ...) pattern handling both enum and string variants.

While the .content attribute is maintained in llama-index as a backward-compatibility property (returning str | None from the underlying block-based structure), a brief comment here clarifying the intentional duck-typing contract would improve maintainability and signal to future maintainers why both types are accepted.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@unstract/sdk1/src/unstract/sdk1/llm.py` around lines 804 - 815, Add a brief
doc-comment above the _to_litellm_messages function explaining the duck-typing
contract: that the function accepts both the emulated ChatMessage defined in
this module and llama_index.core.base.llms.types.ChatMessage objects because
both expose .role and .content, and that getattr(m.role, "value", str(m.role))
handles enum or string roles while m.content may be str | None; mention this is
intentional to support inputs from achat() and acomplete().
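
The duck-typing contract described above can be sketched as follows. The `getattr(m.role, "value", str(m.role))` expression is taken from the review comment; `Role`, `EnumMessage`, `StringMessage`, and the choice to map a `None` content to an empty string are assumptions made for this illustration.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum
from typing import Any


class Role(str, Enum):
    USER = "user"


@dataclass
class EnumMessage:
    """Message whose role is an enum member (has a .value attribute)."""

    role: Role
    content: str | None


@dataclass
class StringMessage:
    """Message whose role is already a plain string."""

    role: str
    content: str | None


def to_litellm_messages(messages: list[Any]) -> list[dict[str, str]]:
    """Accept any object exposing .role and .content (duck typing)."""
    return [
        {
            # Enum roles expose .value; plain strings fall back to str().
            "role": getattr(m.role, "value", str(m.role)),
            # Assumption: a missing content is sent as an empty string.
            "content": m.content or "",
        }
        for m in messages
    ]


converted = to_litellm_messages(
    [EnumMessage(Role.USER, "hi"), StringMessage("user", None)]
)
```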
prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py (1)
15-15: Remove unused noqa directives flagged by Ruff.

Static analysis (Ruff RUF100) reports 10 unused noqa directives across this file (N811, ANN401). These should be removed to keep the codebase clean.

Also applies to: 33-33, 57-57, 72-72, 80-80, 88-88, 97-97, 112-112, 120-120, 128-128
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py`
at line 15, Remove the redundant noqa directives reported by Ruff (RUF100) in
retriever_llm.py: delete the unused "# noqa: N811" on the import "from
llama_index.core.llms.llm import LLM as LlamaIndexBaseLLM" and likewise remove
the other unused "# noqa" tags on the affected lines (the import/annotation
lines referencing LlamaIndexBaseLLM and any annotated functions/classes flagged
with ANN401). Ensure you only remove the "# noqa" comments (not the imports or
annotations themselves), run Ruff/pytest to verify no new linter errors, and
keep the symbols like LlamaIndexBaseLLM unchanged so references in the file
remain valid.
llm-predict-error-investigation.md (1)
1-221: Debug artifact should not be committed to main.

This file reads as a transient investigation note, not stable documentation. Committing it to main introduces noise with no durable value: it already contains stale content (see below) and will continue diverging from the codebase over time.

If a record of the design decision is needed, consider:

Converting it to an Architecture Decision Record (ADR) in a docs/adr/ folder, keeping only the decision, rationale, and final chosen approach.

Moving the content into the PR description (where it already lives as context) and dropping the file entirely.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@llm-predict-error-investigation.md` around lines 1 - 221, This investigation
file is a debug artifact and should not be committed to main; remove
llm-predict-error-investigation.md from the branch (or revert the commit) and
either convert its essential outcome into a concise ADR under docs/adr/
(summarizing the chosen approach such as the SDK1LLMBridge, use of LiteLLM, or
SimpleKeywordTableIndex and the BaseRetriever/KeywordTableIndex impact) or move
the full investigative content into the PR description/temporary notes, keeping
only the final decision and rationale in the repo.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@llm-predict-error-investigation.md`:
- Around line 150-184: Update the document's Option 1 example to match the
actual two-layer implementation: show LLMCompat (from
unstract/sdk1/src/unstract/sdk1/llm.py) that emulates llama-index types and
RetrieverLLM (from prompt-service/.../core/retrievers/retriever_llm.py) which
inherits from llama_index.core.llms.llm.LLM and delegates to LLMCompat; replace
the single-class SDK1LLMBridge/CustomLLM example with a short description and
class names matching LLMCompat -> RetrieverLLM delegation and note that
RetrieverLLM inherits from llama_index.core.llms.llm.LLM (not CustomLLM).
- Line 5: Add explicit language specifiers to the two fenced code blocks to
satisfy MD040: change the traceback block that starts with "ERROR:
unstract.prompt_service.core.retrievers.keyword_table:" to use ```text and
change the block containing "BaseLLM -> LLM (has predict()) ->
CustomLLM/FunctionCallingLLM -> Provider" to use ```python so markdownlint
recognizes their languages.

In `@prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py`:
- Around line 33-43: Declare _compat as a Pydantic PrivateAttr at the class
level for v2 compatibility: import PrivateAttr from pydantic and add a class
attribute like "_compat: Any = PrivateAttr()" (or similar typed PrivateAttr) on
the retriever class, then keep the existing assignment to self._compat in
__init__ where LLMCompat(...) is created (referencing __init__, LLMCompat, and
the _compat attribute).

In `@unstract/sdk1/src/unstract/sdk1/llm.py`:
- Around line 698-728: LLMCompat.chat() and LLMCompat.complete() call
litellm.completion() directly, skipping the SDK’s usage recording and structured
error wrapping used elsewhere; update these methods to (1) call into the
internal LLM instance’s usage recorder (e.g., invoke
self._llm_instance._record_usage(...) or the same _record_usage flow used by
LLM.complete) after getting the response so usage/audit data is emitted, (2)
wrap litellm.completion() in a try/except and re-raise errors as LLMError to
match callers' expectations, and (3) if metrics capture is required, apply the
same `@capture_metrics` behavior or add a TODO comment documenting intentional
omission; locate changes in the LLMCompat.chat and LLMCompat.complete methods
around the litellm.completion(...) calls and the response handling to implement
these fixes.
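
The shape of the requested change, reduced to its control flow, might look like the sketch below. It is not the SDK's implementation: `LLMError` here is a local stand-in, and `guarded_completion`, `provider_call`, and `record_usage` are hypothetical names standing in for the litellm call and the SDK's usage recorder.

```python
from typing import Callable


class LLMError(Exception):
    """Hypothetical stand-in for the SDK's structured LLM error."""


def guarded_completion(
    provider_call: Callable[[str], dict],
    record_usage: Callable[[dict], None],
    prompt: str,
) -> dict:
    try:
        response = provider_call(prompt)
    except Exception as exc:
        # Re-raise in the error type that callers already handle.
        raise LLMError(f"LLM call failed: {exc}") from exc
    # Emit usage data only after a successful response.
    record_usage(response)
    return response


usage_log: list = []


def ok_call(prompt: str) -> dict:
    return {"text": prompt, "total_tokens": 3}


def failing_call(prompt: str) -> dict:
    raise TimeoutError("provider timed out")


result = guarded_completion(ok_call, usage_log.append, "hello")
```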

llm-predict-error-investigation.md

prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py

unstract/sdk1/src/unstract/sdk1/llm.py

llm-predict-error-investigation.md

prompt-service/src/unstract/prompt_service/utils/llm_helper.py

harini-venkataraman

Added some comments. Please address. Otherwise LGTM

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

hari-kuriakose

@pk-zipstack LGTM overall.

Let's address the minor comments though, from both humans and AI.

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py

prompt-service/src/unstract/prompt_service/core/retrievers/fusion.py

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py

prompt-service/src/unstract/prompt_service/utils/llm_helper.py

…e_retriever.py Co-authored-by: Hari John Kuriakose <hari@zipstack.com> Signed-off-by: Praveen Kumar <praveen@zipstack.com>

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In
`@prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py`:
- Around line 28-41: The constructor is using an undefined _get_llm and never
initializes the cache causing NameError/AttributeError and potential
double-wrapping of LLMs; fix by (1) replacing the call to _get_llm with the
correct factory (e.g., get_llm) when assigning self.llm (or import/define
_get_llm if that was intended), (2) initializing self._llama_index_llm = None in
__init__, and (3) adjust the llama_index_llm property to avoid double-bridging
by checking types: if self._llama_index_llm is None then if isinstance(self.llm,
RetrieverLLM) set self._llama_index_llm = get_llama_index_llm(self.llm) (or use
the existing RetrieverLLM directly) otherwise convert once with
get_llama_index_llm(self.llm), then return self._llama_index_llm.

📥 Commits

Reviewing files that changed from the base of the PR and between 0be9fb1 and 4a6c392.

📒 Files selected for processing (1)

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py

- Remove llm_helper.py; move conversion to _get_llm staticmethod in BaseRetriever
- Convert SDK1 LLM to RetrieverLLM eagerly in constructor
- Remove llama_index_llm lazy property
- Revert all retrievers to use self.llm directly

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

for more information, see https://pre-commit.ci

sonarqubecloud · 2026-03-03T10:00:50Z

Quality Gate passed

Issues
2 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

coderabbitai

🧹 Nitpick comments (1)

prompt-service/src/unstract/prompt_service/core/retrievers/subquestion.py (1)

43-50: Fail fast when self.llm is missing before building subquestion components.

Add an explicit guard before constructing question_gen and query engine so error handling is deterministic and immediate.

Suggested patch

             query_bundle = QueryBundle(query_str=self.prompt)

+            if self.llm is None:
+                raise RetrievalError(
+                    "SubquestionRetriever requires an initialized LLM instance."
+                )
+
             question_gen = LLMQuestionGenerator.from_defaults(
                 llm=self.llm,
             )
             query_engine = SubQuestionQueryEngine.from_defaults(
                 query_engine_tools=query_engine_tools,
                 question_gen=question_gen,
                 use_async=True,
                 llm=self.llm,
             )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@prompt-service/src/unstract/prompt_service/core/retrievers/subquestion.py`
around lines 43 - 50, Before calling LLMQuestionGenerator.from_defaults and
SubQuestionQueryEngine.from_defaults, add an explicit guard that checks self.llm
and raises a clear exception (e.g., ValueError) if it is None or falsy so
construction fails fast; update the code path where question_gen and
query_engine are created (LLMQuestionGenerator.from_defaults,
SubQuestionQueryEngine.from_defaults) to perform this check first and
return/raise immediately with a descriptive message referencing self.llm.

📥 Commits

Reviewing files that changed from the base of the PR and between 4a6c392 and a1040c1.

📒 Files selected for processing (4)

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py
prompt-service/src/unstract/prompt_service/core/retrievers/fusion.py
prompt-service/src/unstract/prompt_service/core/retrievers/keyword_table.py
prompt-service/src/unstract/prompt_service/core/retrievers/subquestion.py

✅ Files skipped from review due to trivial changes (2)

prompt-service/src/unstract/prompt_service/core/retrievers/keyword_table.py
prompt-service/src/unstract/prompt_service/core/retrievers/fusion.py

hari-kuriakose · 2026-03-13T03:20:12Z

@pk-zipstack Let's address the following:

Accessing private attributes of LLM in RetrieverLLM

retriever_llm.py:43-49 directly accesses llm._adapter_id, llm._adapter_metadata, llm._adapter_instance_id, llm._tool, llm._usage_kwargs, llm._capture_metrics. This tightly couples RetrieverLLM to the internal implementation of LLM. If LLM's private attributes are renamed or restructured, RetrieverLLM will silently break.

Add public accessors on LLM (e.g. @property adapter_id) or a factory method like LLMCompat.from_llm(llm) that encapsulates the extraction.
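
The factory-method option could look roughly like this. It is a sketch under assumptions: `SdkLLM` and `Compat` are hypothetical stand-ins, and only two of the private attributes named above are shown.

```python
class SdkLLM:
    """Hypothetical stand-in for the SDK1 LLM with private state."""

    def __init__(self, adapter_instance_id: str, tool: object) -> None:
        self._adapter_instance_id = adapter_instance_id
        self._tool = tool


class Compat:
    """Hypothetical stand-in for the compat layer, living in the same package."""

    def __init__(self, adapter_instance_id: str, tool: object) -> None:
        self.adapter_instance_id = adapter_instance_id
        self.tool = tool

    @classmethod
    def from_llm(cls, llm: SdkLLM) -> "Compat":
        # Private attribute access stays inside the package that owns SdkLLM,
        # so a rename only has to be fixed in one place.
        return cls(adapter_instance_id=llm._adapter_instance_id, tool=llm._tool)


# Callers in other packages never touch the underscored attributes.
compat = Compat.from_llm(SdkLLM("adapter-123", tool=None))
```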

`litellm.drop_params = True` is set as a global side effect

In LLMCompat.chat(), .complete(), .achat(), .acomplete(), litellm.drop_params is set to True as a module-level global on every call. This affects all litellm usage in the process, not just the compat layer.

If this is required, it should either be set once at module load, or the existing LLM class's approach to this setting should be reused.
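
The side effect is easy to demonstrate with a stand-in for the module. In this sketch, `fake_litellm` is a `SimpleNamespace` playing the role of a module with a global flag; `compat_complete` and `unrelated_caller` are hypothetical names.

```python
from types import SimpleNamespace

# Hypothetical stand-in for a third-party module exposing a global flag.
fake_litellm = SimpleNamespace(drop_params=False)


def compat_complete(prompt: str) -> str:
    # Per-call mutation: this changes the flag for every caller in the process.
    fake_litellm.drop_params = True
    return prompt


def unrelated_caller() -> bool:
    # Code elsewhere in the process that reads the same global flag.
    return fake_litellm.drop_params


before = unrelated_caller()
compat_complete("hi")
after = unrelated_caller()
```

Setting the flag once at import time does not remove the global effect, but it makes the effect deterministic instead of dependent on which code path ran first.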

No unit tests

There is no test coverage.

We will require unit tests at the level of RetrieverLLM and LLMCompat layers.

hari-kuriakose

@pk-zipstack LGTM overall.

Please check the comments though.

- Add LLMCompat.from_llm() classmethod to encapsulate access to LLM's private attributes within SDK1, avoiding cross-package coupling
- Update RetrieverLLM to use the factory method instead of accessing LLM._adapter_id, _adapter_metadata, etc. directly
- Add 20 unit tests covering: LLMCompat.from_llm(), RetrieverLLM isinstance checks, chat/complete/achat/acomplete delegation, type conversion, emulated types, and _to_litellm_messages
- Add prompt-service tox environment for running tests
- Add pytest-asyncio and pytest-md-report to test dependencies

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

greptile-apps · 2026-03-13T06:28:30Z

Greptile Summary

This PR introduces a two-layer bridge architecture (LLMCompat in SDK1 + RetrieverLLM in prompt-service) to fix runtime AttributeErrors caused by SDK1's LLM no longer inheriting from llama-index's LLM base class after the SDK1 migration. It also fixes the SubQuestionQueryEngine dependency on the optional llama-index-question-gen-openai package. The fix is well-scoped: Simple, Automerging, and Recursive retrievers are untouched, and the bridge only activates when LLM-consuming retrievers are used.

Key changes:

LLMCompat emulates the llama-index LLM interface using plain dataclasses, avoiding a llama-index dependency in SDK1. It correctly flattens multi-turn message lists (including system prompts) via _messages_to_prompt and delegates all calls to the inner LLM instance.
RetrieverLLM inherits from llama-index's LLM base class (satisfying resolve_llm() isinstance checks) and properly declares _compat as a Pydantic PrivateAttr.
BaseRetriever gains a lazy llm property (avoiding unnecessary adapter init for non-LLM retrievers) and a require_llm() guard to fail fast with a clear error rather than silently falling back to OpenAI.
litellm.drop_params = True is correctly moved from per-call mutation to module-level initialization, preventing race conditions in concurrent environments.
pytest-asyncio is added to both SDK1 and prompt-service test deps, and a new tox environment runs the prompt-service unit tests in isolation.
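The lazy `llm` property described in the list above can be sketched as follows. This is an illustration under assumptions, not the PR's code: `LazyRetrieverSketch`, the `wrap` factory argument, and `fake_wrap` are hypothetical names.

```python
from __future__ import annotations

from typing import Any, Callable, Optional


class LazyRetrieverSketch:
    """Hypothetical retriever base showing lazy, cached wrapper construction."""

    def __init__(self, llm: Optional[Any], wrap: Callable[[Any], Any]) -> None:
        self._llm = llm  # raw LLM, kept as-is
        self._wrap = wrap  # hypothetical factory for the bridge object
        self._wrapped: Optional[Any] = None

    @property
    def llm(self) -> Optional[Any]:
        if self._llm is None:
            return None
        if self._wrapped is None:
            # Built on first access only; later accesses reuse the same object.
            self._wrapped = self._wrap(self._llm)
        return self._wrapped


calls: list[Any] = []


def fake_wrap(raw: Any) -> dict[str, Any]:
    calls.append(raw)
    return {"wrapped": raw}


retriever = LazyRetrieverSketch(llm="raw-llm", wrap=fake_wrap)
calls_before_access = len(calls)
first, second = retriever.llm, retriever.llm
```

Retrievers that never read `llm` never pay for the wrapper's construction.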

Issues found:

LLMCompat.__init__() calls CallbackManager.set_callback(model=self, ...) after the inner LLM has already registered its own callback. Since all LiteLLM calls route through self._llm_instance, the wrapper's callback registration may be redundant and could cause double-counting for non-public adapters.
formatted parameter is accepted by complete() and acomplete() but silently dropped before delegating to _llm_instance.complete().
test_retriever_llm.py lacks coverage for the require_llm() error path (raising ValueError when _llm is None).

Confidence Score: 3/5

The core bridge logic is sound and fixes real runtime failures, but a potential double-callback registration in LLMCompat.__init__() should be verified before merging.
The two-layer bridge architecture correctly solves the llama-index isinstance compatibility problem, the PrivateAttr and require_llm() concerns from prior review rounds have been addressed, and litellm.drop_params is properly moved to module level. However, the redundant CallbackManager.set_callback() call in LLMCompat.__init__() (registering a callback for the wrapper whose LiteLLM calls are never made through it) is a real concern for usage auditing in non-public adapter deployments. Additionally, the formatted parameter drop and the missing require_llm() test coverage are minor but worth addressing before the code pattern proliferates.
unstract/sdk1/src/unstract/sdk1/llm.py — specifically the LLMCompat.__init__() CallbackManager registration block around lines 693–710.

Important Files Changed

Filename	Overview
unstract/sdk1/src/unstract/sdk1/llm.py	Adds `LLMCompat` class with emulated llama-index types; moves `litellm.drop_params = True` to module level (good fix). Notable concerns: double `CallbackManager` registration in `__init__()`, `formatted` parameter silently dropped in `complete()`/`acomplete()`, and `_messages_to_prompt` flattening all messages into a single string (addressed from prior thread).
prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py	New `RetrieverLLM` bridge class correctly uses `PrivateAttr` for Pydantic compatibility (prior thread concern addressed), inherits from llama-index `LLM` base class, and delegates all calls to an internal `LLMCompat`. Stream methods raise `NotImplementedError` as expected.
prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py	Adds lazy `llm` property and `require_llm()` guard method addressing prior thread concerns about eager construction and silent OpenAI fallback. Clean implementation.
prompt-service/src/unstract/prompt_service/core/retrievers/subquestion.py	Correctly uses `require_llm()` guard, explicitly passes `LLMQuestionGenerator` to avoid `llama-index-question-gen-openai` import, and passes `RetrieverLLM` to both `question_gen` and the query engine synthesizer.
unstract/sdk1/tests/test_llm_compat.py	Comprehensive test suite for `LLMCompat` with `pytest-asyncio` added to deps and `_system_prompt` set on the mock (prior thread concern addressed). Covers delegation, emulated types, `_messages_to_prompt`, and streaming stubs.
prompt-service/src/unstract/prompt_service/tests/unit/test_retriever_llm.py	Good coverage of `RetrieverLLM` delegation and `BaseRetriever.llm` lazy property. Missing test coverage for `require_llm()` error path (ValueError when LLM is None).
tox.ini	Adds `prompt-service` tox environment with `--noconftest` to isolate unit tests from Flask-dependent conftest. The `conftest.py` now documents this limitation with a warning comment (prior thread concern addressed).

Sequence Diagram

sequenceDiagram
    participant LI as llama-index Component<br/>(KeywordTable/Fusion/Router/SubQuestion)
    participant RL as RetrieverLLM<br/>(llama-index LLM subclass)
    participant LC as LLMCompat<br/>(plain Python class)
    participant SDK as SDK1 LLM<br/>(litellm wrapper)
    participant LiteLLM as litellm

    Note over LI,RL: RetrieverLLM passes isinstance(llm, LLM) check
    LI->>RL: chat(messages) / complete(prompt)
    RL->>LC: _compat.chat(messages) / _compat.complete(prompt)
    Note over LC: _messages_to_prompt() flattens<br/>all messages with role prefixes
    LC->>SDK: _llm_instance.complete(prompt, **kwargs)
    SDK->>LiteLLM: litellm.completion(messages, **kwargs)
    LiteLLM-->>SDK: raw response
    SDK-->>LC: {"response": LLMResponseCompat}
    LC-->>RL: emulated ChatResponse / CompletionResponse
    Note over RL: converts emulated → real llama-index types
    RL-->>LI: real llama-index ChatResponse / CompletionResponse
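The call chain in the diagram can be approximated in plain Python. Every class below is a hypothetical stand-in (no llama-index or SDK1 imports), and the SDK return value is simplified to a string inside the `{"response": ...}` dict.

```python
from dataclasses import dataclass


class FrameworkLLM:
    """Stand-in for the llama-index LLM base class (hypothetical)."""


@dataclass
class EmulatedCompletion:
    """Stand-in for the compat layer's emulated CompletionResponse."""

    text: str


@dataclass
class FrameworkCompletion:
    """Stand-in for the framework's real CompletionResponse."""

    text: str


class SdkLLM:
    """Stand-in for the SDK LLM; returns a dict shaped like {"response": ...}."""

    def complete(self, prompt: str) -> dict:
        return {"response": f"answer to: {prompt}"}


class CompatLayer:
    """Plain class: no framework import, only delegates to the SDK LLM."""

    def __init__(self, llm: SdkLLM) -> None:
        self._llm_instance = llm

    def complete(self, prompt: str) -> EmulatedCompletion:
        result = self._llm_instance.complete(prompt)
        return EmulatedCompletion(text=result["response"])


class BridgeLLM(FrameworkLLM):
    """Framework subclass: passes isinstance checks, converts return types."""

    def __init__(self, compat: CompatLayer) -> None:
        self._compat = compat

    def complete(self, prompt: str) -> FrameworkCompletion:
        emulated = self._compat.complete(prompt)
        return FrameworkCompletion(text=emulated.text)


bridge = BridgeLLM(CompatLayer(SdkLLM()))
response = bridge.complete("What is the invoice total?")
```

Only the outermost class needs the framework as a dependency, which is why the compat layer can stay in a package that does not import it.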

Prompt To Fix All With AI

This is a comment left during a code review.
Path: unstract/sdk1/src/unstract/sdk1/llm.py
Line: 693-710

Comment:
**Double `CallbackManager` registration for non-public adapters**

`LLMCompat.__init__()` creates an inner `LLM` instance (which already calls `CallbackManager.set_callback(model=inner_llm, ...)` inside `LLM.__init__`) and then immediately calls `CallbackManager.set_callback(model=self, ...)` again for the `LLMCompat` wrapper. Since every actual LiteLLM call goes through `self._llm_instance.complete()` (not through `LLMCompat` methods directly), the callback registered for the `LLMCompat` wrapper in step 2 will never be triggered by a LiteLLM invocation.

If `CallbackManager.set_callback` appends to a global litellm callback list (rather than being model-scoped), the callback for the inner `LLM` fires once per call while the callback for `LLMCompat` is never invoked but still occupies a slot. More critically, if the callback list is global and accumulates entries across repeated `LLMCompat.__init__()` calls, it creates a memory/reporting drift.

Note that `from_llm()` (the preferred construction path for all retriever use-cases) has the same pattern and the same concern. Consider removing the `CallbackManager.set_callback` call from `LLMCompat.__init__()` entirely, since the inner `LLM` already registers its own callback, or add a comment explaining why a second registration for the wrapper is intentional.
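To make the accumulation concern concrete, here is a sketch that assumes registration appends to a process-wide list. Whether the real `CallbackManager.set_callback` behaves this way is exactly what the comment asks to verify; `GLOBAL_CALLBACKS`, `set_callback`, `Inner`, and `Wrapper` are hypothetical.

```python
from typing import Callable, List

# Hypothetical process-wide callback list, standing in for a global registry.
GLOBAL_CALLBACKS: List[Callable[[str], None]] = []


def set_callback(callback: Callable[[str], None]) -> None:
    """Hypothetical registration that appends to the global list."""
    GLOBAL_CALLBACKS.append(callback)


class Inner:
    def __init__(self) -> None:
        set_callback(lambda event: None)  # first registration


class Wrapper:
    def __init__(self) -> None:
        self._inner = Inner()  # registers once
        set_callback(lambda event: None)  # registers again for the wrapper


for _ in range(3):
    Wrapper()

registered = len(GLOBAL_CALLBACKS)
```

Under that assumption each wrapper construction adds two entries, so the list grows twice as fast as it would with the inner registration alone.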

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: prompt-service/src/unstract/prompt_service/tests/unit/test_retriever_llm.py
Line: 113-214

Comment:
**Missing test coverage for `require_llm()` error path**

`TestBaseRetrieverLlmProperty` covers the `llm` lazy property thoroughly but has no test for `BaseRetriever.require_llm()`. Specifically, the error path — when `_llm is None` — is untested. Given that `require_llm()` is the primary guard against silent OpenAI fallbacks in `fusion.py`, `keyword_table.py`, and `subquestion.py`, it warrants at least two test cases:

```python
def test_require_llm_raises_when_no_llm(self, base_retriever_cls):
    """require_llm() should raise ValueError when no LLM was provided."""
    retriever = base_retriever_cls(
        vector_db=MagicMock(), prompt="test", doc_id="doc-1", top_k=5
    )
    with pytest.raises(ValueError, match="requires an LLM"):
        retriever.require_llm()

def test_require_llm_returns_retriever_llm_when_llm_provided(
    self, base_retriever_cls, mock_sdk1_llm
):
    """require_llm() should return the RetrieverLLM when an LLM is set."""
    with patch.object(LLMCompat, "from_llm", return_value=MagicMock()):
        retriever = base_retriever_cls(
            vector_db=MagicMock(), prompt="test", doc_id="doc-1",
            top_k=5, llm=mock_sdk1_llm,
        )
        result = retriever.require_llm()
        assert isinstance(result, RetrieverLLM)
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: unstract/sdk1/src/unstract/sdk1/llm.py
Line: 735-743

Comment:
**`formatted` parameter silently dropped in `complete()` and `acomplete()`**

Both `complete()` and `acomplete()` accept a `formatted: bool = False` parameter (matching the llama-index interface), but neither forwards it to `self._llm_instance.complete()` / `self._llm_instance.acomplete()`:

```python
def complete(self, prompt: str, formatted: bool = False, **kwargs: Any) -> CompletionResponse:
    result = self._llm_instance.complete(prompt, **kwargs)  # formatted is not forwarded
```

llama-index passes `formatted=True` when the prompt has already been pre-formatted for the model (e.g., for chat-optimized models). If `LLM.complete()` supports this parameter, silently dropping it could cause double-formatting in future. If `LLM.complete()` does not accept `formatted` at all, at minimum add a comment clarifying the intentional drop to avoid future confusion.

The same applies to `acomplete()` at the async equivalent below.
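If the inner LLM does not accept `formatted`, the intentional drop can be made explicit, as in this sketch. `InnerLLM` and `Compat` are hypothetical stand-ins, and the inner signature is an assumption.

```python
from typing import Any


class InnerLLM:
    """Hypothetical inner LLM whose complete() has no `formatted` parameter."""

    def complete(self, prompt: str, **kwargs: Any) -> str:
        return f"{prompt}|{sorted(kwargs)}"


class Compat:
    def __init__(self, inner: InnerLLM) -> None:
        self._llm_instance = inner

    def complete(self, prompt: str, formatted: bool = False, **kwargs: Any) -> str:
        # `formatted` is accepted only for interface compatibility and is
        # intentionally not forwarded: the inner LLM in this sketch does
        # not accept it.
        del formatted
        return self._llm_instance.complete(prompt, **kwargs)


out = Compat(InnerLLM()).complete("hi", formatted=True, temperature=0)
```

The remaining keyword arguments still reach the inner call; only `formatted` stops at the wrapper.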

How can I resolve this? If you propose a fix, please make it concise.

Last reviewed commit: "Add require_llm() gu..."

prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py

unstract/sdk1/src/unstract/sdk1/llm.py

…mpt-service

- LLMCompat.from_llm(), emulated types, and _to_litellm_messages tests belong in sdk1/tests since those classes live in SDK1
- RetrieverLLM tests stay in prompt-service since that class lives there
- Add type annotations to SDK1 tests to match existing test conventions

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

for more information, see https://pre-commit.ci

unstract/sdk1/src/unstract/sdk1/llm.py

tox.ini

unstract/sdk1/src/unstract/sdk1/llm.py

- Declare _compat as Pydantic PrivateAttr in RetrieverLLM for proper v2 lifecycle support (model_copy, serialization)
- Forward **kwargs to litellm in LLMCompat chat/complete/achat/acomplete and pop cost_model after adapter.validate() to match LLM.complete()
- Forward system_prompt in LLMCompat.from_llm() factory method
- Remove dead predict()/apredict() from LLMCompat; RetrieverLLM inherits these from LlamaIndexBaseLLM directly
- Add NOTE+TODO documenting missing usage audit for retriever LLM calls
- Document --noconftest usage in tox.ini and unit conftest.py

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

for more information, see https://pre-commit.ci

unstract/sdk1/tests/test_llm_compat.py

unstract/sdk1/src/unstract/sdk1/llm.py

…e level

- Add _system_prompt to mock fixture and assertion to match from_llm()
- Move litellm.drop_params = True to module-level init instead of repeating it per-call in chat/complete/achat/acomplete

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py

Simple, Automerging, and Recursive retrievers never use the LLM for llama-index components. Lazy construction via a property avoids the cost of adapter init and CallbackManager setup for those paths. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

unstract/sdk1/src/unstract/sdk1/llm.py

prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py

…ating from_llm() now bypasses __init__ and stores the caller's LLM directly, avoiding redundant adapter validation and PlatformHelper calls. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

LLMCompat is a bridge layer; it should not invoke litellm directly. All chat/complete/achat/acomplete methods now delegate to LLM.complete() and LLM.acomplete(), which handle litellm invocation, error wrapping, and usage auditing in one place.

- Replace litellm.completion() calls with self._llm_instance.complete()
- Replace litellm.acompletion() calls with self._llm_instance.acomplete()
- Replace _to_litellm_messages() and _get_completion_kwargs() with _messages_to_prompt() which extracts the last user message
- Update tests: replace _to_litellm_messages tests with _messages_to_prompt tests covering edge cases

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

SDK1 tests (test_llm_compat.py):

- Add TestLLMCompatDelegation verifying chat/complete/achat/acomplete delegate to LLM.complete()/acomplete() instead of calling litellm
- Test kwargs forwarding, return type wrapping, metadata, and get_model_name/get_metrics/test_connection delegation
- Test all four streaming NotImplementedError paths

Prompt-service tests (test_retriever_llm.py):

- Add TestBaseRetrieverLlmProperty verifying lazy construction: returns None without LLM, returns RetrieverLLM with LLM, defers creation until first access, caches across accesses, calls from_llm() exactly once, preserves raw _llm reference
- Stub VectorDB via sys.modules to avoid triggering the full adapter registration chain during unit tests

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
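Stubbing a module through `sys.modules`, as the last bullet describes, generally follows the pattern sketched below. The package name `heavy_pkg.vector_db` is hypothetical; the real target would be whichever module the code under test imports `VectorDB` from.

```python
import importlib
import sys
import types
from unittest.mock import MagicMock

# Hypothetical module path standing in for the real import target.
STUB_NAME = "heavy_pkg.vector_db"

# Register a parent package and the stub submodule before anything imports them.
parent = types.ModuleType("heavy_pkg")
stub = types.ModuleType(STUB_NAME)
stub.VectorDB = MagicMock(name="VectorDB")
parent.vector_db = stub
sys.modules["heavy_pkg"] = parent
sys.modules[STUB_NAME] = stub

# Any later import resolves to the stub instead of executing the real package.
imported = importlib.import_module(STUB_NAME)
```

Because the import system consults `sys.modules` first, the real package's import-time side effects never run in the test process.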

for more information, see https://pre-commit.ci

unstract/sdk1/src/unstract/sdk1/llm.py

- Clarify that LLMCompat is a plain class (no llama-index inheritance), unlike EmbeddingCompat which inherits BaseEmbedding
- Note that from_llm() is preferred when an LLM instance already exists
- Document single-turn assumption in _messages_to_prompt()

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

unstract/sdk1/tests/test_llm_compat.py

…o dep _messages_to_prompt now concatenates all messages with role prefixes instead of extracting only the last user message. This preserves task-specific system instructions from llama-index components like LLMQuestionGenerator ("You are an expert Q&A system...") that were previously silently dropped. Also add pytest-asyncio to sdk1 test dependencies — it was only installed transitively, so async tests could silently pass without executing if the transitive path changed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
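Concatenating all messages with role prefixes might look like the sketch below. The `Message` dataclass and the exact `role: content` line format are assumptions for illustration; the commit message does not spell out the real format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Message:
    """Hypothetical minimal chat message (role plus text content)."""

    role: str
    content: str


def messages_to_prompt(messages: List[Message]) -> str:
    """Join every message into one prompt string, prefixing each with its role."""
    return "\n".join(f"{m.role}: {m.content}" for m in messages)


prompt = messages_to_prompt(
    [
        Message("system", "You are an expert Q&A system."),
        Message("user", "List the sub-questions."),
    ]
)
```

Unlike a "last user message only" approach, the system instruction survives in the flattened prompt.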

for more information, see https://pre-commit.ci

prompt-service/src/unstract/prompt_service/core/retrievers/subquestion.py

Subquestion, Fusion, and KeywordTable retrievers pass self.llm to llama-index components. When no LLM is configured, self.llm returns None and llama-index silently falls back to its default OpenAI LLM, producing a confusing API key error. Add BaseRetriever.require_llm() that fails early with a clear message, and use it in all three retrievers. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
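A fail-fast guard of this kind can be sketched as follows. `RetrieverSketch` and the error wording are hypothetical; only the idea of raising `ValueError` before a downstream default can kick in comes from the discussion above.

```python
from typing import Any, Optional


class RetrieverSketch:
    """Hypothetical retriever base with a fail-fast LLM guard."""

    def __init__(self, llm: Optional[Any] = None) -> None:
        self.llm = llm

    def require_llm(self) -> Any:
        if self.llm is None:
            # Raise here, before a downstream library can substitute its own default.
            raise ValueError(
                f"{type(self).__name__} requires an LLM, but none was configured."
            )
        return self.llm


try:
    RetrieverSketch().require_llm()
    error_message = None
except ValueError as exc:
    error_message = str(exc)

configured = RetrieverSketch(llm="my-llm").require_llm()
```

The error names the retriever class, so the message points at the misconfiguration rather than at a missing API key.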

github-actions · 2026-03-18T07:47:20Z

Test Results

Summary

✅ Runner Tests: 11 passed, 0 failed (11 total)
✅ SDK1 Tests: 92 passed, 0 failed (92 total)

Runner Tests - Full Report

filepath	function	passed	SUBTOTAL
runner/src/unstract/runner/clients/test_docker.py	test_logs	1	1
runner/src/unstract/runner/clients/test_docker.py	test_cleanup	1	1
runner/src/unstract/runner/clients/test_docker.py	test_cleanup_skip	1	1
runner/src/unstract/runner/clients/test_docker.py	test_client_init	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_image_exists	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_image	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_container_run_config	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_container_run_config_without_mount	1	1
runner/src/unstract/runner/clients/test_docker.py	test_run_container	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_image_for_sidecar	1	1
runner/src/unstract/runner/clients/test_docker.py	test_sidecar_container	1	1
TOTAL		11	11

SDK1 Tests - Full Report

sonarqubecloud · 2026-03-18T07:47:27Z

Quality Gate passed

Issues
3 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

unstract/sdk1/src/unstract/sdk1/llm.py

pk-zipstack added 2 commits February 13, 2026 01:01

Added bridge class for llms to fix retrieval issue

0acab2b

Fixed import error with sub-question retrieval

5fc3ef8

pk-zipstack requested review from gaya3-zipstack, hari-kuriakose and harini-venkataraman February 12, 2026 19:55

pk-zipstack self-assigned this Feb 12, 2026

pre-commit-ci bot and others added 3 commits February 12, 2026 19:55

[pre-commit.ci] auto fixes from pre-commit.com hooks

dc57c53

for more information, see https://pre-commit.ci

Merge branch 'main' into fix/retriever-llm-bridge-class

0be9fb1

pk-zipstack marked this pull request as ready for review February 19, 2026 04:28

coderabbitai bot reviewed Feb 19, 2026

pk-zipstack changed the title ~~[FIX] Add LLMCompat bridge class to fix retriever LLM compatibility with llama-index~~ UN-3215 [FIX] Add LLMCompat bridge class to fix retriever LLM compatibility with llama-index Mar 3, 2026

harini-venkataraman reviewed Mar 3, 2026

llm-predict-error-investigation.md (outdated, resolved)

harini-venkataraman reviewed Mar 3, 2026

prompt-service/src/unstract/prompt_service/utils/llm_helper.py (outdated, resolved)

harini-venkataraman approved these changes Mar 3, 2026

Remove investigation notes file from branch

922a8a1

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

hari-kuriakose requested changes Mar 3, 2026

Update prompt-service/src/unstract/prompt_service/core/retrievers/bas…

4a6c392

…e_retriever.py Co-authored-by: Hari John Kuriakose <hari@zipstack.com> Signed-off-by: Praveen Kumar <praveen@zipstack.com>

coderabbitai bot reviewed Mar 3, 2026

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py (outdated, resolved)

pk-zipstack and others added 2 commits March 3, 2026 15:28

[pre-commit.ci] auto fixes from pre-commit.com hooks

a1040c1

for more information, see https://pre-commit.ci

coderabbitai bot reviewed Mar 3, 2026

Merge branch 'main' into fix/retriever-llm-bridge-class

e3e91fe

pk-zipstack requested a review from hari-kuriakose March 3, 2026 10:13

Merge branch 'main' into fix/retriever-llm-bridge-class

2b3c3ff

hari-kuriakose requested changes Mar 13, 2026

greptile-apps bot reviewed Mar 13, 2026

pk-zipstack and others added 2 commits March 13, 2026 12:07

[pre-commit.ci] auto fixes from pre-commit.com hooks

cbec777

for more information, see https://pre-commit.ci

greptile-apps bot reviewed Mar 13, 2026

unstract/sdk1/src/unstract/sdk1/llm.py (outdated, resolved)

tox.ini (resolved)

unstract/sdk1/src/unstract/sdk1/llm.py (outdated, resolved)

pk-zipstack and others added 2 commits March 13, 2026 14:01

[pre-commit.ci] auto fixes from pre-commit.com hooks

3273408

for more information, see https://pre-commit.ci

greptile-apps bot reviewed Mar 13, 2026

unstract/sdk1/tests/test_llm_compat.py (outdated, resolved)

unstract/sdk1/src/unstract/sdk1/llm.py (outdated, resolved)

greptile-apps bot reviewed Mar 13, 2026

prompt-service/src/unstract/prompt_service/core/retrievers/base_retriever.py (outdated, resolved)

greptile-apps bot reviewed Mar 13, 2026

unstract/sdk1/src/unstract/sdk1/llm.py (outdated, resolved)

prompt-service/src/unstract/prompt_service/core/retrievers/retriever_llm.py (resolved)

pk-zipstack and others added 6 commits March 13, 2026 14:41

Merge branch 'main' into fix/retriever-llm-bridge-class

0b9c5e5

Reuse existing LLM instance in LLMCompat.from_llm() instead of re-cre…

d55f8e4

…ating from_llm() now bypasses __init__ and stores the caller's LLM directly, avoiding redundant adapter validation and PlatformHelper calls. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Use drop litellm params at the module level

787eb0d

[pre-commit.ci] auto fixes from pre-commit.com hooks

4d68900

for more information, see https://pre-commit.ci

greptile-apps bot reviewed Mar 18, 2026

unstract/sdk1/src/unstract/sdk1/llm.py (outdated, resolved)

greptile-apps bot reviewed Mar 18, 2026

unstract/sdk1/tests/test_llm_compat.py (resolved)

pk-zipstack and others added 4 commits March 18, 2026 12:37

Commit uv.lock changes

b77c707

Merge branch 'main' into fix/retriever-llm-bridge-class

aa2e5ee

[pre-commit.ci] auto fixes from pre-commit.com hooks

08564f1

for more information, see https://pre-commit.ci

greptile-apps bot reviewed Mar 18, 2026

prompt-service/src/unstract/prompt_service/core/retrievers/subquestion.py (resolved)

greptile-apps bot reviewed Mar 18, 2026

unstract/sdk1/src/unstract/sdk1/llm.py (resolved)

unstract/sdk1/src/unstract/sdk1/llm.py (resolved)

Conversation

pk-zipstack commented Feb 12, 2026 • edited

What

Why

How

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)

Database Migrations

Env Config

Relevant Docs

Related Issues or PRs

Dependencies Versions

Notes on Testing

Screenshots

Checklist

coderabbitai bot commented Feb 12, 2026 • edited

Review skipped

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

❌ Failed checks (1 warning)


sonarqubecloud bot commented Mar 3, 2026

Quality Gate passed
