
T6 M0: Technical plan + analysis notebook for multi-objective vector …#61

Open
carlosrod723 wants to merge 7 commits into AgentOpt:experimental from carlosrod723:t6-multi-objective-m0

Conversation

@carlosrod723

M0 delivery for T6 Multi-Objective Vector Scores.

Deliverables:

  • docs/T6_technical_plan.md — Refined tech plan with API signatures, edge cases, test plan
  • examples/notebooks/t6_m0_analysis.ipynb — Colab notebook (no API keys needed)

Notebook demonstrates current baseline behavior and a working prototype of weighted vs Pareto selection with deterministic tie-break validation.
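The notebook's weighted-vs-Pareto prototype is not reproduced in this thread, but the idea can be sketched roughly as below. All names here (`weighted_score`, `pareto_front`, `select_best`) and the exact tie-break rule are illustrative assumptions, not the notebook's actual API; metrics are assumed higher-is-better.

```python
def weighted_score(scores: dict, weights: dict) -> float:
    """Scalarize a metric dict with user-supplied weights."""
    return sum(weights[k] * scores[k] for k in weights)

def pareto_front(candidates: list) -> list:
    """Indices of candidates not dominated on any metric (higher is better)."""
    keys = sorted(candidates[0].keys())
    vecs = [tuple(c[k] for k in keys) for c in candidates]
    front = []
    for i, v in enumerate(vecs):
        dominated = any(
            all(w[d] >= v[d] for d in range(len(keys)))
            and any(w[d] > v[d] for d in range(len(keys)))
            for j, w in enumerate(vecs) if j != i
        )
        if not dominated:
            front.append(i)
    return front

def select_best(candidates: list, weights: dict) -> int:
    """Pick from the Pareto front; ties broken deterministically by
    (weighted score desc, index asc) so reruns select the same candidate."""
    front = pareto_front(candidates)
    return min(front, key=lambda i: (-weighted_score(candidates[i], weights), i))

cands = [
    {"accuracy": 0.9, "cost_score": 0.2},
    {"accuracy": 0.8, "cost_score": 0.9},
    {"accuracy": 0.5, "cost_score": 0.1},  # dominated by index 0 on both metrics
]
print(pareto_front(cands))  # → [0, 1]
print(select_best(cands, {"accuracy": 0.7, "cost_score": 0.3}))  # → 1
```

The deterministic tie-break matters for reproducibility: when two candidates on the front have equal weighted scores, the lower index always wins.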

"""
score, _ = self.get_feedback(query, response, reference, **kwargs)
if isinstance(score, dict):
return float(np.mean(list(score.values())))
Member

We should leave this behavior to be configurable from the Objective side.
It should not be hard coded here.

Member

Also, why do we need this method on the Guide to begin with? I guess the real question is whether we would require passing an Objective into the Guide.
Or, asked differently: should the Guide be the one who creates the Objective and sends it around? @allenanie what do you think?

"""
...

def aggregate_vector_scores(scores: list) -> Union[float, Dict[str, float]]:
Member

As above, the logic should be implemented by Objective.
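For reference, the mean-per-metric behavior behind the quoted signature might look like the sketch below. This is an assumption about the intended semantics, not the PR's actual implementation, and it assumes all dict scores share the same keys:

```python
from typing import Dict, List, Union

def aggregate_vector_scores(
    scores: List[Union[float, Dict[str, float]]]
) -> Union[float, Dict[str, float]]:
    """Mean per metric when scores are dicts; plain mean for scalars."""
    if scores and isinstance(scores[0], dict):
        # Average each metric independently across all score dicts.
        return {k: sum(s[k] for s in scores) / len(scores)
                for k in scores[0]}
    return sum(scores) / len(scores)
```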

Isolate all multi-objective logic into one new module (`opto/trainer/objectives.py`) containing **pure functions**:

```
normalize_score() → scalar ↔ dict conversion
```
Member

Let's use a different name: normalize_score implies some sort of scaling or shifting is done.
Let's use something explicit like to_score_dict, or some term that is more neutral.

Member

@chinganc left a comment

Contributor
doxav commented Feb 15, 2026

Hi @chinganc

I propose to address your comments by moving all dict → scalar conversion and aggregation policy into opto/trainer/objectives.py (the Objective side), and making the behavior configurable via ObjectiveConfig.

Concretely:

  • Rename normalize_score → to_score_dict (with a backwards-compatible alias).
  • Add ObjectiveConfig.scalarize_dict ∈ {"score", "mean", "weighted"} plus score_key, so dict → scalar reduction is never silently hard-coded in Guide/Evaluator.
  • Implement dict → scalar reduction in objectives.py (score_dict_to_scalar / to_scalar_score) and use it in select_best/select_top_k for scalar-mode fallbacks.
  • Move mean-per-metric aggregation into objectives.py (aggregate_score_dicts) and make evaluators.aggregate_vector_scores a thin wrapper.
  • Update the M1 notebook and technical plan to demonstrate scalarize_dict explicitly and to recommend overriding Guide.get_score_dict() (not changing the get_feedback() return type).

This keeps Guide responsible for producing raw metrics, and keeps ObjectiveConfig (trainer-side) responsible for aggregation/scalarization/selection, without passing ObjectiveConfig into the Guide.
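A rough sketch of how those pieces could fit together. The names to_score_dict, score_dict_to_scalar, ObjectiveConfig.scalarize_dict, and score_key come from the proposal above, but every signature and default here is an assumption, not the PR's code:

```python
from dataclasses import dataclass, field
from typing import Dict, Union

Score = Union[float, Dict[str, float]]

def to_score_dict(score: Score, default_key: str = "score") -> Dict[str, float]:
    """Lossless scalar <-> dict conversion: no scaling or shifting,
    per the naming feedback above."""
    return dict(score) if isinstance(score, dict) else {default_key: float(score)}

@dataclass
class ObjectiveConfig:
    # How a metric dict is reduced to a scalar: "score" picks score_key,
    # "mean" averages all metrics, "weighted" uses `weights`.
    scalarize_dict: str = "mean"
    score_key: str = "score"
    weights: Dict[str, float] = field(default_factory=dict)

def score_dict_to_scalar(score: Score, cfg: ObjectiveConfig) -> float:
    d = to_score_dict(score, cfg.score_key)
    if cfg.scalarize_dict == "score":
        return d[cfg.score_key]
    if cfg.scalarize_dict == "weighted":
        return sum(cfg.weights.get(k, 0.0) * v for k, v in d.items())
    return sum(d.values()) / len(d)  # "mean"

print(score_dict_to_scalar({"f1": 0.8, "bleu": 0.4}, ObjectiveConfig()))  # mean of the two metrics
```

This layout matches the division of responsibility described above: the Guide emits raw metric dicts, and the trainer-side config decides how they collapse to a scalar.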

3 participants