Setup DistABLPLoader for GraphStore mode by kmontemayor2-sc · Pull Request #485 · Snapchat/GiGL

kmontemayor2-sc · 2026-02-11T18:37:03Z

Scope of work done

Support GS inputs and sampling for ABLP Loader
Fix some bugs in DistLoader to support "labeled homogeneous" mode
Add tests for iterating over both dataloader in with ABLP dataset

Where is the documentation for this feature?: N/A

Did you add automated tests or write a test plan?

Updated Changelog.md? NO

Ready for code review?: NO

kmontemayor2-sc · 2026-02-11T19:58:36Z

/e2e_test

kmontemayor2-sc · 2026-02-11T19:58:43Z

/integration_test

kmontemayor2-sc · 2026-02-11T19:58:47Z

/unit_test_py

github-actions · 2026-02-11T19:58:53Z

GiGL Automation

@ 19:58:53UTC : 🔄 E2E Test started.

@ 21:23:22UTC : ✅ Workflow completed successfully.

github-actions · 2026-02-11T19:58:54Z

GiGL Automation

@ 19:58:54UTC : 🔄 Integration Test started.

@ 21:07:19UTC : ✅ Workflow completed successfully.

github-actions · 2026-02-11T19:59:00Z

GiGL Automation

@ 19:58:59UTC : 🔄 Python Unit Test started.

@ 21:10:08UTC : ✅ Workflow completed successfully.

mkolodner-sc

Thanks Kyle, left a few comments/questions

gigl/distributed/dist_ablp_neighborloader.py

mkolodner-sc · 2026-02-11T20:52:05Z

gigl/distributed/dist_ablp_neighborloader.py

+                # Graph Store mode inputs
+                dict[int, tuple[torch.Tensor, torch.Tensor, Optional[torch.Tensor]]],
+                tuple[
+                    NodeType,


Just to double check, how would this interplay if I have user as my anchor node type and item as my supervision node type?

Tbh it would feel a bit weird to specify {user: {0: (0, 1, 1)}} with the 0 corresponding to the user node type but the two 1 node types are items. Ideally, if a user is providing the positive and negative labels here, I feel like they should have to provide the node type in some way. This may be even more true once we will need to support multiple supervision edge types

Good callout!

Hmmm, wdyt about dict[int, tuple[tuple[NodeType, Tensor], tuple[NodeType, Tensor], Optional[tuple[NodeType, Tensor]]?

Or honest that's so complicated probably worth creating a dataclass to wrap all the data?

I know we want to avoid dataclasses here but frankly this is so complicated I think it's worth it. WDYT?

It is growing complicated enough here honestly where I think a dataclass specifically explaining this input (and potentially having the RemoteDataset.get_ablp_inputs() return this dataclass) makes sense.

Re: dict[int, tuple[tuple[NodeType, Tensor], tuple[NodeType, Tensor], Optional[tuple[NodeType, Tensor]]

Not to add complexity unfortunately to this already complex object, but for multiple supervision edge types, we may need to make it list[tuple[NodeType, Tensor]] for the positive and negatives :')

Alternatively, another option could be to expose positive_nodes, negative_nodes, or label_nodes as direct arguments. It seems like it's a tradeoff between

adhering to GLT by only having one input_nodes but having a fairly complex structure to provide as input :')

having multiple arguments but now we've evolved the API beyond GLT/PyG and further added complexity to the API

I need to think more about which approach I prefer but these are my thoughts for now

kmontemayor2-sc · 2026-02-12T21:36:49Z

/integration_test

kmontemayor2-sc · 2026-02-12T21:36:54Z

/unit_test_py

github-actions · 2026-02-12T21:37:00Z

GiGL Automation

@ 21:37:00UTC : 🔄 Integration Test started.

@ 22:39:45UTC : ❌ Workflow failed.
Please check the logs for more details.

github-actions · 2026-02-12T21:37:10Z

GiGL Automation

@ 21:37:10UTC : 🔄 Python Unit Test started.

@ 21:44:13UTC : ❌ Workflow failed.
Please check the logs for more details.

kmontemayor2-sc · 2026-02-12T21:52:04Z

/unit_test_py

github-actions · 2026-02-12T21:52:16Z

GiGL Automation

@ 21:52:15UTC : 🔄 Python Unit Test started.

@ 23:04:55UTC : ✅ Workflow completed successfully.

kmontemayor2-sc · 2026-02-12T23:13:12Z

/integration_test

github-actions · 2026-02-12T23:13:24Z

GiGL Automation

@ 23:13:23UTC : 🔄 Integration Test started.

@ 24:01:43UTC : ❌ Workflow failed.
Please check the logs for more details.

kmontemayor2-sc · 2026-02-13T00:05:54Z

/integration_test

github-actions · 2026-02-13T00:06:05Z

GiGL Automation

@ 24:06:05UTC : 🔄 Integration Test started.

@ 01:59:33UTC : ❌ Workflow failed.
Please check the logs for more details.

kmontemayor2-sc · 2026-02-13T02:34:25Z

/integration_test

github-actions · 2026-02-13T02:34:40Z

GiGL Automation

@ 02:34:39UTC : 🔄 Integration Test started.

@ 04:06:55UTC : ❌ Workflow failed.
Please check the logs for more details.

kmontemayor2-sc · 2026-02-13T18:10:57Z

/unit_test_py

kmontemayor2-sc · 2026-02-13T18:11:05Z

/integration_test_py

kmontemayor2-sc · 2026-02-13T18:11:08Z

/e2e_Test

github-actions · 2026-02-13T18:11:09Z

GiGL Automation

@ 18:11:08UTC : 🔄 Python Unit Test started.

@ 19:35:54UTC : ✅ Workflow completed successfully.

github-actions · 2026-02-13T18:11:15Z

GiGL Automation

@ 18:11:15UTC : 🔄 Integration Test started.

@ 19:50:46UTC : ❌ Workflow failed.
Please check the logs for more details.

kmontemayor2-sc · 2026-02-13T18:11:19Z

/integration_test

github-actions · 2026-02-13T18:11:25Z

GiGL Automation

@ 18:11:24UTC : 🔄 E2E Test started.

@ 19:33:42UTC : ✅ Workflow completed successfully.

github-actions · 2026-02-13T18:11:34Z

GiGL Automation

@ 18:11:34UTC : 🔄 Integration Test started.

@ 19:43:40UTC : ❌ Workflow failed.
Please check the logs for more details.

kmontemayor2-sc · 2026-02-13T19:56:35Z

/unit_test_py

kmontemayor2-sc · 2026-02-13T19:56:41Z

/integration_test

github-actions · 2026-02-13T19:56:48Z

GiGL Automation

@ 19:56:48UTC : 🔄 Python Unit Test started.

@ 21:03:45UTC : ✅ Workflow completed successfully.

github-actions · 2026-02-13T19:57:04Z

GiGL Automation

@ 19:57:03UTC : 🔄 Integration Test started.

@ 21:32:55UTC : ❌ Workflow failed.
Please check the logs for more details.

gigl/utils/sampling.py

gigl/distributed/graph_store/remote_dist_dataset.py

mkolodner-sc · 2026-02-13T21:38:17Z

gigl/distributed/dist_ablp_neighborloader.py

+                    "supervision_edge_type must not be provided when using Graph Store mode. "
+                    "The supervision edge types are inferred from the ABLPInputNodes label keys in input_nodes."


Is there a particular reason we want to be raising an error here if it is provided? I thought it would be sufficient to just validate that this is the same as what is provided in input_nodes in the case that it is also provided here, rather than raise an error directly.

gigl/distributed/dist_ablp_neighborloader.py

mkolodner-sc · 2026-02-13T21:47:38Z

gigl/distributed/dist_ablp_neighborloader.py

+        # Validate that the negative label edge types (if present) correspond to the
+        # same supervision edge types as the positive labels.


I wonder, since we need to do this, if it'd be better to have some labeled_nodes that is a Dict[EdgeType, Tuple(torch.Tensor, Optional[torch.Tensor]). That way, they are guaranteed to have the same sueprvision edge types and consumers won't be able to make this error.

mkolodner-sc · 2026-02-13T21:48:53Z

tests/test_assets/test_case.py

 logger = Logger()

-DEFAULT_TIMEOUT_SECONDS: Final[float] = 60.0 * 10  # 10 minutes
+DEFAULT_TIMEOUT_SECONDS: Final[float] = 60.0 * 30  # 30 minutes


qq: Why did we need to bump this?

kmonte added 2 commits February 11, 2026 18:35

Setup DistABLPLoader for GraphStore mode

7d0d8be

fixes

3c3bf1b

mkolodner-sc reviewed Feb 11, 2026

View reviewed changes

kmonte added 3 commits February 11, 2026 21:41

address comments

841a9fd

with timing

0669562

maybe works?

b488f7f

fixes

9198ecb

kmonte added 2 commits February 12, 2026 23:09

Merge branch 'main' into kmonte/gs-for-ablp

5cdee46

remove for now

8ad09ec

maybe?

347f359

Update test_case.py

8b7dd29

revert some stuff

0d5ff13

kmonte added 2 commits February 13, 2026 18:00

revert some stuff

d8a666d

Merge branch 'main' into kmonte/gs-for-ablp

f6a9836

bump timeout

81b3903

mkolodner-sc reviewed Feb 13, 2026

View reviewed changes

kmonte added 3 commits February 13, 2026 23:31

comments

f66bfd8

bleh

bbb499d

hmmm

8cbcf27

		"supervision_edge_type must not be provided when using Graph Store mode. "
		"The supervision edge types are inferred from the ABLPInputNodes label keys in input_nodes."

		# Validate that the negative label edge types (if present) correspond to the
		# same supervision edge types as the positive labels.

Conversation

kmontemayor2-sc commented Feb 11, 2026

Uh oh!

kmontemayor2-sc commented Feb 11, 2026

Uh oh!

kmontemayor2-sc commented Feb 11, 2026

Uh oh!

kmontemayor2-sc commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

github-actions bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

github-actions bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

mkolodner-sc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mkolodner-sc Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

kmontemayor2-sc Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

mkolodner-sc Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kmontemayor2-sc commented Feb 12, 2026

Uh oh!

kmontemayor2-sc commented Feb 12, 2026

Uh oh!

github-actions bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

github-actions bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

kmontemayor2-sc commented Feb 12, 2026

Uh oh!

github-actions bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

kmontemayor2-sc commented Feb 12, 2026

Uh oh!

github-actions bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

kmontemayor2-sc commented Feb 13, 2026

Uh oh!

github-actions bot commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

kmontemayor2-sc commented Feb 13, 2026

Uh oh!

github-actions bot commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

kmontemayor2-sc commented Feb 13, 2026

Uh oh!

kmontemayor2-sc commented Feb 13, 2026

Uh oh!

kmontemayor2-sc commented Feb 13, 2026

Uh oh!

github-actions bot commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

github-actions bot commented Feb 11, 2026 •

edited

Loading

github-actions bot commented Feb 11, 2026 •

edited

Loading

github-actions bot commented Feb 11, 2026 •

edited

Loading

mkolodner-sc Feb 11, 2026 •

edited

Loading

github-actions bot commented Feb 12, 2026 •

edited

Loading

github-actions bot commented Feb 12, 2026 •

edited

Loading

github-actions bot commented Feb 12, 2026 •

edited

Loading

github-actions bot commented Feb 12, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading

github-actions bot commented Feb 13, 2026 •

edited

Loading