Breadth-first GraphQL execution

The proof-of-concept algorithm for Shopify's GraphQL Cardinal engine

GraphQL requests have two dimensions: depth and breadth. The depth dimension is finite as defined by the request document, while the breadth dimension scales by the width of response data (and can grow extremely large).

Traditional GraphQL implementations execute depth-first, which resolves every field of every object in the response individually, making resolver overhead (resolver calls, tracing, intermediary promises) scale by depth × breadth. By executing breadth-first, we instead resolve each selection depth only once with an aggregated set of objects, so resolver overhead scales by depth-only.

This breadth-first design makes processing list repetitions considerably faster.

graphql-ruby (depth): 140002 resolvers
   1.087 (± 0.0%) i/s  (919.76 ms/i) -  6.000 in  5.526807s
graphql-breadth_exec 140002 resolvers
   21.314 (± 9.4%) i/s   (46.92 ms/i) -  108.000 in  5.095015s

Comparison:
graphql-breadth_exec 140002 resolvers:   21.3 i/s
graphql-ruby (depth): 140002 resolvers:   1.1 i/s - 19.60x  slower

Batching advantages

Breadth-first resolvers look a little different than we're used to: they recieve objects and return a mapped set.

def resolve(objects, args, cxt)
  objects.map { ... }
end

This makes all field instances inherently batched as a function of the engine without using DataLoader promise patterns. However, promises are still relevant for batching work across field instances (ie: same field using different aliases, or different fields sharing a query, etc.). Promise patterns can be considerably more efficient in breadth execution by binding many objects to a single promise rather than generating a promise per object:

Napkin math

Assumption: all GraphQL fields have some non-zero overhead cost associated with their execution. For simplicity, let's round up and say this cost is 1ms.

Scenario: we resolve five fields (depth) across a list of 1000 objects (breadth).

depth-first: we call 5000 field resolvers (depth × breadth) and incur 5s (5 × 1000 × 1ms).
breadth-first: we call 5 field resolvers (depth-only) and incur only 5ms (5 × 1ms).

Now assume each field operates lazily and returns a promise:

depth-first: we build and resolve 5000 intermediary promises (depth × breadth).
breadth-first: we build and resolve 5 intermediary promises (depth-only).

Now assume we chain a .then onto the lazy promise resolution:

depth-first: we build and resolve 10,000 intermediary promises (depth × breadth × 2).
breadth-first: we build and resolve 10 intermediary promises (depth × 2).

Prototype usage

This is an extremely early proof-of-concept that demonstrates basic breadth-first concepts. It outlines a core engine flow using batched sets, and includes a basic many-to-one promissory workflow. These patterns have matured into Shopify's GraphQL Cardinal engine that now runs much of their production traffic; these patterns are also being matured for the open source community in graphql-ruby. To experiment with this prototype, you can setup a GraphQL::BreadthExec::FieldResolver:

class MyFieldResolver < GraphQL::BreadthExec::FieldResolver
   def resolve(objects, args, ctx, scope)
      map_sources(objects) { |obj| obj.my_field }
   end
end

A field resolver provides:

objects: the array of objects to resolve the field on.
args: the coerced arguments provided to this selection field.
ctx: the request context.
scope: (experimental) a handle to the execution scope that invokes lazy hooks.

A resolver must return a mapped set of data for the provided objects. Always use the map_sources helper for your mapping loop to assure that exceptions are captured properly. You may return errors for a field position by mapping an ExecutionError into it:

class MyFieldResolver < GraphQL::BreadthExec::FieldResolver
   def resolve(objects, args, ctx, scope)
      map_sources(objects) do |obj|
         obj.valid? ? obj.my_field : GraphQL::BreadthExec::ExecutionError.new("Object field not valid")
      end
   end
end

Now setup a resolver map:

RESOLVER_MAP = {
  "MyType" => {
    "myField" => MyFieldResolver.new,
  },
  "Query" => {
    "myType" => MyTypeResolver.new,
  },
}.freeze

Now parse your schema definition and execute requests:

SCHEMA = GraphQL::Schema.from_definition(%|
  type MyType {
    myField: String
  }
  type Query {
    myType: MyType
  }
|)

result = GraphQL::BreadthExec::Executor.new(
   SCHEMA,
   RESOLVER_MAP,
   GraphQL.parse(query),
   {}, # root object
   variables: { ... },
   context: { ... },
   tracers: [ ... ],
).perform

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.github/workflows		.github/workflows
benchmark		benchmark
images		images
lib/graphql		lib/graphql
test		test
.gitignore		.gitignore
Gemfile		Gemfile
LICENSE		LICENSE
README.md		README.md
Rakefile		Rakefile
graphql-breadth_exec.gemspec		graphql-breadth_exec.gemspec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Breadth-first GraphQL execution

Batching advantages

Napkin math

Prototype usage

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

gmac/graphql-breadth-exec

Folders and files

Latest commit

History

Repository files navigation

Breadth-first GraphQL execution

Batching advantages

Napkin math

Prototype usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages