Overview
The router comes equipped to do a lot of its own caching, without any additional configuration required from us. From introspection responses to query plans, the cache accelerates our router's ability to execute queries over and over again.
In this lesson, we will:
- Learn about query plans
- Inspect the query plans the router generates
- Learn how to warm up the cache on schema changes
Query plans
When the router receives a GraphQL operation for the first time, it creates a plan for how to resolve it: the query plan, the set of instructions the router follows to fetch and assemble data from different subgraphs.
Starting from the top-level field of the operation, the router consults the supergraph schema to determine which subgraph is responsible for resolving the field, and starts to build its query plan. The router continues like this, checking each field in the query against the supergraph schema, and adding it to the query plan.
After all the fields are accounted for, the router goes on to execute the plan, making requests to subgraphs in the order outlined in the plan, and assembling the data before returning it back to the client.
Caching query plans
The router uses a query plan for every operation it resolves; but it doesn't necessarily generate the query plan each time. When it receives an operation, the router first consults its own in-memory cache for the corresponding query plan.
If it finds it, then great—clearly, it's done this work before! It plucks the plan from the cache and follows the instructions to resolve the operation.
If the router does not find the query plan, it proceeds with generating it from scratch. The router also stores the plan in its in-memory cache, to be used for next time.
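By default this query plan cache lives entirely in the router's memory, and its size can be tuned in the router's YAML config file. The snippet below is a minimal sketch based on the router's documented in-memory caching options; the exact keys and limit may differ between router versions, so confirm them against the official Apollo documentation.

# router-config.yaml (sketch)
supergraph:
  query_planning:
    cache:
      in_memory:
        limit: 512 # maximum number of query plans kept in the in-memory cache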
Query plans in action
Alright, let's see this in action! First, let's take a look at the query plan for one of our operations.
Jump into GraphOS Studio Explorer and paste the same operation from the previous lesson into the Operation panel.
query GetFeaturedListings {
  featuredListings {
    id
    title
    description
    numOfBeds
  }
}
In the Response panel, we'll find a dropdown. Here we can select the option Query Plan Preview. This updates the content of the Response panel with a representation of how the query will be resolved.
There are two options here: we can view the plan as a chart, or as text. Click Show plan as text, as this gives us a little bit more information about what the router does under the hood.
QueryPlan {
  Fetch(service: "listings") {
    {
      featuredListings {
        id
        title
        description
        numOfBeds
      }
    }
  },
}
This query plan looks a lot like our original query, doesn't it? The biggest difference is the Fetch statement we see at the top. This tells us which service (namely, "listings") the router plans to fetch the following types and fields from. Because our query involves a single service, the router can fetch all of the data in a single request to the listings service.
Now let's look at a query plan that involves both subgraphs. In this case, we'll see that the response from one subgraph depends on the response from another.
In a new Operation tab, paste the following query.
query GetListingAndReviews($listingId: ID!) {
  listing(id: $listingId) {
    title
    description
    numOfBeds
    overallRating
    reviews {
      id
      text
    }
  }
}
And in the Variables panel:
{"listingId": "listing-1"}
To resolve this query, the router has to coordinate data between the two services. In order for the reviews service to provide the reviews and overallRating fields, it first needs to understand which listing object it's providing data for. We see this reflected in the query plan:
QueryPlan {
  Sequence {
    Fetch(service: "listings") {
      {
        listing(id: $listingId) {
          __typename
          id
          title
          description
          numOfBeds
        }
      }
    },
    Flatten(path: "listing") {
      Fetch(service: "reviews") {
        {
          ... on Listing {
            __typename
            id
          }
        } =>
        {
          ... on Listing {
            overallRating
            reviews {
              id
              text
            }
          }
        }
      },
    },
  },
}
Here the router defines a Sequence of steps. It first conducts a Fetch to the listings service for all the listing-specific types and fields. Next, it conveys the information the reviews subgraph needs to provide data for a particular listing: __typename and id, the Listing entity's primary key field. With this information, the router can then request the corresponding reviews data from the reviews subgraph. All the data is packaged together, and returned!
Note: If the response from the reviews subgraph did not depend on the response from the listings subgraph, the query plan would reflect that the two requests run in Parallel, rather than as part of a Sequence.
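As an illustration, a hypothetical plan for an operation whose top-level fields each resolve entirely within a different subgraph might look like the sketch below. (The latestReviews field is made up for this example; it isn't part of the course schema.)

QueryPlan {
  Parallel {
    Fetch(service: "listings") {
      {
        featuredListings {
          id
          title
        }
      }
    },
    Fetch(service: "reviews") {
      {
        latestReviews {
          id
          text
        }
      }
    },
  },
}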
Now that we've seen what query plans look like in action, let's take a closer look at the router's side of things.
Query plans & the router
To investigate what the router is doing, we'll need to provide a new flag to our router start command: --log. This lets us specify the level of verbosity we want in our logs from the router. For our purposes, we'll set the level at trace to better follow what the router is doing when it comes to query plans and caching.
APOLLO_KEY=<APOLLO_KEY> \
APOLLO_GRAPH_REF=<APOLLO_GRAPH_REF> \
./router --config router-config.yaml \
--log trace
Note: Check out the official Apollo documentation to review other log level options.
Restart the router using the command given above, substituting in your own APOLLO_KEY and APOLLO_GRAPH_REF values. When we run the command, we'll see our terminal fill up with a lot more output messages from the router.
Back in Explorer, let's run the GetListingAndReviews operation again.
Jumping back to the router terminal logs, we can scroll to find the query plan output. (You can also search for "query plan" in the logs, and pan through until you find the outputted query plan.)
query plan
Sequence {
  nodes: [
    Fetch(FetchNode {
      service_name: "listings",
      requires: [],
      variable_usages: ["listingId"],
      operation: "query GetListingAndReviews__listings__0($listingId:ID!){listing(id:$listingId){__typename id title description numOfBeds}}",
      // ...etc
Just below this, we can see the more explicit path the router takes to execute its plan. It narrates its first request to the listings subgraph, and logs the response. Then it's able to use this information about the listing entity we're querying to fetch the corresponding data with a new request to the reviews subgraph!
To see more detail about where the router locates its stored query plan, we'll set up a Prometheus dashboard later on. Stay tuned!
Warming up the cache
When we push updates to our schema, it's likely that the query plans for many of our operations will change. We might have migrated fields from one service to another, for instance. This means that when the router loads a new schema, some of its cached query plans might no longer be usable.
To counteract this, the router automatically pre-computes query plans for:
- The most used queries in its cache
- The entire list of persisted queries (more on this in the next lesson!)
To learn more about configuring the number of queries to "warm up" on schema reload, visit the official Apollo documentation.
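For reference, a minimal sketch of what that setting might look like in router-config.yaml is shown below; warmed_up_queries is the key the router documents for this, but double-check its exact placement and value against the documentation for your router version.

# router-config.yaml (sketch)
supergraph:
  query_planning:
    # Pre-plan this many of the most used operations whenever the supergraph schema reloads
    warmed_up_queries: 100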
Key takeaways
- The router generates a query plan to describe the steps it needs to take to resolve each query it receives.
- Before generating a query plan, the router first checks its in-memory cache for it. If it finds it, it uses the cached query plan instead of generating a new one.
- When the router receives an updated supergraph schema, it automatically pre-computes query plans for the most used queries in its cache before switching to the new schema.
- We can use the router's config file, along with the warmed_up_queries key, to define a custom number of query plans to pre-compute.
Up next
There's another category of work that the router takes care of caching in-memory: the hashes for Automatic Persisted Queries (APQ). Let's take a closer look at this in the next lesson.