PostgreSQL Tutorial: Actual time per operation in EXPLAIN ANALYZE

August 25, 2025

Summary: When you’re trying to work out why a query in PostgreSQL is slow, EXPLAIN ANALYZE can be a huge help, but calculating the per-operation timings can get tricky.

The basics

The first thing to know is that the reported time of a parent operation is inclusive of its children. So if we want to know the contribution of the parent operation, we need to subtract the timings reported by its children.

Consider a scan and sort of a small table (omitting the cost statistics, for brevity):

 Sort (actual time=0.049..0.054 rows=51 loops=1)
    Sort Key: c DESC
    Sort Method: quicksort  Memory: 31kB
    ->  Seq Scan on t (actual time=0.012..0.018 rows=51 loops=1)
 Planning time: 0.064 ms
 Execution time: 0.077 ms

Towards the bottom of the query plan we can see the actual total time of the sequential scan was 0.018 ms.

The same metric for its parent operation, the sort, was 0.054 ms.

To get an idea of how much time was spent on the sort itself, rather than the scan and sort together, we subtract the child's time: 0.054 - 0.018 = 0.036 ms.
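The subtraction above can be sketched as a tiny helper (the `self_time` function is a hypothetical name for illustration, not anything PostgreSQL provides):

```python
# Rough sketch: the exclusive (self) time of a plan node is its total
# time minus the totals reported by its direct children.
def self_time(node_total_ms, child_totals_ms):
    return node_total_ms - sum(child_totals_ms)

# The Sort node above: 0.054 ms total, one child (the Seq Scan) at 0.018 ms.
print(round(self_time(0.054, [0.018]), 3))  # 0.036
```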

Watch out for loops

The next biggest reason that timings in query plans are not straightforward is that the time statistics are per-loop averages (means). So when the number of loops is more than 1, we need to remember to multiply the actual total time by the number of loops to get the node's overall total.

Here is a simple nested loop example:

 Nested Loop (actual time=0.061..0.335 rows=86 loops=1)
   ->  Index Only Scan using i on t (actual time=0.014..0.019 rows=9 loops=1)
         Index Cond: (id < 10)
         Heap Fetches: 0
   ->  Index Scan using i2 on t2 (actual time=0.011..0.028 rows=10 loops=9)
         Index Cond: (id = t.id)
 Planning Time: 0.802 ms
 Execution Time: 0.412 ms

We can see a single index-only scan (loops=1) which takes 0.019 ms.

The index scan line reports loops=9, so we need to multiply 0.028 * 9 = 0.252 ms.

This is important: we can now see that the inner index scan accounts for a significant proportion (0.252 / 0.335 ≈ 75%) of the time reported by its parent operation, the nested loop.

We can calculate the additional overhead of the nested loop by subtracting the totals of its children, much like we did above: 0.335 - 0.019 - 0.252 = 0.064 ms.
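The loop arithmetic above can be sketched in a few lines (the names here are illustrative, not from PostgreSQL):

```python
# Per-loop averages must be multiplied by the loop count before
# comparing or subtracting node timings.
def total_time(per_loop_ms, loops):
    return per_loop_ms * loops

inner = total_time(0.028, 9)   # Index Scan: 9 loops at 0.028 ms each
outer = total_time(0.019, 1)   # Index Only Scan: a single loop
nested_loop = 0.335            # total reported by the Nested Loop node

# Overhead of the join itself, after removing its children's time.
overhead = nested_loop - inner - outer
print(round(inner, 3), round(overhead, 3))  # 0.252 0.064
```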

Parallel timings are a bit tricky too

When operations are done in parallel, PostgreSQL reports the number of processes (the parallel workers, plus the leader if it participates) as loops. This means the section above has you covered for calculating the total time spent across processes, but since these run in parallel, that is not a wall-clock time.

For that, the per-loop average gives a rough estimate. For a more accurate number, we can use the VERBOSE parameter to get the individual worker timings, and use the slowest as the wall-clock total for the operation (normally the leader process, if it is participating). It is worth noting that the leader does not report its own statistics, but we can subtract the workers' times from the overall total to work it out. How fun.

Let’s look at an example, from using EXPLAIN (ANALYZE, VERBOSE):

 ->  Parallel Seq Scan on s.t 
     (actual time=4.095..7178.509 rows=236170 loops=3)
       Output: c1, c2, c3
       Worker 0:  actual time=0.118..7160.961 rows=247183 loops=1
       Worker 1:  actual time=0.151..7108.307 rows=227603 loops=1

Here we can see that there were 3 processes (loops=3): Worker 0, Worker 1, plus the leader. On average, each took 7178.509 ms.

(7178.509 * 3) - 7160.961 - 7108.307 = 7266.259 ms

By multiplying the average by three and subtracting the worker timings, we can see that the leader process took the longest of the three, and as such we can use its time as a more accurate estimate of the operation's wall-clock time.
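The leader-time calculation can be expressed as a small sketch (the `leader_time` helper is a hypothetical name):

```python
# The leader's timing line is not reported, so derive it: the total
# across all processes (average * loops) minus the workers' totals.
def leader_time(avg_ms, loops, worker_times_ms):
    return avg_ms * loops - sum(worker_times_ms)

# Figures from the Parallel Seq Scan above.
leader = leader_time(7178.509, 3, [7160.961, 7108.307])
print(round(leader, 3))  # 7266.259
```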

Some subtleties around CTEs

If Postgres opts to materialize a CTE, perhaps because it is being referred to multiple times in the query, then the simple logic of subtracting child operation times from parent operation times breaks down.

Consider this query, which is deliberately easy to follow the timings of, and refers back to the CTE twice:

WITH init AS (
  SELECT * FROM pg_sleep_for('100ms')
  UNION ALL
  SELECT * FROM pg_sleep_for('200ms')
)
(SELECT * FROM init LIMIT 1)
UNION ALL
(SELECT * FROM init);

Now, let's request the execution plan for this using EXPLAIN (ANALYZE, COSTS OFF), to keep it simple:

 Append (actual time=102.409..302.675 rows=3 loops=1)
   CTE init
     ->  Append (actual time=102.397..302.649 rows=2 loops=1)
           ->  Function Scan on pg_sleep pg_sleep_for (actual time=102.396..102.398 rows=1 loops=1)
           ->  Function Scan on pg_sleep pg_sleep_for_1 (actual time=200.243..200.245 rows=1 loops=1)
   ->  Limit (actual time=102.407..102.409 rows=1 loops=1)
         ->  CTE Scan on init init_1 (actual time=102.402..102.402 rows=1 loops=1)
   ->  CTE Scan on init (actual time=0.001..200.256 rows=2 loops=1)
 Planning Time: 47.003 ms
 Execution Time: 303.905 ms

If we look carefully, we can see that the ~300 ms is reported twice: once in the CTE Scans and the Limit, and again in the CTE init Append (and its children). In this simple example we can easily catch this, but in more complex plans it can cause confusion and is worth staying mindful of.
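One way to spot the double counting is to compare a naive sum of the node timings against the reported execution time (the figures below are taken from the plan above):

```python
# The sleeps are reported once under "CTE init" and again where the
# materialized result is scanned, so a naive sum exceeds the runtime.
cte_init = 302.649               # Append inside CTE init
cte_scans = 102.402 + 200.256    # the two scans of the materialized CTE
execution = 303.905              # reported Execution Time

print(cte_init + cte_scans > execution)  # True: double counting
```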

See more

PostgreSQL Optimization