ggspec

May 6, 2026 · View on GitHub

ggspec extracts the full declarative specification of a ggplot2 object — layers, aesthetic mappings, scales, facets, coordinate system, and labels — as tidy data frames. A second tier of functions enables structural comparison of two ggplot objects, supporting automated plot testing, auditing, and framework-agnostic grading workflows.

Motivation

Different large-language models and AI coding assistants generate syntactically different code for the same visualisation task. One AI might write geom_bar(aes(x = species)) on raw data; another might write count(species) |> ... geom_col(aes(x = species, y = n)). Both produce the same chart, but naive string or AST comparison would flag them as different. ggspec provides a principled hierarchy of equivalence checks — from strict spec equality through structural canonicalisation to rendered-output comparison — so that equivalent plots are recognised as equivalent regardless of which syntactic path an AI (or human student) took to produce them.

Installation

Install the development version from GitHub:

# install.packages("remotes")
remotes::install_github("clement-lee/ggspec")

Usage

Extracting a spec

``$ \text{r} \text{library}(\text{ggspec}) \text{library}(\text{ggplot2}) #> #> \text{Attaching} \text{package}: '\text{ggplot2}' #> \text{The} \text{following} \text{object} \text{is} \text{masked} \text{from} '\text{package}:\text{ggspec}': #> #> \text{is_ggplot}

\text{p} <- \text{ggplot}(\text{mpg}, \text{aes}(\text{displ}, \text{hwy})) + \text{geom_point}(\text{aes}(\text{colour} = \text{class})) + \text{geom_smooth}(\text{method} = "\text{lm}", \text{se} = \text{FALSE}) + \text{facet_wrap}(~\text{drv}) + \text{labs}(\text{title} = "\text{Engine} \text{displacement} \text{vs} \text{highway} \text{MPG}")

\text{spec_layers}(\text{p}) #> # \text{A} \text{tibble}: 3 \times 8 #> \text{layer} \text{geom} \text{stat} \text{position} \text{mapping} \text{params} \text{inherit_aes} \text{data_id} #> <\text{int}> <\text{chr}> <\text{chr}> <\text{chr}> <\text{list}> <\text{list}> <\text{lgl}> <\text{int}> #> 1 0 <\text{NA}> <\text{NA}> <\text{NA}> <\text{chr} [2]> <\text{list} [0]> \text{NA} 1 #> 2 1 \text{point} \text{identity} \text{identity} <\text{chr} [3]> <\text{named} \text{list} [2]> \text{TRUE} \text{NA} #> 3 2 \text{smooth} \text{smooth} \text{identity} <\text{chr} [2]> <\text{named} \text{list} [7]> \text{TRUE} \text{NA} \text{spec_aes}(\text{p}) #> # \text{A} \text{tibble}: 7 \times 5 #> \text{layer} \text{geom} \text{aesthetic} \text{variable} \text{source} #> <\text{int}> <\text{chr}> <\text{chr}> <\text{chr}> <\text{chr}> #> 1 0 <\text{NA}> \text{x} \text{displ} \text{global} #> 2 0 <\text{NA}> \text{y} \text{hwy} \text{global} #> 3 1 \text{point} \text{x} \text{displ} \text{global} #> 4 1 \text{point} \text{y} \text{hwy} \text{global} #> 5 1 \text{point} \text{colour} \text{class} \text{local} #> 6 2 \text{smooth} \text{x} \text{displ} \text{global} #> 7 2 \text{smooth} \text{y} \text{hwy} \text{global} $``

Comparing two plots

ref <- ggplot(mpg, aes(displ, hwy)) +
  geom_point(aes(colour = class)) +
  facet_wrap(~drv)

obs_correct <- ggplot(mpg, aes(displ, hwy)) +
  geom_point(aes(colour = class)) +
  facet_wrap(~drv)

obs_wrong <- ggplot(mpg, aes(displ, hwy)) +
  geom_smooth() +
  facet_wrap(~cyl)

equiv_plot(ref, obs_correct)
#> [PASS mode=strict] 6/6 checks passed
#>   Detail:
#> # A tibble: 9 × 12
#>   check  source layer geom  stat    position aesthetic variable status label_ref
#>   <chr>  <chr>  <int> <chr> <chr>   <chr>    <chr>     <chr>    <chr>  <chr>    
#> 1 layers ref        0 <NA>  <NA>    <NA>     <NA>      <NA>     <NA>   <NA>     
#> 2 layers ref        1 point identi… identity <NA>      <NA>     <NA>   <NA>     
#> 3 layers obs        0 <NA>  <NA>    <NA>     <NA>      <NA>     <NA>   <NA>     
#> 4 layers obs        1 point identi… identity <NA>      <NA>     <NA>   <NA>     
#> 5 aes    global     0 <NA>  <NA>    <NA>     x         displ    match  <NA>     
#> 6 aes    global     0 <NA>  <NA>    <NA>     y         hwy      match  <NA>     
#> 7 aes    global     1 point <NA>    <NA>     x         displ    match  <NA>     
#> 8 aes    global     1 point <NA>    <NA>     y         hwy      match  <NA>     
#> 9 aes    local      1 point <NA>    <NA>     colour    class    match  <NA>     
#> # ℹ 2 more variables: label_obs <chr>, match <list>
equiv_plot(ref, obs_wrong)
#> [FAIL mode=strict] 3/6 checks passed: Missing geom(s): point.; Aesthetic mapping issue(s): colour->class (layer 1).; Facet mismatch: cols: 'drv' vs 'cyl'
#>   Detail:
#> # A tibble: 9 × 12
#>   check  source layer geom   stat   position aesthetic variable status label_ref
#>   <chr>  <chr>  <int> <chr>  <chr>  <chr>    <chr>     <chr>    <chr>  <chr>    
#> 1 layers ref        0 <NA>   <NA>   <NA>     <NA>      <NA>     <NA>   <NA>     
#> 2 layers ref        1 point  ident… identity <NA>      <NA>     <NA>   <NA>     
#> 3 layers obs        0 <NA>   <NA>   <NA>     <NA>      <NA>     <NA>   <NA>     
#> 4 layers obs        1 smooth smooth identity <NA>      <NA>     <NA>   <NA>     
#> 5 aes    local      1 point  <NA>   <NA>     colour    class    missi… <NA>     
#> 6 aes    global     0 <NA>   <NA>   <NA>     x         displ    match  <NA>     
#> 7 aes    global     0 <NA>   <NA>   <NA>     y         hwy      match  <NA>     
#> 8 aes    global     1 point  <NA>   <NA>     x         displ    match  <NA>     
#> 9 aes    global     1 point  <NA>   <NA>     y         hwy      match  <NA>     
#> # ℹ 2 more variables: label_obs <chr>, match <list>

Comparison modes

ggspec recognises four levels of plot equivalence or similarity, ordered from most to least restrictive.

Strict equivalence

Two plots are strictly equivalent when their specifications are identical with no canonicalisation applied: same layer order, same data/mapping placement, same geom names, same random seed for stochastic elements.

compare_plots(p1, p2, mode = "strict")

Structural equivalence

Two plots are structurally equivalent when their specifications are identical after canonicalisation via canon(). The canonical form is computed by a term rewriting system (TRS) that applies a fixed set of confluent rewrite rules:

fold_global: resolves the ambiguity of placing data/mapping at the global level vs per-layer — both forms normalise to the same spec.
geom_col_to_bar: geom_col() is a shorthand for geom_bar(stat = "identity"); the canonical form always uses the latter.
layer_order: non-spanning geoms are sorted alphabetically, so layer order does not affect structural equivalence.

compare_plots(p1, p2, mode = "structural")  # default

Visual equivalence

Two plots are visually equivalent when they produce identical rendered output. This pathway uses ggplot_build() to evaluate plots semantically rather than comparing their specs, so it can detect equivalences that structural comparison cannot:

geom_bar() on raw data vs geom_col() on pre-counted data (same bars, different specs).
coord_flip() vs swapped aesthetics (same visual output, different coordinate systems).
scale_fill_*(name = "v") vs labs(fill = "v") (same legend label, different spec location).

compare_plots(p1, p2, mode = "visual")

Design constraint: visual equivalence calls ggplot_build() on both plots. This means (a) both plots must be buildable with their data accessible in the session, (b) it is slower than structural comparison, and (c) it is output-based — it does not verify that the plots were derived from the same source data. Two plots backed by different datasets that happen to produce the same rendered output will pass visual equivalence. Use structural mode when data provenance must be verified.

Conceptual similarity

Two plots are conceptually similar when they communicate the same information using potentially different visual encodings. Unlike the equivalence modes above, conceptual similarity is not a strict mathematical equivalence relation; each claim is qualified by a WHEN condition:

Claim	WHEN
boxplot, violin, jitter all similar	1 continuous + 1 discrete variable
density, histogram, freqpoly, dotplot all similar	1 continuous variable
`geom_count`, `geom_point(aes(size = n))` similar	2 discrete variables, joint counts

compare_plots(p1, p2, mode = "conceptual")

Enriching a spec with build-derived defaults

enrich_spec() uses ggplot_build() to identify which parameters and aesthetics were explicitly set by the user versus filled in by ggplot2:

es <- enrich_spec(p)
#> `geom_smooth()$ \text{using} \text{formula} = '\text{y} ~ \text{x}'

# \text{Non}-\text{aesthetic} \text{parameters} \text{with} \text{explicit} \text{flag}
\text{es}$\text{params\_tbl}[[1]]
#> # \text{A} \text{tibble}: 0  \times  4
#> # ℹ 4 \text{variables}: \text{param} <\text{chr}>, \text{value} <\text{list}>, \text{explicit} <\text{lgl}>, \text{source} <\text{chr}>

# \text{Aesthetics} \text{resolved} \text{by} \text{ggplot2}, \text{with} \text{explicit} \text{flag}
\text{es}$\text{built\_aes}[[1]]
#> # \text{A} \text{tibble}: 0  \times  3
#> # ℹ 3 \text{variables}: \text{aesthetic} <\text{chr}>, \text{value} <\text{list}>, \text{explicit} <\text{lgl}>
$``

## Key functions

| Tier       | Function               | What it returns                         |
| ---------- | ---------------------- | --------------------------------------- |
| Extraction | `spec_layers()`        | One row per layer                       |
| Extraction | `spec_aes()`           | One row per layer × aesthetic           |
| Extraction | `spec_scales()`        | One row per scale                       |
| Extraction | `spec_facets()`        | Facet type and variables                |
| Extraction | `spec_labels()`        | One row per label                       |
| Extraction | `spec_coord()`         | Coordinate system                       |
| Extraction | `enrich_spec()`        | spec\_layers + default/explicit flags   |
| Comparison | `equiv_plot()`         | All checks in one call (strict)         |
| Comparison | `equiv_layers()`       | Geom and stat per layer                 |
| Comparison | `equiv_aes()`          | Aesthetic mappings                      |
| Comparison | `compare_plots()`      | Four-mode comparison entry point        |
| Comparison | `compare_visual()`     | Visual equivalence via `ggplot_build()` |
| Comparison | `compare_conceptual()` | Conceptual similarity detectors         |
| Comparison | `equiv_rendered()`     | Rendered layer data comparison          |
| Check      | `check_plot()`         | Framework-agnostic assertion            |
| Check      | `expect_equiv_plot()`  | testthat expectation                    |

## Related packages

  - **[ggcheck](https://github.com/rstudio/ggcheck)** — designed for
    `learnr`/`gradethis` pipelines; returns ad-hoc objects. `ggspec`
    returns rectangular, pipeable tibbles and has no grading framework
    dependency.

## License

MIT