Categorical/categorical interactions

Data Analysis for Psychology in R 2

Elizabeth Pankratz (elizabeth.pankratz@ed.ac.uk)

Department of Psychology
University of Edinburgh
2025–2026

Course Overview

Introduction to linear Models	Intro to linear regression
	Interpreting linear models
	Testing individual predictors
	Model testing & comparison
	Linear model analysis
Analysing Experimental Studies	Categorical predictors and dummy coding
	Effect coding and manual post-hoc contrasts
	Assumptions and diagnostics
	Bootstrapping and confidence intervals
	Categorical predictors: Practice analysis

Interactions	Mean-centering and numeric/categorical interactions
	Numeric/numeric interactions
	Categorical/categorical interactions
	Manual contrast interactions and multiple comparisons
	Interactions: Practice analysis
Advanced Topics	Power analysis
	Binary logistic regression I
	Binary logistic regression II
	Logistic regression: Practice analysis
	Exam prep and course Q&A

Tech check and warm-up

wooclap.com, enter code GECHXE

This week’s learning objectives

What does the interaction coefficient of a linear model mean?

If we know the contrast coding for two interacting categorical predictors, how do we work out the coding of the interaction term?

What’s the difference between a “simple slope” and a “simple effect”?

How can we calculate how many interaction terms a model will have when two categorical predictors interact?

Where we are in the analysis plan today

What’s an interaction again?

An interaction is how we allow a model to estimate that the association between one predictor and the outcome is different, depending on the value of another predictor.

Today’s data

Post-travel anxiety ratings for people using two different kinds of transport:

Motorist:

Cyclist:

on two different road_types:

RuralRoad:

(all images from pixabay)

CityStreet:

Today’s data: One way to look at it

anx1 |>
  head(12)

# A tibble: 12 x 4
   ppt_id   anx transport road_type 
    <dbl> <dbl> <fct>     <fct>     
 1      1    39 Cyclist   RuralRoad 
 2      2    41 Cyclist   CityStreet
 3      3    38 Motorist  RuralRoad 
 4      5    18 Motorist  RuralRoad 
 5      7    25 Motorist  RuralRoad 
 6      8    29 Motorist  CityStreet
 7     10    38 Cyclist   CityStreet
 8     12    11 Cyclist   RuralRoad 
 9     13    33 Cyclist   CityStreet
10     14    11 Motorist  RuralRoad 
11     15    40 Motorist  RuralRoad 
12     16    19 Motorist  CityStreet

Does the difference in anxiety after travelling on different road types depend on the kind of transport?

Or, specifically: Is the difference in anxiety after travelling on rural roads vs. city streets different for motorists and cyclists?

Today’s data: Another equivalent way

anx1 |>
  head(12)

# A tibble: 12 x 4
   ppt_id   anx transport road_type 
    <dbl> <dbl> <fct>     <fct>     
 1      1    39 Cyclist   RuralRoad 
 2      2    41 Cyclist   CityStreet
 3      3    38 Motorist  RuralRoad 
 4      5    18 Motorist  RuralRoad 
 5      7    25 Motorist  RuralRoad 
 6      8    29 Motorist  CityStreet
 7     10    38 Cyclist   CityStreet
 8     12    11 Cyclist   RuralRoad 
 9     13    33 Cyclist   CityStreet
10     14    11 Motorist  RuralRoad 
11     15    40 Motorist  RuralRoad 
12     16    19 Motorist  CityStreet

Does the difference in anxiety after travelling on different road types depend on the kind of transport?

Or, equivalently: Is the difference in anxiety between motorists and cyclists different after travelling on rural roads and on city streets?

Let’s play with the numbers

How different are the differences between groups?

wooclap.com, enter code GECHXE

Let’s model it!

First: Set up factor levels, check contrast coding

anx1 <- anx1 |>
  mutate(
    transport = factor(transport, levels = c('Motorist', 'Cyclist')),
    road_type = factor(road_type, levels = c('RuralRoad', 'CityStreet')),
  )

The default contrast coding (treatment coding):

contrasts(anx1$transport)

         Cyclist
Motorist       0
Cyclist        1

contrasts(anx1$road_type)

           CityStreet
RuralRoad           0
CityStreet          1

The interaction’s contrast is the product of the two interacting contrasts

In other words: To get the interaction’s contrast, we multiply together each pair of coding values from the interacting predictors.

transport	road_type	transportCyclist	road_typeCityStreet	tC:rtCS
Motorist	RuralRoad	0	0	0 * 0 = 0
Motorist	CityStreet	0	1	0 * 1 = 0
Cyclist	RuralRoad	1	0	1 * 0 = 0
Cyclist	CityStreet	1	1	1 * 1 = 1

Think–Pair–Share: Cat/cat interaction coefs

The interaction model anx ~ transport * road_type will have four coefficients:

Intercept (but you’re pretty good at interpreting the intercept by now, so we’ll spend time on the harder stuff!)
transportCyclist
road_typeCityStreet
transportCyclist:road_typeCityStreet

Without looking ahead, try to figure out: What will each coefficient represent?

Think to yourself about each coefficient’s meaning. / Pair up with your neighbour and think together.

Then share on wooclap.com, enter code GECHXE

Fitting the model

\[ \text{anx} = \beta_0 + (\beta_1 \cdot \text{transport}) + (\beta_2 \cdot \text{road_type}) + (\beta_3 \cdot \text{transport} \cdot \text{road_type}) + \epsilon \]

m1 <- lm(anx ~ transport * road_type, data = anx1)

summary(m1)


Call:
lm(formula = anx ~ transport * road_type, data = anx1)

Residuals:
     Min       1Q   Median       3Q      Max 
-23.1176  -7.0303  -0.1963   6.0811  22.5263 

Coefficients:
                                     Estimate Std. Error t value Pr(>|t|)    
(Intercept)                           27.0303     1.6213  16.672   <2e-16 ***
transportCyclist                       0.4434     2.2162   0.200   0.8417    
road_typeCityStreet                    4.8886     2.2300   2.192   0.0300 *  
transportCyclist:road_typeCityStreet   7.7553     3.1316   2.476   0.0145 *  
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 9.314 on 138 degrees of freedom
Multiple R-squared:  0.2413,    Adjusted R-squared:  0.2248 
F-statistic: 14.63 on 3 and 138 DF,  p-value: 2.52e-08

Interpreting the model (1)

                                     Estimate Std. Error t value Pr(>|t|)
(Intercept)                             27.03       1.62   16.67     0.00
transportCyclist                         0.44       2.22    0.20     0.84
road_typeCityStreet                      4.89       2.23    2.19     0.03
transportCyclist:road_typeCityStreet     7.76       3.13    2.48     0.01

Intercept

When transport = 0 = Motorist and road_type = 0 = RuralRoad, the estimated average anxiety is 27.03 points.

transportCyclist

Specifically when road_type = 0 = RuralRoad, being a cyclist is associated with an increase in anxiety of 0.44 points.

road_typeCityStreet

Specifically when transport = 0 = Motorist, being on city streets is associated with an increase in anxiety of 4.89 points.

Interpreting the model (2)

                                     Estimate Std. Error t value Pr(>|t|)
(Intercept)                             27.03       1.62   16.67     0.00
transportCyclist                         0.44       2.22    0.20     0.84
road_typeCityStreet                      4.89       2.23    2.19     0.03
transportCyclist:road_typeCityStreet     7.76       3.13    2.48     0.01

transportCyclist:road_typeCityStreet

Being a cyclist changes the association of road_type with anx by 7.76 points.
So, the association of road_type with anx when …
- transport = 0 = Motorist is 4.89
- transport = 1 = Cyclist is 4.89 + 7.76 = 12.65

Equivalently, being on city streets changes the association of transport with anx by 7.76 points.
So, the association of transport with anx when …
- road_type = 0 = RuralRoad is 0.44
- road_type = 1 = CityStreet is 0.44 + 7.76 = 8.2

Mapping model coefficients to group means

A more formal version of “find the difference of differences”. (You’ll practice this more in labs!)

Model coefficients:

                                     Estimate Std. Error t value Pr(>|t|)
(Intercept)                             27.03       1.62   16.67     0.00
transportCyclist                         0.44       2.22    0.20     0.84
road_typeCityStreet                      4.89       2.23    2.19     0.03
transportCyclist:road_typeCityStreet     7.76       3.13    2.48     0.01

Group means:

	RuralRoad	CityStreet
Motorist	27.03	31.92
Cyclist	27.47	40.12

How model coefficients and group means are related:

`(Intercept)`	mean(Motorist, RuralRoad)	27.03
`transportCyclist`	mean(Cyclist, RuralRoad) – mean(Motorist, RuralRoad)	27.47 – 27.03 = 0.44
`road_typeCityStreet`	mean(Motorist, CityStreet) – mean(Motorist, RuralRoad)	31.92 – 27.03 = 4.89
`tC:rtCS`	[mean(Cyclist, CityStreet) – mean(Motorist, CityStreet)] – [mean(Cyclist, RuralRoad) – mean(Motorist, RuralRoad)]	(40.12 – 31.92) – (27.47 – 27.03) = 7.76

Visualising the model’s estimates (1)

When we had continuous predictors, we used probe_interaction() from the interactions library.

Now that we have only categorical predictors, we must use cat_plot() instead.

cat_plot(
  m1,
  pred = transport,
  modx = road_type,
)

If you like to think about interactions in terms of differences in vertical distances between two groups, then this way of visualising things might work for you.

Visualising the model’s estimates (2)

cat_plot() can also link each group mean with lines, using the argument geom = "line".

cat_plot(
  m1,
  pred = transport,
  modx = road_type,
  geom = 'line'
)

If you like to think about interactions in terms of differences of slopes between two groups, then this way of visualising things might work for you.

Calculating simple effects

“Simple effects”? “Simple slopes”?

Simple effects: The association of a categorical predictor with the outcome, at a specific value of another predictor.
Simple slopes: The association of a continuous predictor with the outcome, at a specific value of another predictor.

Calculating simple effects by hand

We need the model’s linear expression:

\[ \text{anx} = \beta_0 + (\beta_1 \cdot \text{transport}) + (\beta_2 \cdot \text{road_type}) + (\beta_3 \cdot \text{transport} \cdot \text{road_type}) + \epsilon \\ \]

And the model’s coefficient estimates:

coef(m1) |> round(2)

                         (Intercept)                     transportCyclist 
                               27.03                                 0.44 
                 road_typeCityStreet transportCyclist:road_typeCityStreet 
                                4.89                                 7.76

We substitute the coefficient estimates into the linear expression, to give:

\[ \text{anx} = 27.03 + (0.44 \cdot \text{transport}) + (4.89 \cdot \text{road_type}) + (7.76 \cdot \text{transport} \cdot \text{road_type}) + \epsilon \\ \]

The simple effect of `transport` for rural roads

Rural roads are represented by road_type = 0, so we substitute 0 for \(\text{road_type}\).

This is the equation for the blue line in this plot:

When we ask for a “simple effect”, we ask for the slope of this line: the difference between groups at a specific level of another predictor. So here, the simple effect is 0.44.

The simple effect of `transport` for city streets

City streets are represented by road_type = 1, so we substitute 1 for \(\text{road_type}\) below.

\[\begin{align} \text{anx} &= 27.03 + (0.44 \cdot \text{transport}) + (4.89 \cdot \text{road_type}) + (7.76 \cdot \text{transport} \cdot \text{road_type}) + \epsilon \\ \text{anx}_{\text{City}} &= 27.03 + (0.44 \cdot \text{transport}) + (4.89 \cdot 1) + (7.76 \cdot \text{transport} \cdot 1) + \epsilon \\ \text{anx}_{\text{City}} &= 27.03 + 4.89 + (0.44 \cdot \text{transport}) + (7.76 \cdot \text{transport}) + \epsilon \\ \text{anx}_{\text{City}} &= 31.92 + ((0.44 + 7.76) \cdot \text{transport}) + \epsilon \\ \text{anx}_{\text{City}} &= 31.92 + (8.2 \cdot \text{transport}) + \epsilon \\ \end{align}\]

This is the equation for the orange line in this plot:

If we ask for the simple effect of transport for city streets, it’s the slope of this line: 8.2.

What about simple effects of `road_type`?

Also doable! Those would be the symmetrical simple effects: also true, just a different angle of looking at the same data.

Those simple effects would match the blue and orange slopes in this plot:

I’ll leave calculating those lines to you—it’s good practice :)

Interactions with 2x3 data, treatment-coded

Post-travel anxiety ratings for people using two different kinds of transport:

Motorist:

Cyclist:

now on three different road_types:

RuralRoad:

CityStreet:

DualCarr(iageway):

Visualise the data

The data is exactly the same as before, plus the new category of DualCarr.

Set up the data

Factor levels:

anx2 <- anx2 |>
  mutate(
    transport = factor(transport, levels = c('Motorist', 'Cyclist')),
    road_type = factor(road_type, levels = c('RuralRoad', 'CityStreet', 'DualCarr')),
  )

The default contrast coding (treatment coding):

contrasts(anx2$transport)

         Cyclist
Motorist       0
Cyclist        1

contrasts(anx2$road_type)

           CityStreet DualCarr
RuralRoad           0        0
CityStreet          1        0
DualCarr            0        1

Our model of this data will have two interaction terms:

transportCyclist:road_typeCityStreet and
transportCyclist:road_typeDualCarr

In general, how many interaction terms will a model have?

The number of interaction terms is given by

\[ (r - 1) \times (c - 1) \]

\(r\) (which stands for “rows”) is the number of levels in the first interacting predictor;
\(c\) (which stands for “columns”) is the number of levels in the second interacting predictor.

To see why we are talking about “rows” and “columns”, imagine the variables arranged like this:

	RuralRoad	CityStreet	DualCarr
Motorist
Cyclist

So in our 2x3 data, where \(r=2\) and \(c=3\), we have

\[\begin{align} ~& (r - 1) \times (c - 1)\\ =~& (2 - 1) \times (3 - 1)\\ =~& 1 \times 2\\ =~& 2 \end{align}\]

interaction terms.

Each interaction’s contrast is the product of the two interacting contrasts (1)

Let’s start with transportCyclist:road_typeCityStreet:

transport	road_type	transportCyclist	road_typeCityStreet	tC:rtCS
Motorist	RuralRoad	0	0	?
Motorist	CityStreet	0	1	?
Motorist	DualCarr	0	0	?
Cyclist	RuralRoad	1	0	?
Cyclist	CityStreet	1	1	?
Cyclist	DualCarr	1	0	?

wooclap.com, enter code GECHXE

Each interaction’s contrast is the product of the two interacting contrasts (2)

Continue on with transportCyclist:road_typeDualCarr:

transport	road_type	transportCyclist	road_typeDualCarr	tC:rtDC
Motorist	RuralRoad	0	0	?
Motorist	CityStreet	0	0	?
Motorist	DualCarr	0	1	?
Cyclist	RuralRoad	1	0	?
Cyclist	CityStreet	1	0	?
Cyclist	DualCarr	1	1	?

wooclap.com, enter code GECHXE

Model time

Fit the model

\[ \begin{align} \text{anx} ~=~& \beta_0 + (\beta_1 \cdot \text{transport}) + (\beta_2 \cdot \text{road_type}_{\text{CS}}) + (\beta_3 \cdot \text{road_type}_{\text{DC}}) + \\ & (\beta_4 \cdot \text{transport} \cdot \text{road_type}_{\text{CS}}) + (\beta_5 \cdot \text{transport} \cdot \text{road_type}_{\text{DC}}) + \epsilon \end{align} \]

m2 <- lm(anx ~ transport * road_type, data = anx2)

summary(m2)


Call:
lm(formula = anx ~ transport * road_type, data = anx2)

Residuals:
     Min       1Q   Median       3Q      Max 
-23.8235  -7.0303   0.3514   6.1765  22.5263 

Coefficients:
                                     Estimate Std. Error t value Pr(>|t|)    
(Intercept)                           27.0303     1.6584  16.299  < 2e-16 ***
transportCyclist                       0.4434     2.2669   0.196  0.84512    
road_typeCityStreet                    4.8886     2.2811   2.143  0.03325 *  
road_typeDualCarr                      6.0197     2.2404   2.687  0.00779 ** 
transportCyclist:road_typeCityStreet   7.7553     3.2033   2.421  0.01633 *  
transportCyclist:road_typeDualCarr    14.3301     3.1744   4.514 1.06e-05 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 9.527 on 210 degrees of freedom
Multiple R-squared:  0.3692,    Adjusted R-squared:  0.3542 
F-statistic: 24.58 on 5 and 210 DF,  p-value: < 2.2e-16

Interpreting the model (1)

                                      Estimate 
(Intercept)                              27.03 
transportCyclist                          0.44 
road_typeCityStreet                       4.89 
road_typeDualCarr                         6.02 
transportCyclist:road_typeCityStreet      7.76 
transportCyclist:road_typeDualCarr       14.33

Intercept (same as before)

For Motorists on RuralRoads (i.e., at the reference levels of transport and road_type, where all predictors = 0), the estimated average anx is 27.03 points.

transportCyclist (same as before)

Specifically at the reference level of road_type (RuralRoad), being a cyclist is associated with an increase in anxiety of 0.44 points.

road_typeCityStreet (same as before)

Specifically at the reference level of transport (Motorist), being on city streets is associated with an increase in anxiety of 4.89 points.

road_typeDualCarr (new!)

Specifically at the reference level of transport (Motorist), being on dual carriageways is associated with an increase in anxiety of 6.02 points.

Interpreting the model (2)

                                      Estimate 
(Intercept)                              27.03 
transportCyclist                          0.44 
road_typeCityStreet                       4.89 
road_typeDualCarr                         6.02 
transportCyclist:road_typeCityStreet      7.76 
transportCyclist:road_typeDualCarr       14.33

transportCyclist:road_typeCityStreet

Being a cyclist changes the association of road_typeCityStreet with anx by 7.76 points.
So, the association of road_typeCityStreet with anx when …
- transport = 0 = Motorist is 4.89
- transport = 1 = Cyclist is 4.89 + 7.76 = 12.65
Equivalently, being on city streets changes the association of transport with anx by 7.76 points.
So, the association of transport with anx when …
- road_typeCityStreet = 0 = RuralRoad is 0.44
- road_typeCityStreet = 1 = CityStreet is 0.44 + 7.76 = 8.2

Interpreting the model (3)

                                      Estimate 
(Intercept)                              27.03 
transportCyclist                          0.44 
road_typeCityStreet                       4.89 
road_typeDualCarr                         6.02 
transportCyclist:road_typeCityStreet      7.76 
transportCyclist:road_typeDualCarr       14.33

transportCyclist:road_typeDualCarr

Being a cyclist changes the association of road_typeDualCarr with anx by about 14.33 points.
So, the association of road_type DualCarr with anx when …
- transport = 0 = Motorist is 6.02
- transport = 1 = Cyclist is 6.02 + 14.33 = 20.35
Equivalently, being on dual carriageways changes the association of transport with anx by 14.33 points.
So, the association of transport with anx when …
- road_typeDualCarr = 0 = RuralRoad is 0.44
- road_typeDualCarr = 1 = DualCarr is 0.44 + 14.33 = 14.77

Visualising the model’s estimates with `cat_plot()`

One option:

cat_plot(
  m2,
  pred = transport,
  modx = road_type,
)

Another option:

cat_plot(
  m2,
  pred = transport,
  modx = road_type,
  geom = 'line'
)

Why do we need more than one interaction term to capture how these predictors interact?

wooclap.com, enter code GECHXE

The big picture: Interactions

Whenever one predictor’s association with the outcome depends on another predictor, then we’re dealing with interactions.

Beyond two-way interactions

So far, we’ve been focusing on interactions between two predictors, called “two-way interactions”.

But in principle, we could throw another variable into the mix:

Maybe countries with different amounts of bike infrastructure differ in the anxiety that cyclists vs. motorists feel on different road types.

This would be a three-way interaction between country, transport, and road type.

My advice to you: Try not to design studies that involve three-way interactions.

They are tricky to interpret.
The three-way interaction term itself is often a fairly small number, and we usually don’t have enough statistical power to reliably detect it.

Back matter

Revisiting this week’s learning objectives

What does the interaction coefficient of a linear model mean?

How the slope of one predictor changes, when the other predictor goes from 0 to 1. (This interpretation applies to all kinds of predictors, both continuous and categorical.)
The interaction term is not a slope on its own. It is an adjustment value that we add to one of the predictor’s slopes.
If we have two interacting predictors, A and B, then the interaction term tells us
- how much does the association between A and the outcome (i.e., the slope of A) change, when B goes from 0 to 1?
- how much does the association between B and the outcome (i.e., the slope of B) change, when A goes from 0 to 1?

If we know the contrast coding for two interacting categorical predictors, how do we work out the coding of the interaction term?

Get all combinations of levels in the interacting predictors.
Multiply each pair of coding values together (i.e., get the product of each pair of values).
The resulting list of numbers is the coding of the interaction.

Revisiting this week’s learning objectives

What’s the difference between a “simple slope” and a “simple effect”?

The only difference is what kind of predictor we’re looking at.
Simple effects: The association of a categorical predictor with the outcome, at a specific value of another predictor (either categorical or continuous).
Simple slopes: The association of a continuous predictor with the outcome, at a specific value of another predictor (either categorical or continuous).

How can we calculate how many interaction terms a model will have when two categorical predictors interact?

Based on how many levels each predictor has.

\[ (r - 1) \times (c - 1) \]

\(r\) is the number of levels in the first interacting predictor.
\(c\) is the number of levels in the second interacting predictor.

This week

Tasks:

Attend your lab and work together on the exercises

Support:

Help each other on the Piazza forum

Complete the weekly quiz

Attend office hours (see Learn page for details)

Appendix

Key insight: The difference between differences is the same both ways

One way:

\[ 12.65 - 4.89 = 7.76 \]

Another way:

\[ 8.2 - 0.44 = 7.76 \]

No matter which angle we look at the interaction data from, the difference between differences—which appears in the model as the coefficient of the interaction term—is the same.

As we saw last week: interactions are symmetrical.

(In practice, people usually only report the angle that makes the most sense for their research question.)

Difference of differences 1

(Motorist and RuralRoad are the reference levels)

Group means:

	RuralRoad	CityStreet
Motorist	27.03	31.92
Cyclist	27.47	40.12

For motorists, the difference between city and rural:

\[ \begin{align} & \text{CityStreet}~ - \text{RuralRoad} \\ =~ & 31.92 - 27.03 \\ =~ & 4.89 \\ \end{align} \]

For cyclists, the difference between city and rural:

\[ \begin{align} & \text{CityStreet}~ - \text{RuralRoad} \\ =~ & 40.12 - 27.47 \\ =~ & 12.65 \\ \end{align} \]

The difference between cyclists’ difference and motorists’ difference:

\[ \begin{align} & \text{CyclistDiff}~ - \text{MotoristDiff} \\ =~ & 12.65 - 4.89 \\ =~ & 7.76 \\ \end{align} \]

Difference of differences 2

(Motorist and RuralRoad are the reference levels)

Group means:

	RuralRoad	CityStreet
Motorist	27.03	31.92
Cyclist	27.47	40.12

For rural roads, the difference between cyclists and motorists:

\[ \begin{align} & \text{Cyclist}~ - \text{Motorist} \\ =~ & 27.47 - 27.03 \\ =~ & 0.44 \\ \end{align} \]

For city streets, the difference between cyclists and motorists:

\[ \begin{align} & \text{Cyclist}~ - \text{Motorist} \\ =~ & 40.12 - 31.92 \\ =~ & 8.2 \\ \end{align} \]

The difference between city streets’ difference and rural roads’ difference:

\[ \begin{align} & \text{CityStreetsDiff}~ - \text{RuralRoadsDiff} \\ =~ & 8.2 - 0.44 \\ =~ & 7.76 \\ \end{align} \]

Mapping 2x3 model coefs to group means

Model coefficients:

                                      Estimate 
(Intercept)                              27.03 
transportCyclist                          0.44 
road_typeCityStreet                       4.89 
road_typeDualCarr                         6.02 
transportCyclist:road_typeCityStreet      7.76 
transportCyclist:road_typeDualCarr       14.33

Group means:

	RuralRoad	CityStreet	DualCarr
Motorist	27.03	31.92	33.05
Cyclist	27.47	40.12	47.82

How model coefficients and group means are related:

`(Intercept)`	mean(Motorist, RuralRoad)	27.03
`transportCyclist`	mean(Cyclist, RuralRoad) – mean(Motorist, RuralRoad)	27.47 – 27.03 = 0.44
`road_typeCityStreet`	mean(Motorist, CityStreet) – mean(Motorist, RuralRoad)	31.92 – 27.03 = 4.89
`road_typeDualCarr`	mean(Motorist, DualCarr) – mean(Motorist, RuralRoad)	33.05 – 27.03 = 6.02
`tC:rtCS`	[mean(Cyclist, CityStreet) – mean(Motorist, CityStreet)] – [mean(Cyclist, RuralRoad) – mean(Motorist, RuralRoad)]	(40.12 – 31.92) – (27.47 – 27.03) = 7.76
`tC:rtDC`	[mean(Cyclist, DualCarr) – mean(Motorist, DualCarr)] – [mean(Cyclist, RuralRoad) – mean(Motorist, RuralRoad)]	(47.82 – 33.05) – (27.47 – 27.03) = 14.33

Categorical/categorical interactions

Course Overview

Tech check and warm-up

This week’s learning objectives

Where we are in the analysis plan today

What’s an interaction again?

Today’s data

Today’s data: One way to look at it

Today’s data: Another equivalent way

Let’s play with the numbers

How different are the differences between groups?

Let’s model it!

First: Set up factor levels, check contrast coding

The interaction’s contrast is the product of the two interacting contrasts

Think–Pair–Share: Cat/cat interaction coefs

Fitting the model

Interpreting the model (1)

Interpreting the model (2)

Mapping model coefficients to group means

Visualising the model’s estimates (1)

Visualising the model’s estimates (2)

Calculating simple effects

“Simple effects”? “Simple slopes”?

Calculating simple effects by hand

The simple effect of transport for rural roads

The simple effect of transport for city streets

What about simple effects of road_type?

Interactions with 2x3 data, treatment-coded

Interactions with 2x3 data, treatment-coded

Visualise the data

Set up the data

In general, how many interaction terms will a model have?

Each interaction’s contrast is the product of the two interacting contrasts (1)

Each interaction’s contrast is the product of the two interacting contrasts (2)

Model time

Fit the model

Interpreting the model (1)

Interpreting the model (2)

Interpreting the model (3)

Visualising the model’s estimates with cat_plot()

Why do we need more than one interaction term to capture how these predictors interact?

The big picture: Interactions

The big picture: Interactions

Beyond two-way interactions

Back matter

Revisiting this week’s learning objectives

Revisiting this week’s learning objectives

This week

Appendix

Key insight: The difference between differences is the same both ways

Difference of differences 1

Difference of differences 2

Mapping 2x3 model coefs to group means

The simple effect of `transport` for rural roads

The simple effect of `transport` for city streets

What about simple effects of `road_type`?

Visualising the model’s estimates with `cat_plot()`