3. Assumptions and Diagnostics | Centering
Exercises: Assumptions & Diagnostics
Data: Wellbeing Across Scotland
For this next set of exercises, we continue with our recurring study, in which researchers want to look at the relationship between time spent outdoors and mental wellbeing across all of Scotland. Data were collected from 20 of the Local Authority Areas, and are accessible at https://uoepsy.github.io/data/LAAwellbeing.csv.

variable | description |
---|---|
ppt | Participant ID |
name | Participant Name |
laa | Local Authority Area |
outdoor_time | Self-reported estimate of the number of hours per week spent outdoors |
wellbeing | Warwick-Edinburgh Mental Wellbeing Scale (WEMWBS), a self-report measure of mental health and well-being. The scale is scored by summing responses to each item, with items answered on a 1 to 5 Likert scale. The minimum scale score is 14 and the maximum is 70. |
density | LAA population density (people per square km) |
The code below will read in the data and fit the model with by-LAA random intercepts and slopes of outdoor time.
library(tidyverse)
library(lme4)
scotmw <- read_csv("https://uoepsy.github.io/data/LAAwellbeing.csv")
rs_model <- lmer(wellbeing ~ 1 + outdoor_time + (1 + outdoor_time | laa), data = scotmw)
- Plot the residuals vs fitted values, and assess the extent to which the assumption that the residuals have a mean of zero holds.
- Construct a scale-location plot. This is where the square-root of the absolute value of the standardised residuals is plotted against the fitted values, and allows you to more easily assess the assumption of constant variance.
- Optional: can you create the same plot using ggplot, starting with the augment() function from the broom.mixed package?
- Hint: plot(model) will give you this plot, but you might want to play with the type = c(......) argument to get the smoothing line.
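For the optional ggplot version, here is a minimal sketch, assuming the rs_model object fitted above (dividing the residuals by their standard deviation is used here as a rough stand-in for full standardisation):

library(broom.mixed)
augment(rs_model) %>%
  mutate(sqrt_abs_std_resid = sqrt(abs(.resid / sd(.resid)))) %>%  # rough standardisation
  ggplot(aes(x = .fitted, y = sqrt_abs_std_resid)) +
  geom_point() +
  geom_smooth() +
  labs(x = "Fitted values", y = "sqrt(|standardised residuals|)")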
Examine the normality of both the level 1 and level 2 residuals.
- Use hist() if you like, or qqnorm(residuals) followed by qqline(residuals).
- Extracting the level 2 residuals (the random effects) can be difficult. ranef(model) will get you some of the way.
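As a sketch (again assuming rs_model from above), the level 1 residuals come from resid(), and ranef() returns the level 2 residuals as one data frame per grouping factor:

# level 1 residuals:
qqnorm(resid(rs_model)); qqline(resid(rs_model))

# level 2 residuals: by-LAA intercepts and slopes
re_laa <- ranef(rs_model)$laa
qqnorm(re_laa$`(Intercept)`); qqline(re_laa$`(Intercept)`)
qqnorm(re_laa$outdoor_time); qqline(re_laa$outdoor_time)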
- Which person in the dataset has the greatest influence on our model?
- For which person is the model fit the worst (i.e., who has the highest residual)?
- Which LAA has the greatest influence on our model?
- As well as hlm_influence() in the HLMdiag package, there is another nice function, hlm_augment().
- We can often end up in confusion because the \(i^{th}\) observation inputted to our model (and therefore the \(i^{th}\) observation of the hlm_influence() output) might not be the \(i^{th}\) observation in our original dataset - there may be missing data! (Luckily, we have no missing data in this dataset.)
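A minimal sketch of obtaining influence diagnostics with HLMdiag, assuming rs_model from above (cooksd is one of several diagnostics the function returns):

library(HLMdiag)
# influence of individual observations (level 1):
hlm_influence(rs_model, level = 1) %>% arrange(desc(cooksd))
# influence of each LAA (level 2 grouping):
hlm_influence(rs_model, level = "laa") %>% arrange(desc(cooksd))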
- Looking at the random effects, which LAA shows the least benefit to wellbeing as outdoor time increases, and which shows the greatest benefit?
- What is the estimated wellbeing for people from the City of Edinburgh with zero hours of outdoor time per week, and what is their associated increase in wellbeing for every additional hour per week of outdoor time?
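As a hint, a one-line sketch: coef() combines the fixed effects with each LAA's random effects, giving the LAA-specific intercepts and slopes (this assumes the LAA is labelled "City of Edinburgh" in the data):

coef(rs_model)$laa["City of Edinburgh", ]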
Exercises: Centering in the MLM
We have some data from a study investigating how perceived persuasiveness of a speaker is influenced by the rate at which they speak.
dap2 <- read_csv("https://uoepsy.github.io/data/dapr2_2122_report1.csv")
We can fit a simple linear regression (one predictor) to evaluate how speech rate (variable sp_rate in the dataset) influences perceived persuasiveness (variable persuasive in the dataset). There are various ways in which we can transform the predictor variable sp_rate, which in turn can alter the interpretation of some of our estimates:
Raw X
m1 <- lm(persuasive ~ sp_rate, data = dap2)
summary(m1)$coefficients
Estimate Std. Error t value Pr(>|t|)
(Intercept) 55.532060 6.4016670 8.674625 6.848945e-15
sp_rate -0.190987 0.4497113 -0.424688 6.716809e-01
The intercept and the coefficient for speech rate are interpreted as:
(Intercept): An audio clip of someone speaking at zero phones per second is estimated as having an average persuasive rating of 55.53.
sp_rate: For every increase of one phone per second, perceived persuasiveness is estimated to decrease by 0.19.
Mean-Centered X
We can mean center our predictor and fit the model again:
dap2 <- dap2 %>% mutate(sp_rate_mc = sp_rate - mean(sp_rate))
m2 <- lm(persuasive ~ sp_rate_mc, data = dap2)
summary(m2)$coefficients
Estimate Std. Error t value Pr(>|t|)
(Intercept) 52.874667 1.3519418 39.110165 6.429541e-80
sp_rate_mc -0.190987 0.4497113 -0.424688 6.716809e-01
(Intercept): An audio clip of someone speaking at the mean phones per second is estimated as having an average persuasive rating of 52.87.
sp_rate_mc: For every increase of one phone per second, perceived persuasiveness is estimated to decrease by 0.19.
Standardised X
We can standardise our predictor and fit the model yet again:
dap2 <- dap2 %>% mutate(sp_rate_z = scale(sp_rate))
m3 <- lm(persuasive ~ sp_rate_z, data = dap2)
summary(m3)$coefficients
Estimate Std. Error t value Pr(>|t|)
(Intercept) 52.874667 1.351942 39.110165 6.429541e-80
sp_rate_z -0.576077 1.356471 -0.424688 6.716809e-01
(Intercept): An audio clip of someone speaking at the mean phones per second is estimated as having an average persuasive rating of 52.87.
sp_rate_z: For every increase of one standard deviation in phones per second, perceived persuasiveness is estimated to decrease by 0.58.
Remember that scale(sp_rate) subtracts the mean from each value, then divides the result by the standard deviation. The standard deviation of dap2$sp_rate is:
sd(dap2$sp_rate)
[1] 3.016315
so in our variable dap2$sp_rate_z, a change of 3.02 gets scaled to be a change of 1 (because we are dividing by sd(dap2$sp_rate)).
coef(m1)[2] * sd(dap2$sp_rate)
sp_rate
-0.576077
coef(m3)[2]
sp_rate_z
-0.576077
Note that these models are identical. When we conduct a model comparison between the three models, the residual sum of squares is identical for all of them:
anova(m1,m2,m3)
Analysis of Variance Table
Model 1: persuasive ~ sp_rate
Model 2: persuasive ~ sp_rate_mc
Model 3: persuasive ~ sp_rate_z
Res.Df RSS Df Sum of Sq F Pr(>F)
1 148 40576
2 148 40576 0 0
3 148 40576 0 0
What changes when you center or scale a predictor in a standard regression model (one fitted with lm())?
- The variance explained by the predictor remains exactly the same
- The intercept will change to be the estimated mean outcome where that predictor is “0”. Scaling and centering changes what “0” represents, thereby changing this estimate (the significance test will therefore also change because the intercept now has a different meaning)
- The slope of the predictor will change according to any scaling (e.g. if you divide your predictor by 10, the slope will multiply by 10).
- The test of the slope of the predictor remains exactly the same.
Data: Hangry
The study is interested in evaluating whether hunger influences people's levels of irritability (i.e., "the hangry hypothesis"), and whether this is different for people following a diet that includes fasting. 81 participants were recruited into the study. Once a week for 5 consecutive weeks, participants were asked to complete two questionnaires, one assessing their level of hunger, and one assessing their level of irritability. The time and day at which participants were assessed was a randomly chosen hour between 7am and 7pm each week. 46 of the participants were following a five-two diet (five days of normal eating, two days of fasting), and the remaining 35 were following no specific diet.
The data are available at: https://uoepsy.github.io/data/hangry.csv.
variable | description |
---|---|
q_irritability | Score on irritability questionnaire (0:100) |
q_hunger | Score on hunger questionnaire (0:100) |
ppt | Participant |
fivetwo | Whether the participant follows the five-two diet |
Read carefully the description of the study above, and try to write out (in lmer syntax) an appropriate model to test the research aims.
e.g.:
outcome ~ explanatory variables + (???? | grouping)
Try to think about the maximal random effect structure (i.e. everything that can vary by-grouping is estimated as doing so).
To help you think through the steps to get from a description of a research study to a model specification, think about your answers to the following questions.
Q: What is our outcome variable?
Q: What are our explanatory variables?
Q: Is there any grouping (or “clustering”) of our data that we consider to be a random sample? If so, what are the groups?
- The research is looking at how hunger influences irritability, and whether this is different for people on the five-two diet.
- We can split our data into groups of each participant. We can also split it into groups of each diet. Which of these groups have we randomly sampled? Do we have a random sample of participants? Do we have a random sample of diets? Another way to think of this is "if I repeated the experiment, would these groups be different?" One possible specification is sketched below.
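For concreteness, a hedged sketch of one such maximal specification, assuming the data have been read into an object called hangry (diet cannot vary within a participant, so only the intercept and the effect of hunger can vary by ppt):

hangry <- read_csv("https://uoepsy.github.io/data/hangry.csv")
# hunger varies within participants; fivetwo varies only between them
maxmod <- lmer(q_irritability ~ q_hunger * fivetwo + (1 + q_hunger | ppt), data = hangry)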
Recall our research aim:
… whether hunger influences peoples’ levels of irritability (i.e., “the hangry hypothesis”), and whether this is different for people following a diet that includes fasting.
Forgetting about any differences due to diet, let’s just think about the relationship between irritability and hunger. How should we interpret this research aim?
Was it:
a. “Are people more irritable if they are, on average, more hungry than other people?”
b. “Are people more irritable if they are, for them, more hungry than they usually are?”
c. Some combination of both a. and b.
This is just one demonstration of how the statistical methods we use can constitute an integral part of our development of a research project, and part of the reason that data analysis for scientific research cannot be so easily outsourced after designing the study and collecting the data.
As our data is currently stored, the relationship between irritability and the raw scores on the hunger questionnaire q_hunger represents some ‘total effect’ of hunger on irritability. This is a bit like interpretation c. above - it’s a composite of both the ‘within’ (b.) and ‘between’ (a.) effects. The problem with this is that the ‘total effect’ isn’t necessarily all that meaningful. It may tell us that ‘being higher on the hunger questionnaire is associated with being more irritable’, but how can we apply this information? It is not specifically about the comparison between hungry people and less hungry people, and nor is it about how person i changes when they are more hungry than usual. It is both these things smushed together.
To disaggregate the ‘within’ and ‘between’ effects of hunger on irritability, we can group-mean center. For ‘between’, we are interested in how irritability is related to the average hunger level of a participant, and for ‘within’, we are asking how irritability is related to a participant’s relative level of hunger (i.e., how far above/below their own average hunger level they are).
Add to the data these two columns:
- a column which contains the average hungriness score for each participant.
- a column which contains the deviation of each hunger score from that person’s average hunger score.
You’ll find group_by() %>% mutate() very useful here. A sketch is given below.
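A minimal sketch of the group-mean centering, using the column names avg_hunger and hunger_gc so that it lines up with the model shown later:

hangry <- hangry %>%
  group_by(ppt) %>%
  mutate(
    avg_hunger = mean(q_hunger),         # participant's average hunger
    hunger_gc  = q_hunger - avg_hunger   # deviation from their own average
  ) %>%
  ungroup()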
For each of the new variables you just added, plot the irritability scores against those variables.
- Does it look like hungry people are more irritable than less hungry people?
- Does it look like when people are more hungry than normal, they are more irritable?
We have taken the raw hunger scores and separated them into two parts (raw hunger scores = participants’ average hunger score + observation level deviations from those averages), that represent two different aspects of the relationship between hunger and irritability.
Adjust your model specification to include these two separate variables as predictors, instead of the raw hunger scores.
- hunger * diet could be replaced by (hunger1 + hunger2) * diet, thereby allowing each aspect of hunger to interact with diet.
- We can only put one of these variables in the random effects: (1 + hunger | participant). Recall that above we discussed how we cannot have (diet | participant), because “an effect of diet” makes no sense for a single participant (they are either on the diet or they are not, so there is no ‘effect’). Similarly, each participant has only one value for their average hungriness.
Hopefully, you have fitted a model similar to the below:
hangrywb <- lmer(q_irritability ~ (avg_hunger + hunger_gc) * fivetwo +
                   (1 + hunger_gc | ppt), data = hangry,
                 control = lmerControl(optimizer = "bobyqa"))
Below, we have obtained p-values using the Kenward-Roger approximation of \(df\) for the tests of whether the fixed effects are zero, so we can see the significance of each estimate.
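As a sketch, one way to obtain such tests is the parameters-package route listed in the table at the end of this page:

library(parameters)
model_parameters(hangrywb, ci_method = "kr")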
Provide an answer for each of these questions:
- For those following no diet, is there evidence to suggest that people who are on average more hungry are more irritable?
- Is there evidence to suggest that this is different for those following the five-two diet? In what way?
- Do people following no diet tend to be more irritable when they are more hungry than they usually are?
- Is there evidence to suggest that this is different for those following the five-two diet? In what way?
- (Trickier:) What does the fivetwo coefficient represent?
Model Summary

Parameter | Coefficient | SE | 95% CI | t | df | p |
---|---|---|---|---|---|---|
Fixed Effects | | | | | | |
(Intercept) | 17.13 | 5.21 | (6.75, 27.51) | 3.29 | 77.00 | 0.002 |
avg hunger | 3.86e-03 | 0.11 | (-0.21, 0.22) | 0.04 | 77.00 | 0.971 |
hunger gc | 0.19 | 0.08 | (0.03, 0.34) | 2.45 | 70.40 | 0.017 |
fivetwo (1) | -10.85 | 6.62 | (-24.03, 2.32) | -1.64 | 77.00 | 0.105 |
avg hunger × fivetwo (1) | 0.47 | 0.14 | (0.20, 0.74) | 3.44 | 77.00 | < .001 |
hunger gc × fivetwo (1) | 0.38 | 0.10 | (0.18, 0.58) | 3.75 | 73.64 | < .001 |
Random Effects | | | | | | |
SD (Intercept: ppt) | 6.93 | | | | | |
SD (hunger_gc: ppt) | 0.38 | | | | | |
Cor (Intercept~hunger_gc: ppt) | -0.01 | | | | | |
SD (Residual) | 4.83 | | | | | |
Construct two plots showing the two model-estimated interactions. Think about your answers to the previous question, and check that they match with what you are seeing in the plots (do not underestimate the utility of this activity for helping understanding!).
This isn’t as difficult as it sounds. The sjPlot package can do it in one line of code!
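For instance, a minimal sketch (plot_model's type = "int" plots the model's interaction terms):

library(sjPlot)
plot_model(hangrywb, type = "int")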
Provide tests or confidence intervals for the parameters of interest, and write-up the results.
| | df approximations | likelihood-based |
---|---|---|
tests or CIs for model parameters | library(parameters); model_parameters(model, ci_method="kr") | confint(model, type="profile") |
model comparison (different fixed effects, same random effects) | library(pbkrtest); KRmodcomp(model1, model0) | anova(model0, model1) |
| | fit models with REML=TRUE. Good option for small samples. | fit models with REML=FALSE. Needs large N at both levels (40+). |
Other within-group transformations
As well as within-group mean centering a predictor (like we have done above), we can within-group standardise a predictor. This would also disaggregate within and between effects, but the interpretation of the within effect would be the estimated change in \(y\) associated with being 1 standard deviation higher in \(x\) for that group.
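As a sketch, continuing with the hangry data from above (the column name hunger_gz is a hypothetical choice for illustration):

hangry <- hangry %>%
  group_by(ppt) %>%
  mutate(hunger_gz = (q_hunger - mean(q_hunger)) / sd(q_hunger)) %>%  # within-person z-score
  ungroup()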