Block 1 Analysis & Write-Up Example

Learning Objectives

At the end of this lab, you will:

  1. Understand how to write-up and provide interpretation of linear models with single and multiple predictors.

What You Need

  1. Be up to date with lectures
  2. Have completed Labs 1 - 4

Required R Packages

Remember to load all packages within a code chunk at the start of your RMarkdown file using library(). If you do not have a package and need to install, do so within the console using install.packages(" "). For further guidance on installing/updating packages, see Section C here.

For this lab, you will need to load the following package(s):

  • tidyverse
  • psych
  • patchwork
  • sjPlot
  • kableExtra

Lab Data

You can download the data required for this lab here or read it in via this link https://uoepsy.github.io/data/FOMOdataset.csv

Section A: Write-Up

In this lab you will be presented with the output from a statistical analysis, and your job will be to write-up and present the results. We’re going to use a simulated dataset based on a paper (the same one you have worked on in lectures this week) concerning Fear of Missing Out (FoMO), socio-demographic factors, and the Big Five personality traits.

The aim in writing should be that a reader is able to more or less replicate your analyses without referring to your R code. This requires detailing all of the steps you took in conducting the analysis.
The point of using RMarkdown is that you can pull your results directly from the code. If your analysis changes, so does your report!

Make sure that your final report doesn’t show any R functions or code. Remember you are interpreting and reporting your results in text, tables, or plots, targeting a generic reader who may use different software or may not know R at all. If you need a reminder on how to hide code, format tables, etc., make sure to review the rmd bootcamp.

Important - Write-Up Examples & Plagiarism

The example write-up sections included below are not perfect - they instead should give you a good example of what information you should include within each section, and how to structure this. For example, some information is missing (e.g., description of data checks, interpretation of descriptive statistics), some information could be presented more clearly (e.g., variable names in tables, table/figure titles/captions, and rationales for choices), and writing could be more concise in places (e.g., discussion section could be more succinct and more focused on the research questions in places).

Further, you must not copy any of the write-up included below for future reports - if you do, you will be committing plagiarism, and this type of academic misconduct is taken very seriously by the University. You can find out more here.

Study Overview

Research Aim

Explore the associations among Fear of Missing Out (FoMO), age, social media networks, and the Big Five personality traits.

Research Questions

  • RQ1: Does age predict FoMO?
  • RQ2: Does the number of Instagram followers explain a significant amount of variance in FoMO over and above age?
  • RQ3: Does personality predict FoMO?
FoMO data codebook.

Setup

Setup
  1. Create a new RMarkdown file
  2. Load the required package(s)
  3. Read the FOMO dataset into R, assigning it to an object named fomo

#Loading the required package(s)
library(tidyverse)
library(patchwork)
library(sjPlot)
library(kableExtra)
library(psych)

#Reading in fomo data and storing in object named 'fomo'
fomo <- read_csv("https://uoepsy.github.io/data/FOMOdataset.csv")

#check first six rows
head(fomo)
# A tibble: 6 × 8
   FOMO   Age     N     E     O     A     C TotalFollowers
  <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>          <dbl>
1    28    38    26    30    32    27    38             98
2    26    30    23    27    33    32    30            192
3    23    33    14    30    36    27    24            177
4    18    44    26    21    37    28    34            119
5    19    43    32    21    41    35    40            278
6    24    43    22    25    31    30    24            184


Provided Analysis Code

Below you will find the code required to conduct the analysis to address the research questions. This should look similar (in most areas) to what you worked through in lecture.

Provided Analysis Code

The 3-Act Structure

We need to present our report in three clear sections - think of your sections like the 3 key parts of a play or story - we need to (1) provide some background and scene setting for the reader, (2) present our results in the context of the research question, and (3) present a resolution to our story - relate our findings back to the question we were asked and provide our answer.

Act I: Analysis Strategy

Question 1

Attempt to draft an analysis strategy section based on the above research question and analysis provided.

Your analysis strategy will contain a number of different elements detailing plans and changes to your plan. Remember, your analysis strategy should not contain any results. You may wish to include the following sections:

  • Very brief data and design description:
    • Give the reader some background on the context of your write-up. For example, you may wish to describe the data source, data collection strategy, study design, number of observational units.
    • Specify the variables of interest in relation to the research question, including their unit of measurement, the allowed range (for Likert scales), and how they are scored.
  • Data management:
    • Describe any data cleaning and/or recoding.
    • Are there any observations that have been excluded based on pre-defined criteria? How/why, and how many?
    • Describe any transformations performed to aid your interpretation (i.e., mean centering, standardisation, etc.)
  • Model specification:
    • Clearly state your hypotheses and specify your chosen significance level (i.e., \(\alpha\)).
    • What type of statistical analysis do you plan to use to answer the research question? (e.g., simple linear regression, multiple linear regression, binary logistic regression (which you will see later in this course), etc.)
    • In some cases, you may wish to include some visualisations and descriptive tables to motivate your model specification.
    • Specify the model(s) to be fitted to answer your given research question and analysis structure. Clearly specify the response and explanatory variables included in your model(s).

*** This is not yet a completed list – we will add more to this as we progress through the course (e.g., coding of categorical variables, checking assumptions, diagnostics, etc.)

As noted and encouraged throughout the course, one of the main benefits of using RMarkdown is the ability to include inline R code in your document. Try to incorporate this in your write up so you can automatically pull the specified values from your code. If you need a reminder on how to do this, see Lesson 3 of the Rmd Bootcamp.

The dataset contained information on 3370 participants, including scores on 5 personality traits assessed by the a reduced version of the 45 item Big Five Inventory (BFI; Openness, Conscientiousness, Extraversion, Agreeableness and Neuroticism), where 43 items were used. Participants were also assessed on the FoMO scale - a 10 item measure which assessed the extent to which individuals experienced apprehension regarding missing out on interesting events of others. Items on both the BFI and FoMO scales were measured on a 5-point scale, meaning that scores on the BFI could range 8-40 for Neuroticism, 8-40 for Extraversion, 10-50 for Openness to Experience, 8-40 for Agreeableness, and 9-45 for Conscientiousness; and scores on the FoMO scale could range 10-50. For both of these measures, only total scores were available. Two socio-demographic questions were also asked of all participants - their age (in years) and total number of Instagram followers.

The aim of this report was to address three research questions:

  1. Does age predict FoMO?
  2. Does the total number of people one follows on Instagram predict FoMO over and above age?
  3. Does personality predict FoMO?

To examine the associations among the variables of interest, a scatter plot of matrices (SPLOM) was used. This allowed for visual examination of marginal distributions via histograms, and bivariate associations between variables via scatter plots. The SPLOM also provided bivariate correlations between each of the variables.

To address RQ1, the following simple linear regression model was used:

\[ \text{FoMO} = \beta_0 + \beta_1 \cdot \text{Age} + \epsilon \]

where we tested whether there was a significant association between FoMO and age. Formally, this corresponded to testing whether the age coefficient was equal to zero:

\[ H_0: \beta_1 = 0 \]

\[ H_1: \beta_1 \neq 0 \]

To address RQ2, the two following models were used: \[ \text{FoMO} = \beta_0 + \beta_1 \cdot \text{Age} + \epsilon \]

\[ \text{FoMO} = \beta_0 + \beta_1 \cdot \text{Age} + \beta_2 \cdot \text{Number of Instagram Followers} + \epsilon \]

To determine whether the model with the number of Instagram followers and age was a significantly better fitting than the model with age alone, we used an incremental \(F\)-test to formally test the following hypothesis:

\[ H_0: \beta_2 = 0 \]

\[ H_1: \beta_2 \neq 0 \]

To address RQ3, the following multiple linear regression model was used, where all measures were \(z\)-scored:

\[ \text{FoMO} = \beta_0 + \beta_1 \cdot \text{N} + \beta_2 \cdot \text{E} + \beta_3 \cdot \text{O} + \beta_4 \cdot \text{C} + \beta_5 \cdot \text{A} + \epsilon \quad \\ \]

\[ \begin{align} & \text{Where:} \\ & \text{FoMO} = \text{FoMO, z-scored} \\ & \text{N} = \text{Neuroticism, z-scored} \\ & \text{E} = \text{Extraversion, z-scored} \\ & \text{O} = \text{Openness, z-scored} \\ & \text{C} = \text{Conscientiousness, z-scored} \\ & \text{A} = \text{Agreeableness, z-scored} \\ \end{align} \]

To investigate whether personality predicted FoMO, this corresponded to testing whether each of the Big 5 personality traits were equal to zero:

\[ H_0: \text{All}~~ \beta_j = 0 ~\text{(for j = 1, 2, 3, 4, 5)} \] \[ H_1: \text{At least one}~ \beta_j \neq \text{(for j = 1, 2, 3, 4, 5)} \]

All participant data was complete (no missing values), with scores on the FoMO and BFI scales within possible ranges. Throughout the report, effects were considered statistically significant at \(\alpha = .05\).

Act II: Results

Question 2

Attempt to draft a results section based on your detailed analysis strategy and the analysis provided.

The results section should follow from your analysis strategy. This is where you would present the evidence and results that will be used to answer the research questions and can support your conclusions. Make sure that you address all aspects of the approach you outlined in the analysis strategy.

In this section, it is useful to include tables and/or plots to clearly present your findings to your reader. It is important, however, to carefully select what is the key information that should be presented. You do not want to overload the reader with unnecessary or duplicate information (e.g., do not present print outs of the head of a dataset, or the same information in tables and plots, etc.), and you also want to save space in case there is a page limit. Make use of figures with multiple panels where you can.

As a broad guideline, you want to start with the results of any exploratory data analysis, presenting tables of summary statistics and exploratory plots. You may also want to visualise associations between/among variables and report covariances or correlations. Then, you should move on to the results from your model.

*** This is not yet a completed list – we will add more to this as we progress through the course (e.g., evaluation of diagnostic plots, etc.)

Descriptive statistics are displayed in Table 1.

Table 1: FoMO, Socio-Demographic Factors, and Personality Trait Descriptive Statistics
FoMO, Socio-Demographic Factors, and Personality Traits Descriptive Statistics
N Mean SD Minimum Maximum
Fear of Missing Out 3370 24.63 6.42 10 46
Age (in years) 3370 33.61 10.42 12 75
Number of Instagram Followers 3370 203.25 93.42 1 594
Neuroticism 3370 22.92 5.77 8 40
Extraversion 3370 25.88 5.94 8 40
Openness 3370 37.61 5.80 14 50
Conscientiousness 3370 31.04 5.49 13 45
Agreeableness 3370 30.89 4.93 13 43

Bivariate correlations showed a moderate negative association between FoMO and age; moderate positive associations between FoMO and Neuroticism and between FoMO and number of Instagram followers; and weak associations between FoMO and the remaining Big 5 personality variables (see Figure 1).

Figure 1: Bivariate scatter plots (below diagonal), histograms (diagonal), and Pearson correlation coefficient (above diagonal) for FoMO, Socio-Demographic Factors, and Personality Traits

In relation to RQ1, full regression results, including 95% Confidence Intervals, are shown in Table 2. This model was significant (\(F(1, 3368)=380.1, p <.001)\), and results indicated that age was a significant predictor of FoMO \((\beta = -0.2, CI_{95} = [-0.22, -0.18], SE = 0.01, p <.001)\). Age explained approximately 10% of the variance in FoMO scores. For every one-year increase in age, participants’ FoMO scores decreased by 0.20 points. This suggested that older participants experienced less FoMO than younger participants (see Figure 2(a)).

Table 2: RQ1 - Regression Table for FoMO Model
RQ1 - Regression Table for FoMO Model
  FoMO
Predictors Estimates CI p
(Intercept) 31.22 30.53 – 31.92 <0.001
Age (in years) -0.20 -0.22 – -0.18 <0.001
Observations 3370
R2 / R2 adjusted 0.101 / 0.101
Figure 2: (a) Association between FoMO and Age (b) Association between FoMO and Instagram Followers

As presented in Table 3, we found evidence that an individual’s total number of followers on Instagram and age provided a significantly better fitting model predicting FoMO over age alone \((F(1 ,3367) = 267.04, p<.001)\). Together, age and the number of Instagram followers explained approximately 17% of the variance in FoMO scores.

Table 3: RQ2 - Model Comparison - fomo_mdl1 vs fomo_mdl2
Model Comparison - fomo_mdl1 vs fomo_mdl2
Res.Df RSS Df Sum of Sq F Pr(>F)
3368 124676.4 NA NA NA NA
3367 115515.0 1 9161.44 267.04 7.98e-58

In order to address the third research question, all scores were converted to \(z\)-scores. The Big 5 personality traits significantly predicted FoMO scores (\(F(5, 3364) = 184.53, p < .001\)). The personality traits collectively explained approximately 21% of variance in FoMO scores. Full model results are presented in Table 4. For every 1 standard deviation unit increase in Neuroticism scores, FoMO scores increased by 0.43 standard deviations after controlling for other personality traits. When accounting for other personality traits, for every 1 standard deviation unit increase in Conscientiousness scores, FoMO scores decreased by 0.31 standard deviations. For every 1 standard deviation unit increase in Agreeableness scores, FoMO scores decreased by 0.09 standard deviations after controlling for other personality traits. Neither Extraversion \((p = .27)\) nor Openness \((p = .47)\) were significant predictors of FoMO.

Table 4: RQ3 - Regression Table for FoMO Model with Personality
RQ3 - Regression Table for FoMO Model with Personality
  FoMO (Z-Scored)
Predictors Estimates CI p
(Intercept) -0.00 -0.03 – 0.03 1.000
Neuroticism (Z-Scored) 0.43 0.39 – 0.46 <0.001
Extraversion (Z-Scored) 0.02 -0.01 – 0.05 0.267
Openness (Z-Scored) 0.01 -0.02 – 0.04 0.471
Conscientiousness
(Z-Scored)
-0.31 -0.34 – -0.28 <0.001
Agreeableness (Z-Scored) -0.09 -0.12 – -0.05 <0.001
Observations 3370
R2 / R2 adjusted 0.215 / 0.214

Associations between FoMO and the three significant personality traits (whilst holding the others constant) are shown in Figure 3.

Figure 3: Predicted FoMO Scores by Neuroticsm, Conscientiousness, and Agreeableness after accounting for other Big 5 Traits

Act III: Discussion

Question 3

Attempt to draft a discussion section based on your results and the analysis provided.

In the discussion section, you should summarise the key findings from the results section and provide the reader with a few take-home sentences drawing the analysis together and relating it back to the original question.

The discussion should be relatively brief, and should not include any statistical analysis - instead think of the discussion as a conclusion, providing an answer to the research question(s).

Previous research had identified an association between an individual’s fear of missing out, various socio-demographic factors, and personality. In this analysis, we examined the associations of FoMO with age, total number of Instagram followers, and personality.

Our results suggested that: (1) age was a significant predictor of FoMO, where older age was associated with lower levels of FoMO; (2) including the total number of Instagram followers as well as age resulted in a significantly better fitting model for explaining FoMO scores (where more followers were associated with higher levels of FoMO); and (3) three of the Big Five personality traits - Neuroticism, Agreeableness, and Conscientiousness - were significant predictors of FoMO (specifically high Neuroticism, low Agreeableness, and low Conscientiousness). Therefore, we can reject the null hypotheses for each of our research questions.

However, it is important to note that we can make no causal claims on the directions of these associations from these data - we cannot conclude that being a young person who is highly Neurotic and low on Agreeableness and Conscientiousness with a large number of Instagram followers will cause feelings of FoMO.


Section B: Block 1 (Weeks 1 - 4) Recap

In the second part of the lab, there is no new content - the purpose of the recap section is for you to revisit and revise the concepts you have learned over the last 4 weeks.

We would encourage you to complete any outstanding work on these exercises (e.g., complete partial write-ups), and review solutions. Doing so will allow you to have good quality materials to refer to during the assessed report (released in Semester 2).

Given that we are now \(\frac{1}{4}\) of the way through the DAPR2 course, we would also strongly encourage you to start creating your revision materials in advance of the exam. You can access all the flashcards that you’ve been presented with in this block here. These will provide a good starting point for collating your notes together on the block 1 content. We also suggest that you review your weekly quiz feedback (as many of you have learned in Psychology 2A, it is important to provide feedback to allow learners to improve their learning and retention of information, as well as correct any misunderstandings!).