Week 5. Testing differences between groups

Written by Padraic Monaghan

5.1 Overview

This week, there are two mini lectures, and the practical workbook working with R-studio.

Before the practical on Tuesday, please try to work through the practical workbook in your group.

Bring your questions (and/or answers) to the practical.

5.2 Learning Goals

Understand when t-tests are appropriately applied to data
Understand the distinction between paired and independent t-tests
Interpret p-values from t-tests
Determine how a paired t-test is calculated
Determine how an independent t-test is calculated
Understand effect sizes for t-tests
Be able to effectively interpret t-test results
Be able to accurately present t-test results in research reports

5.3 Lectures and slides

5.3.1 Lectures

Watch Lecture week 5 part 1:

Watch Lecture week 5 part 2:

Take the quiz (not assessed) on the lecture materials.

5.3.2 Slides

Download the lecture slides for:

5.4 Practical Materials

5.4.1 Workbook

Part 1 is some revision from the last 4 weeks.
Part 2 covers running an independent t-test.
Part 3 covers running a paired t-test.
Part 4 provides more practice running paired t-tests.
Part 5 presents some extras - exploring different datasets and running t-tests on those data.

5.4.1.1 Part 1: Revision

Task 1: Checklist: What I can now do

You should be able to answer yes to all the following. If you can’t yet, go back to the previous workbooks and repeat your working until you can answer yes, being able to type in and run the commands without referring to your notes.

I can open R-studio
I can open new libraries using library()
I can make an R script file
I can input a file into an object in R-studio using read_csv()
I can join two files together using inner_join()
I can select certain variables from an object using select()
I can select subsets of data using filter() (e.g., I can select participants in two conditions from a data set containing participants in four conditions)
I can make new variables using mutate()
I can arrange data according to subsets using group_by()
I can change format of data from wide to long format using pivot_longer
I can change format of data from long to wide format using pivot_wider
I can produce summaries of means and standard deviations for subsets of data after applying group_by() using summarise()
I can draw histograms of single variables, point plots of two ratio/interval/ordinal variables, bar plots of counts, and box plots of one categorical and one ratio/interval/ordinal variable using ggplot()
I can run a Chi-squared test and Cramer’s V test using chisq.test() and cramersV()
I can interpret the results of a Chi-squared test and Cramer’s V test and write up a simple report of the results.
I can save an R script file.

ANSWER

Here are some examples of the commands/functions in use:

rm(list=ls())
library(tidyverse)

Warning: package 'tidyr' was built under R version 4.1.1

Warning: package 'purrr' was built under R version 4.1.1

Warning: package 'stringr' was built under R version 4.1.1

── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.4     ✔ readr     2.1.4
✔ forcats   1.0.0     ✔ stringr   1.5.0
✔ ggplot2   3.5.1     ✔ tibble    3.2.1
✔ lubridate 1.9.2     ✔ tidyr     1.3.0
✔ purrr     1.0.1     
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors

# from week4 practical
dat <- read_csv("data/wk4/PSYC411-shipley-scores-anonymous-17_24.csv");

Rows: 270 Columns: 8
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): english_status, Gender
dbl (6): subject_ID, Age, Shipley_Voc_Score, Gent_1_score, Gent_2_score, aca...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

dat$academic_year <- as.factor(dat$academic_year)
#View(dat)

# from week2 practical
dat2 <- read_csv("data/wk2/ahicesd.csv")

Rows: 992 Columns: 50
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
dbl (50): id, occasion, elapsed.days, intervention, ahi01, ahi02, ahi03, ahi...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

pinfo <- read_csv("data/wk2/participantinfo.csv")

Rows: 295 Columns: 6
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
dbl (6): id, intervention, sex, age, educ, income

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

all_dat <- inner_join(x = dat2, y = pinfo, by = c("id", "intervention"))
summarydata <- select(all_dat, id, ahiTotal, cesdTotal, age, occasion)
dat_17 <- filter(dat, academic_year == "201718")
all_dat2 <- group_by(all_dat, intervention, occasion)
summarise(all_dat2, mean(ahiTotal), sd(ahiTotal), n())

`summarise()` has grouped output by 'intervention'. You can override using the
`.groups` argument.

# A tibble: 24 × 5
# Groups:   intervention [4]
   intervention occasion `mean(ahiTotal)` `sd(ahiTotal)` `n()`
          <dbl>    <dbl>            <dbl>          <dbl> <int>
 1            1        0             68.4           14.0    72
 2            1        1             69.5           13.4    30
 3            1        2             70.3           15.7    38
 4            1        3             75.0           12.8    29
 5            1        4             76.5           16.2    36
 6            1        5             75.5           14.5    27
 7            2        0             68.8           13.0    76
 8            2        1             71.6           12.5    48
 9            2        2             73.0           14.0    48
10            2        3             72.5           14.2    43
# ℹ 14 more rows

ggplot(dat, aes(x = Age)) + geom_histogram()

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Warning: Removed 193 rows containing non-finite outside the scale range
(`stat_bin()`).

ggplot(dat, aes(x = Shipley_Voc_Score, y = Gent_1_score)) + geom_point()

Warning: Removed 10 rows containing missing values or values outside the scale range
(`geom_point()`).

ggplot(dat, aes(x = Shipley_Voc_Score, y = Gent_1_score)) + geom_point() + geom_smooth( method = lm)

`geom_smooth()` using formula = 'y ~ x'

Warning: Removed 10 rows containing non-finite outside the scale range
(`stat_smooth()`).

Warning: Removed 10 rows containing missing values or values outside the scale range
(`geom_point()`).

all_dat$occasion <- as.factor(all_dat$occasion)
ggplot(all_dat, aes(x = occasion, y = ahiTotal)) + geom_boxplot()

dat_long <- pivot_longer(dat, names_to = "test", values_to = "score", cols = c("Gent_1_score", "Gent_2_score"))
# or:
dat_long <- pivot_longer(dat, names_to = "test", values_to = "score", cols = starts_with("Gent"))
dat_wide <- pivot_wider(dat_long, names_from = "test", values_from = "score")

library(lsr)

Warning: package 'lsr' was built under R version 4.1.1

chisq.test(x = all_dat$occasion, y = all_dat$intervention)


    Pearson's Chi-squared test

data:  all_dat$occasion and all_dat$intervention
X-squared = 9.7806, df = 15, p-value = 0.8333

cramersV(x = all_dat$occasion, y = all_dat$intervention)

[1] 0.05732779

5.4.1.2 Part 2: Running an independent t-test

Task 2: Load, prepare, and explore the data

Clear out R using rm(list=ls())
Load again the data set on the Shipley and Gent vocabulary scores from week 4.
Set the research question: do people who self-identify as male or female have different scores on the Gent vocabulary test? The research hypothesis is: “People who identify as male or female have different vocabulary scores”. What is the null hypothesis?

Answer

There is no difference between people who self-identify as male or female on vocabulary scores.

To test the research hypothesis, we will filter people who self-identify as male or female from the data set. To be inclusive, additional research questions would be part of your research project to analyse also people who self-identify as other gender. Run this command to extract a subset of the data (note that the | stands for “or”, and means Gender matches male or gender matches female:

dat2 <- filter(dat, Gender == 'Male' | Gender == 'Female')

Draw a box plot of Gent vocabulary test 1 scores by gender. For a box plot, note that we need data in “long format”, where each observation is on one line, and we have a column that indicates which condition (in this case Gender) the participant is in. Does it look like there might be a gender effect? What is the direction of the effect?

Answer

dat <- read_csv("data/wk4/PSYC411-shipley-scores-anonymous-17_24.csv")

Rows: 270 Columns: 8
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): english_status, Gender
dbl (6): subject_ID, Age, Shipley_Voc_Score, Gent_1_score, Gent_2_score, aca...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

# remember to change academic year to be a nominal/categorical variable, rather than a number:
dat$academic_year <- as.factor(dat$academic_year)
dat2 <- filter(dat, Gender == 'Male' | Gender == 'Female')
ggplot(dat2, aes(x = Gender, y = Gent_1_score)) + geom_boxplot()

Warning: Removed 4 rows containing non-finite outside the scale range
(`stat_boxplot()`).

The graph indicates that maybe males are slightly higher, but looks like a lot of overlap.

Note that unless we had filtered the data, the box plot would contain ‘NA’ as well, which stands for missing data. In a data set it’s always a good idea to call missing data ‘NA’ rather than just leaving them blank because this could be interpreted as a zero or as an error of filling in data. Missing values make things untidy, so it’s good practice to focus only on the variables we need for the t-test and remove all other missing values. Use select() to get just the Gender and Gent_1_score variables, and put this in a new object called ‘dat3’.

Answer

dat3 <- select(dat2, Gender, Gent_1_score)

Next, in order to run a t-test we have to remove any rows of data which contain a ‘NA’ - either in the Gender or the Gent_1_score variables. We do this using drop_na(dat3), put the result in a new object called ‘dat4’. Run this command:

dat4 <- drop_na(dat3)

Now, redraw the box plot from Step 21. Check there are just two groups still.

Answer

dat3 <- select(dat2, Gender, Gent_1_score)
dat4 <- drop_na(dat3)
ggplot(dat2, aes(x = Gender, y = Gent_1_score)) + geom_boxplot()

Warning: Removed 4 rows containing non-finite outside the scale range
(`stat_boxplot()`).

Yes, two groups.

Compute mean and SDs for people who self-identify as male or female on Gent vocabulary test 1 scores.

Hint

Use group_by() and summarise().

Answer

dat5 <- group_by(dat4, Gender)
summarise(dat5, mean(Gent_1_score), sd(Gent_1_score))

# A tibble: 2 × 3
  Gender `mean(Gent_1_score)` `sd(Gent_1_score)`
  <chr>                 <dbl>              <dbl>
1 Female                 58.6               14.0
2 Male                   62.2               14.5

# Or, if you know how to use the %>% (pipe) ...:
dat4 %>% group_by(Gender) %>% summarise(mean(Gent_1_score), sd(Gent_1_score))

# A tibble: 2 × 3
  Gender `mean(Gent_1_score)` `sd(Gent_1_score)`
  <chr>                 <dbl>              <dbl>
1 Female                 58.6               14.0
2 Male                   62.2               14.5

Task 3: Run the independent t-test and measure effect size

Conduct an independent t-test using this command:

t.test(Gent_1_score ~ Gender, data = dat4 )

‘Gent_1_score ~ Gender’ : the ~ can be interpreted as ‘by’, i.e., compute Gent_1_score by Gender

The results should look like this, do yours?

    Welch Two Sample t-test

data:  Gent_1_score by Gender
t = -1.6693, df = 83.298, p-value = 0.09881
alternative hypothesis: true difference in means between group Female and group Male is not equal to 0
95 percent confidence interval:
 -7.982423  0.697279
sample estimates:
mean in group Female   mean in group Male 
            58.57561             62.21818

The key part of the results to look at is the one that has t = -1.6693, df = 83.298, p-value = 0.09881. This is the result that you report: t(83.30) = -1.67, p = .099.

The value is negative because the function includes Female before Male - and Female score is lower than Male score. What matters is how far away from zero the t-test is (either positively or negatively). The df value is slightly odd because the t.test() function figures out degrees of freedom in a technical way which takes into account differences in variance in the data between the two groups. We can just use the value that the t.test() function gives us.

Is this a significant difference?

Answer

No, it isn’t. The p-value is greater than 0.05.

Now we need to compute the effect size, using Cohen’s d. You need to load the library lsr then use this command:

cohensD(Gent_1_score ~ Gender, method = "unequal", data = dat4)

It’s pretty much the same as the t-test() command except that we use ‘method = ’unequal’. For a paired t-test you would use ‘method = ’paired’

What is the effect size? Make a brief report of the results - reporting means and SDs, the t-test, p-value, and Cohen’s d. Discuss your brief report in your group.

Answer

d = 0.27

An ideal brief report of the results would state what the research question/hypothesis is, describe the means and SDs of the groups being compared, and then report the t-test statistic, with p-value and Cohen’s d effect size, then provide a brief interpretation of what the results mean.

Based on Hyde and Linn (1988), we hypothesised that people who self-identify as female may score slightly higher than males in terms of vocabulary scores. Males (mean = 62.2, SD = 14.5) scored higher than females (mean = 58.6, SD = 14.0) on the first time participants attempted the Gent vocabulary test, however this difference was not significant, t(83.30) = -1.67, p = .099, Cohen’s d = 0.27. We did not find evidence to support the hypothesis.

Reference

Hyde, J. S., & Linn, M. C. (1988). Gender differences in verbal ability: A meta-analysis. Psychological Bulletin, 104(1), 53–69. https://doi.org/10.1037/0033-2909.104.1.53

Make sure all commands are in the source window, save them as a new R script file.

Task 4: Practise running another independent t-test

Next research question: do people who are native English speakers have different vocabulary scores than those who learned English as a second language? What is the research hypothesis and the null hypothesis?

Answer

Research Hypothesis: People who speak English as a native language have higher vocabulary scores than those with English as a second language.

Null Hypothesis: There is no difference in vocabulary scores between native English and second language English speakers.

Repeat the Steps 22-30 in Tasks 2 and 3 except using english_status in place of Gender throughout.

Answer

dat <- read.csv("data/wk4/PSYC411-shipley-scores-anonymous-17_24.csv")
dat2 <- select(dat, english_status, Gent_1_score)
dat3 <- drop_na(dat2)
ggplot(dat3, aes(x = english_status, y = Gent_1_score)) + geom_boxplot()

dat4 <- group_by(dat3, english_status)
summarise(dat4, mean(Gent_1_score), sd(Gent_1_score), n())

# A tibble: 2 × 4
  english_status `mean(Gent_1_score)` `sd(Gent_1_score)` `n()`
  <chr>                         <dbl>              <dbl> <int>
1 ESL                            49.1              15.3     95
2 native                         65.2               9.36   163

t.test(Gent_1_score ~ english_status, data = dat3 )


    Welch Two Sample t-test

data:  Gent_1_score by english_status
t = -9.2983, df = 135.56, p-value = 3.256e-16
alternative hypothesis: true difference in means between group ESL and group native is not equal to 0
95 percent confidence interval:
 -19.57080 -12.70599
sample estimates:
   mean in group ESL mean in group native 
            49.09474             65.23313

cohensD(Gent_1_score ~ english_status, method = "unequal", data = dat3 )

[1] 1.27044

Write a brief report of the results, including means and SDs for native speakers and ESL speakers, t-test, p-value, and Cohen’s d. Discuss your report in your group.

Answer

We hypothesised that English as an additional language would result in lower overall vocabulary scores than English as a first language, because of the relation between length of time learning a language and language skills (Davies et al., 2017). Native English speakers (mean = 65.4, SD = 9.8) scored significantly higher on the Gent vocabulary test than speakers of English as a second language (mean = 48.3, SD = 15.7), t(127.79) = -9.08, p < .001, Cohen’s d = 1.30, a large effect size. Thus, the hypothesis was supported, with native English speakers scoring significantly higher on the Gent vocabulary test than those with English as an additional language.

Reference

Davies, R. A. I., Arnell, R., Birchenough, J., Grimmond, D., & Houlson, S. (2017). Reading Through the Life Span: Individual Differences in Psycholinguistic Effects. Journal of Experimental Psychology: Learning, Memory, and Cognition, 43(8), 1298-1338. https://doi.org/10.1037/xlm0000366

Save your R script file.

5.4.1.3 Part 3: Conducting a paired t-test

Task 5: Conducting a paired t-test

Clear out R-studio before we get started again using rm(list=ls())
We are going to investigate again the data from this paper: Woodworth, R.J., O’Brien-Malone, A., Diamond, M.R. and Schuez, B., 2017. Data from, “Web-based Positive Psychology Interventions: A Reexamination of Effectiveness”. Journal of Open Psychology Data, 6(1).

Our research question is whether happiness scores are affected by the interventions. We will look at the pre-test (occasion 0) and the first test after the intervention (occasion 1).

What is the research hypothesis and what is the null hypothesis?

Answer

RH: happiness scores change (increase) from the first to the second occasion of testing.

NH: happiness scores do not change across occasions.

For a paired t-test we can only include data from people who have produced scores at both occasions of testing. So, we need a slightly different version of the data, which you can download here for the ahicesd.csv file and here for the participantinfo2.csv file.

Remind yourself what these data mean.

Once again, join the ahicesd.csv and participantinfo2.csv data in R-studio by aligning the names for the participant numbers in these two data sets (see week 2 workbook for reminders about this).

Answer

library(tidyverse)
dat <- read_csv("data/wk5/ahicesd.csv")

Rows: 992 Columns: 50
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
dbl (50): id, occasion, elapsed.days, intervention, ahi01, ahi02, ahi03, ahi...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

pinfo <- read_csv("data/wk5/participantinfo2.csv")

New names:
Rows: 147 Columns: 7
── Column specification
──────────────────────────────────────────────────────── Delimiter: "," dbl
(7): ...1, id, intervention, sex, age, educ, income
ℹ Use `spec()` to retrieve the full column specification for this data. ℹ
Specify the column types or set `show_col_types = FALSE` to quiet this message.
• `` -> `...1`

all_dat <- inner_join(x = dat, y = pinfo, by = c("id", "intervention"))

Let’s select only the relevant variables. Use select() to select only id, ahiTotal, and occasion variables, and save this as a new object called ‘summarydata’

Answer

summarydata <- select(all_dat, c("id", "ahiTotal", "occasion"))

Use filter to pull out only occasion == 0 or occasion == 1 scores

Hint

use occasion == 0 | occasion == 1'), save this as a new object called summarydata2

Answer

summarydata2 <- filter(summarydata, occasion == 0 | occasion == 1)

Here is where we would usually remove all the NA values, but there aren’t any in this file (so we don’t need drop_na()).
Now, we need to make sure occasion is treated as a categorical variable, rather than a continuous variable, so we need to convert it to a factor:

summarydata2$occasion <-as.factor(summarydata2$occasion)

Now, draw a box plot of ahiTotal scores by occasion (why do we use a box plot?)

Answer

ggplot(summarydata2, aes(x = occasion, y = ahiTotal)) + geom_boxplot()

We use a boxplot because we have one nominal/categorical variable and one interval/ratio/ordinal measure.

Compute mean and SD for each occasion

Answer

ggplot(summarydata2, aes(x = occasion, y = ahiTotal)) + geom_boxplot()

In order to run the paired t-test, we first need to make sure that the paired values (in this case the measures from the same person) are on the same row of the data. So, let’s use pivot_wider to put the ahiTotal scores for occasion 0 in one column, and the ahiTotal scores for occasion 1 in another column:

summarydata2_wide <- pivot_wider(summarydata2, names_from = occasion, values_from = ahiTotal, names_prefix = "occasion_")

Note

Note that we use the names_prefix = “occasion_” as an extra argument to make the column names “occasion_0” and “occasion_1”.

What would happen if we didn’t use names_prefix? It would still work, but it’s a bit more awkward to work with the output.

Then, we can run the paired t-test in the following way:

t.test(summarydata2_wide$occasion_0, summarydata2_wide$occasion_1, paired = TRUE)

Is the result significant?

Answer

yes, because p < .05.

Now run Cohen’s d: it’s similar to the way we do the paired t-test:

cohensD( summarydata2_wide$occasion_0, summarydata2_wide$occasion_1, method =  "paired")

What is the value for Cohen’s d?

Answer

d = 0.4059904

Write up a brief report of the result and discuss in your group.

Answer

We tested whether participants’ happiness scores at first testing after the interventions were different than their scores prior to the interventions. We found that, prior to the intervention, scores were significantly lower (M = 69.3, SD = 12.3) than they were immediately after the interventions (M = 72.4, SD = 12.6), t(146) = = 4.92, p < .001, Cohen’s d = 0.41, a medium effect. The results indicate that the intervention had a positive effect on happiness scores.

Save your R script file.

5.4.1.4 Part 4: More practise running a paired t-test

We are going to figure out whether people have different scores the first and second time they take the Gent vocabulary test.

Go back to the vocabulary scores data. Load the data into dat, and make another object dat2 that contains only the subject_ID, Gent_1_score and Gent_2_score.
Some people did not do all the tests - look at participant 46 for instance. To do a t-test we need data where the person does both tests. We can filter out the scores where there are no NAs by repeating the drop_na we did at step 23, above. Call the new data object dat3.

Answer

dat <- read.csv("data/wk4/PSYC411-shipley-scores-anonymous-17_24.csv"); 
dat$academic_year <- as.factor(dat$academic_year)
dat2 <- select(dat, subject_ID, Gent_1_score, Gent_2_score)
dat3 <- drop_na(dat2)

Run the paired t-test.

Each row contains a score for each person for each Gent test, and so we are ready to run the paired t-test:

Answer

t.test(dat3$Gent_1_score, dat3$Gent_2_score, paired = TRUE)


    Paired t-test

data:  dat3$Gent_1_score and dat3$Gent_2_score
t = -2.2126, df = 246, p-value = 0.02784
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.4488231 -0.1422701
sample estimates:
mean of the differences 
              -1.295547

cohensD( dat3$Gent_1_score, dat3$Gent_2_score, , method =  "paired")

[1] 0.1407865

Get mean and SD of the scores for the tests.

Answer

summarise(dat3, mean(Gent_1_score), sd(Gent_1_score), mean(Gent_2_score), sd(Gent_2_score))

  mean(Gent_1_score) sd(Gent_1_score) mean(Gent_2_score) sd(Gent_2_score)
1           59.16194         13.99441           60.45749         14.89002

In order to draw a box plot of the Gent vocabulary scores taken at the first and second occasion, we will need the data in “long format”, so that there is a column saying which test it is and a column reporting the scores on the Gent tests. Apply the pivot_longer command and then draw the box plot.

Answer

dat4 <- pivot_longer(dat3, names_to = "test", values_to = "score", cols = c("Gent_1_score", "Gent_2_score"))
ggplot(dat4, aes(x = test, y = score)) + geom_boxplot()

5.4.1.5 Part 5: Extras

In the vocabulary scores data, is there a significant difference between males and females for your academic year group?

Answer

dat2 <- filter(dat, academic_year == "202324")
ggplot(dat2, aes(x = Gender, y = Gent_1_score)) + geom_boxplot()

dat3 <- group_by(dat2, Gender)
summarise(dat3, mean(Gent_1_score), sd(Gent_1_score), mean(Gent_2_score), sd(Gent_2_score), n())

# A tibble: 2 × 6
  Gender `mean(Gent_1_score)` `sd(Gent_1_score)` `mean(Gent_2_score)`
  <chr>                 <dbl>              <dbl>                <dbl>
1 Female                 62.2               7.85                 61.6
2 Male                   61.3              13.4                  57.1
# ℹ 2 more variables: `sd(Gent_2_score)` <dbl>, `n()` <int>

# or:
dat2 %>% group_by(Gender) %>% summarise(mean(Gent_1_score), sd(Gent_1_score), mean(Gent_2_score), sd(Gent_2_score), n())

# A tibble: 2 × 6
  Gender `mean(Gent_1_score)` `sd(Gent_1_score)` `mean(Gent_2_score)`
  <chr>                 <dbl>              <dbl>                <dbl>
1 Female                 62.2               7.85                 61.6
2 Male                   61.3              13.4                  57.1
# ℹ 2 more variables: `sd(Gent_2_score)` <dbl>, `n()` <int>

t.test(Gent_1_score ~ Gender, data = dat2 )


    Welch Two Sample t-test

data:  Gent_1_score by Gender
t = 0.16297, df = 7.6522, p-value = 0.8748
alternative hypothesis: true difference in means between group Female and group Male is not equal to 0
95 percent confidence interval:
 -11.68411  13.44601
sample estimates:
mean in group Female   mean in group Male 
            62.16667             61.28571

cohensD(Gent_1_score ~ Gender, method = "unequal", data = dat2 )

[1] 0.08004559

t.test(Gent_2_score ~ Gender, data = dat2 )


    Welch Two Sample t-test

data:  Gent_2_score by Gender
t = 0.58606, df = 6.695, p-value = 0.5771
alternative hypothesis: true difference in means between group Female and group Male is not equal to 0
95 percent confidence interval:
 -13.55749  22.38289
sample estimates:
mean in group Female   mean in group Male 
            61.55556             57.14286

cohensD(Gent_2_score ~ Gender, method = "unequal", data = dat2 )

[1] 0.3007885

nothing significant…

Are there significant differences for the other vocabulary test measures between males and females, or between those with English as first or second language?
The data from this paper are called ClassDraw.csv available here. The data are on osf as well, but they’re not properly formatted, so I adjusted them and put them here.

Jalava, S. T., Wammes, J. D., & Cheng, K. (2023). Drawing your way to an A: Long-lasting improvements in classroom quiz performance following drawing. Psychonomic Bulletin & Review, 30, 1939–1945. https://doi.org/10.3758/s13423-023-02294-2

There are some useful tips in the results of this study about the benefit of doodling…

My challenge to you:

Can you make a ggplot that looks a bit like Figure 2 from this study?

Answer

jalava <- read_csv("data/wk5/ClassDraw.csv")

Rows: 168 Columns: 9
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Gender
dbl (8): ID, Age, write_1, write_2, draw_1, draw_2, exam_draw, exam_write

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

jalava1 <- pivot_longer(jalava, names_to = c("test",".value"), names_sep="_" , values_to = "score", cols = c("write_1","write_2","draw_1","draw_2"))
jalava2 <- pivot_longer(jalava1, names_to = "test_time", values_to = "score", cols = c(`1`,`2`))

jalava2$test_time <- as.factor(jalava2$test_time)
ggplot(jalava2, aes(x = test_time, y = score, fill = test)) + geom_boxplot()

Warning: Removed 48 rows containing non-finite outside the scale range
(`stat_boxplot()`).

# median looks very similar (boxplots very overlapping), but means are a bit different:
jalava2 %>% group_by(test, test_time) %>% summarise(mean(score, na.rm=TRUE), sd(score, na.rm=TRUE))

`summarise()` has grouped output by 'test'. You can override using the
`.groups` argument.

# A tibble: 4 × 4
# Groups:   test [2]
  test  test_time `mean(score, na.rm = TRUE)` `sd(score, na.rm = TRUE)`
  <chr> <fct>                           <dbl>                     <dbl>
1 draw  1                               0.486                     0.215
2 draw  2                               0.322                     0.216
3 write 1                               0.467                     0.212
4 write 2                               0.3                       0.212

If you’ve made some progress on the de Zubicaray et al. (2024) data analysis, to the point where you have replicated (more or less) the multiple regression analysis from the first study in that paper, then here is the next step.

Note, this is a major challenge, but it will be a step into new research!

The task is to take the de Zubicaray data, and add a new variable which determines whether the word is a palindrome or not. A palindrome is a word that is spelled the same backwards as forwards (e.g., abba, rotavator).

I would like to know if being a palindrome or not affects processing of the words. My prediction is that being a palindrome helps access the word.

What you will need to do is apply a function called “palindrome” to the words in the de Zubicaray dataset, to make a new variable, called or some other name then run a multiple regression with all the other variables plus this one.

Here is a script with the function called palindrome included, with some comments to show how it is put into practice. You will need to add the function into the top of an analysis script (e.g., rmd or r script) file and make sure that is loaded along with the other commands you are using.

5.4.2 Data

Data referred to in this workbook:

5.4.3 Answers

The answers to the workbook will appear below each question in the workbook, above, after the practical has finished, so you can check your work.

5.5 Extras

Optionally, if you can give us your (anonymised) feedback on how the course is going from your perspective, that would be very welcome.

Also optionally, read the articles on the importance of statistical understanding and insights from good data visualisation:

How scientists can be better at statistics. Note that this article is hosted on the Spectator website, and (content warning) refers to Harold Shipman, a serial killer.
Florence Nightingale and data visualisation. Note that this is hosted on Scientific American.