Department of Psychology, Lancaster University
Tip
Ask me anything:
We are working together to develop concepts: our work will develop across the weeks but we introduce the key ideas at specific points
We are working together to develop skills in a series of classes
We want to be capable of being wrong
The derivation chain (Meehl, 1990; Scheel et al., 2021)
We often teach and learn about different kinds of validity but the key idea is simple (Borsboom et al., 2004):
a test is valid for measuring an attribute if and only if (a) the attribute exists and (b) variations in the attribute causally produce variations in the outcomes of the measurement procedure
We want to work with valid measures but validity requires explaining: (Q.1) Does the thing exist in the world? (Q.2) Is variation in that thing be reflected in variation in our measurement?
ETHNICITY
White, is AGE
34 years, scored 33 on Shipley
vocabulary, scored 7 on HLVA
health literacy and, on average, self-rated their understanding of health information as 7.96 (so 8/9, mean.self
) while scoring 0.49 accuracy in tests of understanding (49% mean.acc
)# A tibble: 4 × 6
mean.acc mean.self HLVA SHIPLEY AGE ETHNICITY
<dbl> <dbl> <dbl> <dbl> <dbl> <fct>
1 0.49 7.96 7 33 34 White
2 0.85 7.28 7 33 25 White
3 0.82 7.36 8 40 43 White
4 0.94 7.88 11 33 46 White
Covariance
\[COV_{xy} = \frac{\sum(x - \bar{x})(y - \bar{y})}{n -1}\]
Covariance divided by standard deviations
\[r = \frac{COV_{xy}}{s_xs_y}\]
SHIPLEY
out of 40; mean.acc
(proportion, out of 1)mean.acc
and mean.self
scores will be associatedmean.acc
and mean.self
scores will be correlatedHistograms showing the distribution of mean accuracy and mean self-rated accuracy scores in the ‘clearly.one.subjects’ dataset: means calculated for each participant over all their responses
mean.accuracy
score of .49, lower than the averagemean.accuracy
score of .94, higher than the averagemean.self
scores be: will they be higher or lower than the average mean.self
score?Scatterplots showing whether values on mean accuracy (mean.acc
) vary together with values on mean self-rated accuracy (mean.self
) for the participants in this sample
cor.test
function, and name one variable clearly.one.subjects$mean.acc
clearly.one.subjects$mean.self
method = "pearson"
because we have a choicecor
) and the p-valuecor = .4863771
which we round to \(cor = .49\)p-value = 2.026e-11
indicating that the correlation is significant \(p < .001\)
Pearson's product-moment correlation
data: clearly.one.subjects$mean.acc and clearly.one.subjects$mean.self
t = 7.1936, df = 167, p-value = 2.026e-11
alternative hypothesis: true correlation is not equal to 0
95 percent confidence interval:
0.3619961 0.5937425
sample estimates:
cor
0.4863771
Mean accuracy and mean self-rated accuracy were significantly correlated (\(r = .49 (167 \text{ df}), p < .001\)). Higher mean accuracy scores are associated with higher mean self-rated accuracy scores.
mean.acc
) scores are associated with higher mean self-rated accuracy (mean.self
) scoresWe can simulate data to demonstrate: (left) the correlation is positive, \(r = .5\); (right) the correlation is negative, \(r = -.5\)
Scatterplots showing how simulated data values on mean accuracy and mean self-rated accuracy could vary together given positive or negative correlations
Scatterplots showing how simulated data values on mean accuracy and mean self-rated accuracy could vary together given positive correlations of increasing size
Rammstein concert crowd surfing; flickr, CC, Anirudh Koul
flickr: Jeremy Brooks ‘Free speech fear free’