Table 1

Definitions of the psychometric properties used in the selection criteria for included articles

DomainMeasurement propertyExample test statisticDefinition
ReliabilityThe consistency of a test or measurement, that is, how consistently a measure produces similar results with repeated measures over a short period of time or across assessors at the same time point. This can also be thought of as the correlation between observed scores across replications.
Test–retest reliabilityCorrelation coefficientCorrelation between scores from the same test from assessments conducted over a short time interval.
Inter-rater reliabilitykappa, Bland-Altman testThe extent to which independent assessors produce similar ratings in judging the same abilities or characteristics in the same target person at the same time.
Internal consistency reliabilityCronbach’s alpha, alphaDegree of interrelatedness among items on the same tool, that is, how well the items work together to provide information on an underlying construct.
ValidityThe degree to which the tool measures what it is supposed to measure, that is, the degree to which the tool reflects the underlying construct.
Content/face validityThe degree to which the content of the tool is adequate for the construct being measured, that is, assessing the extent to which a tool appears to reflect the underlying construct.
Concurrent/criterion validityCorrelation coefficient; regression estimateThe degree to which scores on one measurement tool are related to scores obtained at about the same point in time from another tool considered the gold standard.
Convergent validityCorrelation coefficient; regression estimateEvidence that scores on a test or measurement are associated with theoretically related measures or variables.
PredictiveCorrelation coefficient; regression estimateEvidence that a score correlates with a variable that can only be assessed at some point after the test has been administered or the measurement made, for example, evidence that scores now are correlated with scores at a future time point.
Structural validity (dimensionality)Exploratory factor analysis: number of factors, eigen values
Confirmatory factor analysis: model fit statistics such as Comparative Fit Index, root mean square error of approximation
The degree to which the scores of an assessment are an adequate reflection of the dimensionality of the construct to be measured.
InvarianceThe property when a scale or construct provides the same results across different samples, populations, settings or characteristics.
Measurement invariance over countriesLikelihood ratio χ2 statistic and p-value from freeing parameters across groupsThe degree to which an assessment or construct provides the same results across separate samples in different countries.
Measurement invariance over other groupsLikelihood ratio χ2 statistic and p-value from freeing parameters across groupsThe degree to which an assessment or construct provides the same results across different groups.
  • All definitions are based on the APA Dictionary of Psychology.7