All corrections
Wikipedia April 20, 2026 at 11:48 PM

en.wikipedia.org/wiki/Point-biserial_correlation_coefficient

1 correction found

1
Claim
If it can be assumed that the dichotomous variable Y is normally distributed, a better descriptive index is given by the biserial coefficient:
Correction

This is misstated: a dichotomous variable itself cannot be normally distributed. The biserial correlation assumes the observed binary variable comes from an underlying continuous, normally distributed latent variable that has been dichotomized.

Full reasoning

A dichotomous variable takes only two values, so it is a discrete variable. A normal distribution is a continuous distribution, not a two-point distribution.

For the biserial correlation, standard references do not assume that the observed binary variable itself is normal. Instead, they assume that the binary variable is a dichotomized measurement of an underlying continuous variable that is normally distributed:

  • SAS states that the biserial correlation is used when the binary variable "has an underlying continuous distribution but is measured as binary."
  • NCSS explains the biserial correlation by starting with bivariate normal variables and then dichotomizing one of them to create the binary variable.
  • NIST defines the normal distribution as a continuous distribution.

So the sentence should refer to an underlying continuous normally distributed variable, not to the observed dichotomous variable itself being normally distributed.

3 sources
Model: OPENAI_GPT_5 Prompt: v1.16.0