Dunnett’s test#

Dunnett’s test compares the means of multiple experimental groups against a single control group. In [1], the influence of drugs on blood count measurements on three groups of animals is investigated.

The following table summarizes the results of the experiment in which two groups received different drugs, and one group acted as a control. Blood counts (in millions of cells per cubic millimeter) were recorded:

import numpy as np
control = np.array([7.40, 8.50, 7.20, 8.24, 9.84, 8.32])
drug_a = np.array([9.76, 8.80, 7.68, 9.36])
drug_b = np.array([12.80, 9.68, 12.16, 9.20, 10.55])

We would like to see if the means between any of the groups are significantly different. First, visually examine a box and whisker plot.

import matplotlib.pyplot as plt
fig, ax = plt.subplots(1, 1)
ax.boxplot([control, drug_a, drug_b])
ax.set_xticklabels(["Control", "Drug A", "Drug B"])
ax.set_ylabel("mean")
plt.show()
../../_images/d2dd93ae3ce14b4b5144f8cecad0ed5de80f89048b787359ad03f2c8db020e10.png

Note the overlapping interquartile ranges of the drug A group and control group and the apparent separation between the drug B group and control group.

Next, we will use Dunnett's test to assess whether the difference between group means is significant while controlling the family-wise error rate: the probability of making any false discoveries.

Let the null hypothesis be that the experimental groups have the same mean as the control and the alternative be that an experimental group does not have the same mean as the control. We will consider a 5% family-wise error rate to be acceptable, and therefore we choose 0.05 as the threshold for significance.

from scipy.stats import dunnett
res = dunnett(drug_a, drug_b, control=control)
res.pvalue
array([0.62005852, 0.00579179])

The p-value corresponding with the comparison between group A and control exceeds 0.05, so we do not reject the null hypothesis for that comparison. However, the p-value corresponding with the comparison between group B and control is less than 0.05, so we consider the experimental results to be evidence against the null hypothesis in favor of the alternative: group B has a different mean than the control group.

References#