Wilcoxon signed-rank test – Wikipedia


Statistical hypothesis test


The Wilcoxon signed-rank test is a non-parametric statistical hypothesis test used either to test the location of a population based on a sample of data, or to compare the locations of two populations using two matched samples.[1] The one-sample version serves a purpose similar to that of the one-sample Student’s t-test.[2] For two matched samples, it is a paired difference test like the paired Student’s t-test (also known as the “t-test for matched pairs” or “t-test for dependent samples”). The Wilcoxon test can be a good alternative to the t-test when population means are not of interest; for example, when one wishes to test whether a population’s median is nonzero, or whether there is a better than 50% chance that a sample from one population is greater than a sample from another population.

History

The test is named for Frank Wilcoxon (1892–1965) who, in a single paper, proposed both it and the rank-sum test for two independent samples.[3] The test was popularized by Sidney Siegel (1956) in his influential textbook on non-parametric statistics.[4] Siegel used the symbol T for the test statistic, and consequently, the test is sometimes referred to as the Wilcoxon T-test.

Test procedure

There are two variants of the signed-rank test. From a theoretical point of view, the one-sample test is more fundamental because the paired sample test is performed by converting the data to the situation of the one-sample test. However, most practical applications of the signed-rank test arise from paired data.

For a paired sample test, the data consists of samples (X_1, Y_1), …, (X_n, Y_n). Each sample is a pair of measurements. In the simplest case, the measurements are on an interval scale. Then they may be converted to real numbers, and the paired sample test is converted to a one-sample test by replacing each pair of numbers (X_i, Y_i) by its difference X_i − Y_i.[5] In general, it must be possible to rank the differences between the pairs. This requires that the data be on an ordered metric scale, a type of scale that carries more information than an ordinal scale but may have less than an interval scale.[6]

The data for a one-sample test is a set of real number samples X_1, …, X_n. Assume for simplicity that the samples have distinct absolute values and that no sample equals zero. (Zeros and ties introduce several complications; see below.) The test is performed as follows:[7][8]

  1. Compute |X_1|, …, |X_n|.
  2. Sort |X_1|, …, |X_n|, and use the sorted list to assign ranks R_1, …, R_n: the sample smallest in absolute value gets rank 1, the next smallest gets rank 2, and so on.
  3. Let sgn denote the sign function: sgn(x) = 1 if x > 0 and sgn(x) = −1 if x < 0. The test statistic is the signed-rank sum T = sgn(X_1) R_1 + ⋯ + sgn(X_n) R_n.
  4. Produce a p-value by comparing T to its distribution under the null hypothesis.
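The steps above can be sketched in Python; this is a minimal illustration (the function name is ours, not a standard API), assuming no observation is zero and giving tied absolute values their average rank:

```python
def signed_rank_statistics(x):
    """Compute the signed-rank sum T, the positive-rank sum T+,
    and the negative-rank sum T- for a one-sample Wilcoxon test.

    Assumes no observation is zero; observations tied in absolute
    value receive the average of the ranks they span."""
    n = len(x)
    order = sorted(range(n), key=lambda i: abs(x[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        # find the block of observations tied in absolute value
        j = i
        while j + 1 < n and abs(x[order[j + 1]]) == abs(x[order[i]]):
            j += 1
        avg = (i + 1 + j + 1) / 2  # average of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    t_plus = sum(r for xi, r in zip(x, ranks) if xi > 0)
    t_minus = sum(r for xi, r in zip(x, ranks) if xi < 0)
    return t_plus - t_minus, t_plus, t_minus
```

For x = [1.5, −2.0, 3.0] the absolute values already appear in rank order 1, 2, 3, so T+ = 4, T− = 2, and T = 2.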

The ranks are defined so that R_i is the number of j for which |X_j| ≤ |X_i|. Additionally, if σ: {1, …, n} → {1, …, n} is such that |X_{σ(1)}| < ⋯ < |X_{σ(n)}|, then R_{σ(i)} = i for all i.

The signed-rank sum T is closely related to two other test statistics. The positive-rank sum T^+ and the negative-rank sum T^− are defined by[9]

T^+ = Σ_{X_i > 0} R_i,  T^− = Σ_{X_i < 0} R_i.

Because T^+ + T^− equals the sum of all the ranks, which is 1 + 2 + ⋯ + n = n(n + 1)/2, these three statistics are related by:[10]

T = T^+ − T^− = 2T^+ − n(n + 1)/2 = n(n + 1)/2 − 2T^−.

Because T, T^+, and T^− carry the same information, any of them may be used as the test statistic.

The positive-rank sum and negative-rank sum have alternative interpretations that are useful for the theory behind the test. Define the Walsh average W_{ij} to be (X_i + X_j)/2. Then:[11]

T^+ = #{(i, j) : i ≤ j and W_{ij} > 0},  T^− = #{(i, j) : i ≤ j and W_{ij} < 0}.
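One way to see the Walsh-average characterization concretely is to count the averages by brute force; this sketch (names are illustrative) counts the positive and negative W_{ij} = (x_i + x_j)/2 over all pairs with i ≤ j, which reproduces T^+ and T^−:

```python
def walsh_statistics(x):
    """Count positive and negative Walsh averages W_ij = (x_i + x_j)/2
    over all pairs with i <= j. For samples with distinct nonzero
    absolute values, these counts equal T+ and T-."""
    n = len(x)
    t_plus = t_minus = 0
    for i in range(n):
        for j in range(i, n):
            w = (x[i] + x[j]) / 2
            if w > 0:
                t_plus += 1
            elif w < 0:
                t_minus += 1
    return t_plus, t_minus
```

For x = [1.5, −2.0, 3.0] the counts are (4, 2), matching the positive-rank and negative-rank sums computed directly from the ranks.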

Null and alternative hypotheses

One-sample test

The one-sample Wilcoxon signed-rank test can be used to test whether data comes from a symmetric population with a specified median.[12] If the population median is known, then it can be used to test whether data is symmetric about its center.[13]

To explain the null and alternative hypotheses formally, assume that the data consists of independent and identically distributed samples from a distribution F. If X_1 and X_2 are IID F-distributed random variables, define F^(2) to be the cumulative distribution function of (X_1 + X_2)/2. Set

p_2 = Pr((X_1 + X_2)/2 > 0) = 1 − F^(2)(0),

and assume for simplicity that F is continuous. The one-sample Wilcoxon signed-rank sum test is a test for the following null hypothesis against one of the following alternative hypotheses:[14]

Null hypothesis H0: p_2 = 1/2
One-sided alternative hypothesis H1: p_2 > 1/2
One-sided alternative hypothesis H2: p_2 < 1/2
Two-sided alternative hypothesis H3: p_2 ≠ 1/2

The alternative hypothesis being tested depends on whether the test statistic is used to compute a one-sided or two-sided p-value (and if one-sided, which side). If μ is a fixed, predetermined quantity, then the test can also be used as a test for the value of Pr((X_1 + X_2)/2 > μ) by subtracting μ from every data point.

The above null and alternative hypotheses are derived from the fact that 2T^+/n^2 is a consistent estimator of p_2.[15] It can also be derived from the description of T^+ and T^− in terms of Walsh averages, since that description shows that the Wilcoxon test is the same as the sign test applied to the set of Walsh averages.[16]

Restricting the distributions of interest can lead to more interpretable null and alternative hypotheses. One mildly restrictive assumption is that F^(2) has a unique median. This median is called the pseudomedian of F; in general it is different from the mean and the median, even when all three exist. If the existence of a unique pseudomedian can be assumed true under both the null and alternative hypotheses, then these hypotheses can be restated as:

Null hypothesis H0: The pseudomedian of F is located at zero
One-sided alternative hypothesis H1: The pseudomedian of F is located at some point greater than zero
One-sided alternative hypothesis H2: The pseudomedian of F is located at some point less than zero

Most often, the null and alternative hypotheses are stated under the assumption of symmetry. Fix a real number μ. Define F to be symmetric about μ if a random variable X with distribution F satisfies Pr(X ≤ μ − x) = Pr(X ≥ μ + x) for all x. If F has a density function f, then F is symmetric about μ if and only if f(μ + x) = f(μ − x) for every x.[17]

If the null and alternative distributions of F can be assumed symmetric, then the null and alternative hypotheses simplify to the following:[18]

Null hypothesis H0: F is symmetric about μ = 0
One-sided alternative hypothesis H1: F is symmetric about some μ > 0
One-sided alternative hypothesis H2: F is symmetric about some μ < 0

If in addition Pr(X = μ) = 0, then μ is a median of F. If this median is unique, then the Wilcoxon signed-rank sum test becomes a test for the location of the median.[19] When the mean of F is defined, then the mean is μ, and the test is also a test for the location of the mean.[20]

The restriction that the alternative distribution is symmetric is highly restrictive, but for one-sided tests it can be weakened. Say that F is stochastically smaller than a distribution symmetric about zero if an F-distributed random variable X satisfies Pr(X < −x) ≥ Pr(X > x) for all x ≥ 0. Similarly, F is stochastically larger than a distribution symmetric about zero if Pr(X < −x) ≤ Pr(X > x) for all x ≥ 0. Then the Wilcoxon signed-rank sum test can also be used for the following null and alternative hypotheses:[21][22]

Null hypothesis H0: F is symmetric about zero
One-sided alternative hypothesis H1: F is stochastically smaller than a distribution symmetric about zero
One-sided alternative hypothesis H2: F is stochastically larger than a distribution symmetric about zero

The hypothesis that the data are IID can be weakened. Each data point may be taken from a different distribution, as long as all the distributions are assumed to be continuous and symmetric about a common point μ_0. The data points are not required to be independent as long as the conditional distribution of each observation given the others is symmetric about μ_0.[23]

Paired data test

Because the paired data test arises from taking paired differences, its null and alternative hypotheses can be derived from those of the one-sample test. In each case, they become assertions about the behavior of the differences X_i − Y_i.

Let F(x, y) be the joint cumulative distribution of the pairs (X_i, Y_i). If F is continuous, then the most general null and alternative hypotheses are expressed in terms of

p_2 = Pr((X_1 − Y_1) + (X_2 − Y_2) > 0)

and are identical to those of the one-sample case:

Null hypothesis H0: p_2 = 1/2
One-sided alternative hypothesis H1: p_2 > 1/2
One-sided alternative hypothesis H2: p_2 < 1/2
Two-sided alternative hypothesis H3: p_2 ≠ 1/2

Like the one-sample case, under some restrictions the test can be interpreted as a test for whether the pseudomedian of the differences is located at zero.

A common restriction is to symmetric distributions of differences. In this case, the null and alternative hypotheses are:[24][25]

Null hypothesis H0: The observations X_i − Y_i are symmetric about μ = 0
One-sided alternative hypothesis H1: The observations X_i − Y_i are symmetric about some μ > 0
One-sided alternative hypothesis H2: The observations X_i − Y_i are symmetric about some μ < 0

These can also be expressed more directly in terms of the original pairs:[26]

Null hypothesis H0: The observations (X_i, Y_i) are exchangeable
One-sided alternative hypothesis H1: For some μ > 0, the pairs (X_i, Y_i) and (Y_i + μ, X_i − μ) are exchangeable
One-sided alternative hypothesis H2: For some μ < 0, the pairs (X_i, Y_i) and (Y_i + μ, X_i − μ) are exchangeable
Two-sided alternative hypothesis H3: For some μ ≠ 0, the pairs (X_i, Y_i) and (Y_i + μ, X_i − μ) are exchangeable

The null hypothesis of exchangeability can arise from a matched pair experiment with a treatment group and a control group. Randomizing the treatment and control within each pair makes the observations exchangeable. For an exchangeable distribution,

XiYi{displaystyle X_{i}-Y_{i}}

has the same distribution as

YiXi{displaystyle Y_{i}-X_{i}}

, and therefore, under the null hypothesis, the distribution is symmetric about zero.[27]

Because the one-sample test can be used as a one-sided test for stochastic dominance, the paired difference Wilcoxon test can be used to compare the following hypotheses:[28]

Null hypothesis H0: The observations (X_i, Y_i) are exchangeable
One-sided alternative hypothesis H1: The differences X_i − Y_i are stochastically smaller than a distribution symmetric about zero

Let u_n(t^+) denote the number of subsets of {1, …, n} whose elements sum to t^+. Then u_n satisfies the recursion

u_n(t^+) = u_{n−1}(t^+) + u_{n−1}(t^+ − n).

The formula is true because every subset of {1, …, n} which sums to t^+ either does not contain n, in which case it is also a subset of {1, …, n − 1}, or it does contain n, in which case removing n from the subset produces a subset of {1, …, n − 1} which sums to t^+ − n. Under the null hypothesis, the probability mass function of T^+ satisfies Pr(T^+ = t^+) = u_n(t^+)/2^n. The function u_n is closely related to the integer partition function.[54]

If p_n(t^+) is the probability that T^+ = t^+ under the null hypothesis when there are n samples, then p_n satisfies a similar recursion:[55]

p_n(t^+) = (p_{n−1}(t^+) + p_{n−1}(t^+ − n))/2,

with similar boundary conditions. There is also a recursive formula for the cumulative distribution function Pr(T^+ ≤ t^+).[56]
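The subset-sum recursion gives a simple dynamic program for tabulating the exact null distribution of T^+; this sketch (our own naming) iterates the ranks 1, …, n and updates the subset counts in place:

```python
def null_distribution(n):
    """Exact null distribution of the positive-rank sum T+ for sample
    size n, via the subset-sum recursion
        u_n(t) = u_{n-1}(t) + u_{n-1}(t - n).
    Returns a list p with p[t] = Pr(T+ = t) for t = 0 .. n(n+1)/2."""
    max_t = n * (n + 1) // 2
    u = [1] + [0] * max_t  # u_0: only the empty subset, summing to 0
    for k in range(1, n + 1):
        # iterate t downward so each rank k is counted at most once
        for t in range(max_t, k - 1, -1):
            u[t] += u[t - k]
    total = 2 ** n
    return [c / total for c in u]
```

For n = 2 the four subsets of {1, 2} have sums 0, 1, 2, 3, so each value of T^+ has probability 1/4.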

For very large n, even the above recursion is too slow. In this case, the null distribution can be approximated. The null distributions of T, T^+, and T^− are asymptotically normal with means and variances:[57]

E[T] = 0,  Var(T) = n(n + 1)(2n + 1)/6,
E[T^+] = E[T^−] = n(n + 1)/4,  Var(T^+) = Var(T^−) = n(n + 1)(2n + 1)/24.

Better approximations can be produced using Edgeworth expansions. Using a fourth-order Edgeworth expansion corrects the normal approximation with additional terms in powers of n^{−1/2}.[58][59]
The technical underpinnings of these expansions are rather involved, because conventional Edgeworth expansions apply to sums of IID continuous random variables, while T^+ is a sum of non-identically distributed discrete random variables. The final result, however, is that the above expansion has an error of O(n^{−3/2}), just like a conventional fourth-order Edgeworth expansion.[58]
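As a hedged illustration of the plain normal approximation (before any Edgeworth correction), the following sketch uses the standard null moments E[T^+] = n(n + 1)/4 and Var(T^+) = n(n + 1)(2n + 1)/24 with a continuity correction of 1/2; the function name is ours:

```python
import math

def normal_sf_Tplus(n, t):
    """Approximate Pr(T+ >= t) under the null hypothesis using the
    asymptotic normal distribution of T+, with a continuity
    correction of 1/2."""
    mean = n * (n + 1) / 4
    sd = math.sqrt(n * (n + 1) * (2 * n + 1) / 24)
    z = (t - 0.5 - mean) / sd
    # 0.5 * erfc(z / sqrt(2)) is the standard normal survival function
    return 0.5 * math.erfc(z / math.sqrt(2))
```

For n = 10 the null mean of T^+ is 27.5, so the approximate one-sided p-value at t = 28 is exactly 1/2 after the continuity correction.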

The moment generating function of T has the exact formula:[60]

M(t) = ∏_{j=1}^{n} cosh(jt),

since under the null hypothesis each rank j contributes +j or −j to T with equal probability, independently across ranks.

When zeros are present and the signed-rank zero procedure is used, or when ties are present and the average rank procedure is used, the null distribution of T changes. Cureton derived a normal approximation for this situation.[61][62] Suppose that the original number of observations was n and the number of zeros was z. The tie correction is

c = Σ (t^3 − t),

where the sum is over all the sizes t of each group of tied observations. The expectation of T is still zero, while the expectation of T^+ is

E[T^+] = (n(n + 1) − z(z + 1))/4.

If

σ^2 = (n(n + 1)(2n + 1) − z(z + 1)(2z + 1))/6 − c/12,

then Var(T) = σ^2, and T/σ is approximately a standard normal deviate under the null hypothesis.
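A small sketch of these corrected moments (our own function; it assumes the ranks of the z zeros are simply removed and the tie correction c = Σ(t^3 − t) is applied to the variance):

```python
def cureton_moments(n, z, tie_sizes):
    """Null mean of T+ and variance of T when z of the n observations
    are zero (signed-rank zero procedure) and the nonzero absolute
    differences are tied in groups of the given sizes (average-rank
    procedure)."""
    c = sum(t ** 3 - t for t in tie_sizes)  # tie correction
    mean_t_plus = (n * (n + 1) - z * (z + 1)) / 4
    var_t = (n * (n + 1) * (2 * n + 1)
             - z * (z + 1) * (2 * z + 1)) / 6 - c / 12
    return mean_t_plus, var_t
```

With no zeros and no ties this reduces to the uncorrected moments, e.g. Var(T) = n(n + 1)(2n + 1)/6.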

Alternative statistics

Wilcoxon[63] originally defined the Wilcoxon rank-sum statistic to be min(T^+, T^−). Early authors such as Siegel[64] followed Wilcoxon. This is appropriate for two-sided hypothesis tests, but it cannot be used for one-sided tests.

Instead of assigning ranks between 1 and n, it is also possible to assign ranks between 0 and n − 1. These are called modified ranks.[65] The modified signed-rank sum T_0, the modified positive-rank sum T_0^+, and the modified negative-rank sum T_0^− are defined analogously to T, T^+, and T^− but with the modified ranks in place of the ordinary ranks. The probability that the sum of two independent F-distributed random variables is positive can be estimated as 2T_0^+/(n(n − 1)).[66] When consideration is restricted to continuous distributions, this is a minimum variance unbiased estimator of p_2.[67]

Example

[Table: paired samples, their differences, and the signed ranks of the differences, ordered by absolute difference.]

In the table, sgn is the sign function, abs is the absolute value, and R_i is the rank. Notice that pairs 3 and 9 are tied in absolute value. They would be ranked 1 and 2, so each gets the average of those ranks, 1.5. The positive-rank sum is T^+ = 27, the negative-rank sum is T^− = 18, and the signed-rank sum is T = 27 − 18 = 9.

Effect size

To compute an effect size for the signed-rank test, one can use the rank-biserial correlation.

If the test statistic T is reported, the rank correlation r is equal to the test statistic T divided by the total rank sum S, or r = T/S.[68] Using the above example, the test statistic is T = 9. The sample size of 9 has a total rank sum of S = 1 + 2 + ⋯ + 9 = 45. Hence, the rank correlation is 9/45, so r = 0.20.

If the test statistic T is reported, an equivalent way to compute the rank correlation is with the difference in proportion between the two rank sums, which is the Kerby (2014) simple difference formula.[68] To continue with the current example, the sample size is 9, so the total rank sum is 45. T is the smaller of the two rank sums, so T is 3 + 4 + 5 + 6 = 18. From this information alone, the remaining rank sum can be computed, because it is the total sum S minus T, or in this case 45 − 18 = 27. Next, the two rank-sum proportions are 27/45 = 60% and 18/45 = 40%. Finally, the rank correlation is the difference between the two proportions (.60 minus .40), hence r = .20.
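Both computations reduce to the Kerby simple difference formula; a one-line sketch (the function name is ours):

```python
def rank_biserial(favorable, unfavorable):
    """Kerby simple difference formula: the rank correlation is the
    difference between the favorable and unfavorable rank-sum
    proportions, r = (favorable - unfavorable) / S."""
    s = favorable + unfavorable  # total rank sum S = n(n+1)/2
    return (favorable - unfavorable) / s
```

With the example's rank sums 27 and 18, rank_biserial(27, 18) recovers r = 0.20.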

Software implementations

  • R includes an implementation of the test as wilcox.test(x,y, paired=TRUE), where x and y are vectors of equal length.[69]
  • ALGLIB includes implementation of the Wilcoxon signed-rank test in C++, C#, Delphi, Visual Basic, etc.
  • GNU Octave implements various one-tailed and two-tailed versions of the test in the wilcoxon_test function.
  • SciPy includes an implementation of the Wilcoxon signed-rank test in Python as scipy.stats.wilcoxon.
  • Accord.NET includes an implementation of the Wilcoxon signed-rank test in C# for .NET applications
  • MATLAB implements the Wilcoxon signed-rank test as p = signrank(x,y); the form [p,h] = signrank(x,y) also returns a logical value indicating the test decision. The result h = 1 indicates a rejection of the null hypothesis, and h = 0 indicates a failure to reject the null hypothesis at the 5% significance level.
  • Julia HypothesisTests package includes the Wilcoxon signed-rank test as pvalue(SignedRankTest(x, y)).
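As a usage sketch of the SciPy entry above (the data here are hypothetical, not from the article's example):

```python
from scipy.stats import wilcoxon

# hypothetical paired measurements (before/after treatment)
before = [125.0, 115.0, 130.0, 140.0, 140.0, 115.0, 140.0, 125.0]
after_ = [110.0, 122.0, 125.0, 120.0, 141.0, 124.0, 123.0, 137.0]

# two-sided paired signed-rank test on the differences before - after
res = wilcoxon(before, after_)
print(res.statistic, res.pvalue)
```

The call tests the null hypothesis that the differences are symmetric about zero; a small p-value suggests a systematic shift between the paired measurements.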

See also

References

  1. ^ Conover, W. J. (1999). Practical nonparametric statistics (3rd ed.). John Wiley & Sons, Inc. ISBN 0-471-16068-7., p. 350
  2. ^ “Wilcoxon signed-rank test – Handbook of Biological Statistics”. www.biostathandbook.com. Retrieved 2021-09-02.
  3. ^ Wilcoxon, Frank (Dec 1945). “Individual comparisons by ranking methods” (PDF). Biometrics Bulletin. 1 (6): 80–83. doi:10.2307/3001968. hdl:10338.dmlcz/135688. JSTOR 3001968.
  4. ^ Siegel, Sidney (1956). Non-parametric statistics for the behavioral sciences. New York: McGraw-Hill. pp. 75–83. ISBN 9780070573482.
  5. ^ Conover, p. 352
  6. ^ Siegel, p. 76
  7. ^ Conover, p. 353
  8. ^ Pratt, John W.; Gibbons, Jean D. (1981). Concepts of Nonparametric Theory. Springer-Verlag. ISBN 978-1-4612-5933-6., p. 148
  9. ^ Pratt and Gibbons, p. 148
  10. ^ Pratt and Gibbons, p. 148
  11. ^ Pratt and Gibbons, p. 150
  12. ^ Conover, pp. 352–357
  13. ^ Hettmansperger, Thomas P. (1984). Statistical Inference Based on Ranks. John Wiley & Sons. ISBN 0-471-88474-X., pp. 32, 50
  14. ^ Pratt and Gibbons, p. 153
  15. ^ Pratt and Gibbons, pp. 153–154
  16. ^ Hettmansperger, pp. 38–39
  17. ^ Pratt and Gibbons, pp. 146–147
  18. ^ Pratt and Gibbons, pp. 146–147
  19. ^ Hettmansperger, pp. 30–31
  20. ^ Conover, p. 353
  21. ^ Pratt and Gibbons, pp. 155–156
  22. ^ Hettmansperger, pp. 49–50
  23. ^ Pratt and Gibbons, p. 155
  24. ^ Conover, p. 354
  25. ^ Hollander, Myles; Wolfe, Douglas A.; Chicken, Eric (2014). Nonparametric Statistical Methods (Third ed.). John Wiley & Sons, Inc. ISBN 978-0-470-38737-5., pp. 39–41
  26. ^ Pratt and Gibbons, p. 147
  27. ^ Pratt and Gibbons, p. 147
  28. ^ Hettmansperger, pp. 49–50
  29. ^ Wilcoxon, Frank (1949). Some Rapid Approximate Statistical Procedures. American Cyanamid Co.
  30. ^ Pratt, J. (1959). “Remarks on zeros and ties in the Wilcoxon signed rank procedures”. Journal of the American Statistical Association. 54 (287): 655–667. doi:10.1080/01621459.1959.10501526.
  31. ^ Pratt, p. 659
  32. ^ Pratt, p. 663
  33. ^ Derrick, B; White, P (2017). “Comparing Two Samples from an Individual Likert Question”. International Journal of Mathematics and Statistics. 18 (3): 1–13.
  34. ^ Conover, William Jay (1973). “On Methods of Handling Ties in the Wilcoxon Signed-Rank Test”. Journal of the American Statistical Association. 68 (344): 985–988. doi:10.1080/01621459.1973.10481460.
  35. ^ Pratt and Gibbons, p. 162
  36. ^ Conover, pp. 352–353
  37. ^ Pratt and Gibbons, p. 164
  38. ^ Conover, pp. 358–359
  39. ^ Pratt, p. 660
  40. ^ Pratt and Gibbons, pp. 168–169
  41. ^ Pratt, pp. 661–662
  42. ^ Pratt and Gibbons, p. 170
  43. ^ Pratt and Gibbons, pp. 163, 166
  44. ^ Pratt, p. 660
  45. ^ Pratt and Gibbons, p. 166
  46. ^ Pratt and Gibbons, p. 171
  47. ^ Pratt, p. 661
  48. ^ Pratt, p. 660
  49. ^ Gibbons, Jean D.; Chakraborti, Subhabrata (2011). Nonparametric Statistical Inference (Fifth ed.). Chapman & Hall/CRC. ISBN 978-1-4200-7762-9., p. 194
  50. ^ Hettmansperger, p. 34
  51. ^ Pratt and Gibbons, pp. 148–149
  52. ^ Pratt and Gibbons, pp. 148–149, pp. 186–187
  53. ^ Hettmansperger, p. 171
  54. ^ Pratt and Gibbons, p. 187
  55. ^ Pratt and Gibbons, p. 187
  56. ^ Pratt and Gibbons, p. 187
  57. ^ Pratt and Gibbons, p. 149
  58. ^ a b Kolassa, John E. (1995). “Edgeworth approximations for rank sum test statistics”. Statistics and Probability Letters. 24 (2): 169–171. doi:10.1016/0167-7152(95)00164-H.
  59. ^ Hettmansperger, p. 37
  60. ^ Hettmansperger, p. 35
  61. ^ Cureton, Edward E. (1967). “The normal approximation to the signed-rank sampling distribution when zero differences are present”. Journal of the American Statistical Association. 62 (319): 1068–1069. doi:10.1080/01621459.1967.10500917.
  62. ^ Pratt and Gibbons, p. 193
  63. ^ Wilcoxon, p. 82
  64. ^ Siegel, p. 76
  65. ^ Pratt and Gibbons, p. 158
  66. ^ Pratt and Gibbons, p. 159
  67. ^ Pratt and Gibbons, p. 191
  68. ^ a b Kerby, Dave S. (2014), “The simple difference formula: An approach to teaching nonparametric correlation.”, Comprehensive Psychology, 3: 11.IT.3.1, doi:10.2466/11.IT.3.1
  69. ^ Dalgaard, Peter (2008). Introductory Statistics with R. Springer Science & Business Media. pp. 99–100. ISBN 978-0-387-79053-4.

External links

