Up  

 

 

THE SIGN TEST FOR THE MEDIAN

there are times when a nonparametric test is desirable, particularly if the available measurements are ordinal.

The sign test for the median provides a simple, distribution-free test of hypotheses about symmetry of sample data around their median.

The sign test gets its name from the fact that pluses and minuses, rather than numerical values, provide the data used in the computations.

Median Incomes for Selected Canadian Cities 1993

 

Victoria

21,200

Thunder Bay

21,000

Kitchener

20,800

London

20,800

Calgary

20,800

Toronto

20,300

Regina

20,200

Halifax

20,000

Edmonton

20,000

Windsor

19,800

Vancouver

19,600

Quebec

18,800

Winnipeg

18,300

St. Catharines-Niagara

18,400

Saskatoon

18,000

Montreal

17,200

Saint John

16,900

St. John's

16,900

Sherbrooke

16,100

Chicoutimi-Jonquiere

15,600

CDN median =18,000

from: http://www.indnet.org/demog/0024.html

an economic geographer consulting for a federal government agency is presented with what is explained to be a random sample of n 20 observations of median family income in Canada

The statistical question is, do the sample data support the null hypothesis that the true median family income for this metropolis is $18,000 at the .05 level of significance?

As a first step in testing the hypothesis, the geographer determines which tract incomes lie above and which ones lie below the hypothesized median of $18,000.

A plus sign is attached to those tract incomes that are above the hypothesized median and a minus sign to those that fall below the median.

Observed values that equal the hypothesized median are counted neither as plus or minus, and the sample size is reduced accordingly.

From the listing of the sample of household incomes, it may be easily seen that 5 of the tract values are below the hypothesized median of $18,000, and 14 of the values are above the median with one at the median.

So the proportion of the observation below the median is p=5/19=.263

The second step in this test is to calculate the probability of obtaining as few or fewer of the less frequently occurring sign, or the probability of obtaining as many or more of the more frequently occurring sign.

The critical upper and lower bounds are found with

It does not matter which proportion—the most frequently occurring sign or the less frequently occurring sign—is computed, since the test assumption is that half the observations will fall above and half below the median.

For a two-tailed test, the area under a symmetrical distribution between the critical upper and lower bounds constitutes an acceptance region

Since p falls outside the bounds dictated by the sample size, we reject the null hypothesis and conclude the proportion of negative scores is significantly different from 0.5, or that the median = $18,000.