chi square

Chi-square test (P²)

nominal data (categories)

one-sample, two-sample, > 2 sample test

research question: are wheat growing farms located with respect to soil type? That is, is wheat grown in particular soil-type areas?

1) take a random sample of 100 wheat farms and determine the soil types underlying the farms

2) there are 4 ‘classes’ of soil type

class
	clay	sand	loam	limestone
freq of wheat farms	30	30	30	10	E100

this is the ‘observed’ distribution of wheat farms

3) under a null hypothesis what would be our ‘expected’ distribution?

Answer: the % percentages of land under different soil types

class
	clay	sand	loam	limestone
actual % of land under soil type	30	40	20	10	E100

our null hypothesis is that:

H₀: soil type has no influence on wheat farm location

if H0 was true, then we would expect the observed number of wheat farms to be roughly equal to/proportional to actual % of land under particular soil types

observed	30	40	20	10
Expected	30	40	20	10

what we found was

observed	30	30	30	10
Expected	30	40	20	10

are these differences significant or could they have occurred due to random sampling differences?

DO SAMPLING DIFFERENCES REFLECT DIFFERENCES IN POPULATION OVER SOIL TYPE

4) H₁: Soil type has an influence on wheat farm location

5) Set significance level at 95% confidence or "=0.05

6) compute P² statistic

where O_i = observed value in category I

E_i= expected value in category I

k= number of categories

look up critical value (pg 276 in text)

df= k-1=4-1 = 3

df - means that given the total frequency, once the frequencies are known for all but one of the categories, the frequency in the final category is determined

P² _c("=0.05) (df=3) = 7.815

if O_i and E_i were equal then P² =0

P² > as differences >

7.815 defines value of P² where top 5% distribution starts with df=3

in this case P²=7.5

To reject H₀: calculated value must exceed the critical value

we cannot reject H₀: cannot say that we would expect the value 7.5 to occur $ 95 out of 100

if H₀ is correct, the probability of 7.5 occurring is > "=0.05

therefore farming is not related to soil type

rules of thumb

1) if the number of categories is greater than 2, no more than 1/5 of the expected frequencies should be less than 5 and none should be 0

2) if the number of categories is 2, both the expected and observed frequencies should be 5 or larger

these illustrate an important restriction on P² in that for many categories there should not be small frequencies

also the data must be in frequencies, P² will give false results if used on proportions or percentages of occurances in categories

poisson example

We begin our discussion by querying: Does the following sequence of data "fit" the Poisson distribution?

7	7	6	7	3
4	5	7	2	9
4	8	4	6	6
4	7	8	2	4
4	2	8	3	8
4	10	7	2	8
7	3	2	8	9
6	2	8	3	3
6	3	4	6	5
2	3	7	7	4

The null and alternative hypotheses:

H_o: The data set is distributed as Poisson

H_A: Not.

The test statistic and sampling distribution:

we select the test statistic and distribution currently under discussion. That is,

with df=n-k-1

where the expected (denoted as E_i) and observed (denoted as O_i) are determined via a frequency distribution and k is the number of parameters which require estimation before the test can be conducted, and n is the number of intervals over which the frequency distribution is determined. To complete the required test, we must first determine some overall attributes of the data set. As we can see the data values ranges from 2 to 10.

Maximum	10
Minimum	2
Average	5.28
Count	50

Then we must determine a reasonable breakdown of this range to use as the basis of our test. We first do a frequency analysis.

7	7	6	7	3	1	0
4	5	7	2	9	2	7
4	8	4	6	6	3	7
4	7	8	2	4	4	9
4	2	8	3	8	5	2
4	10	7	2	8	6	6
7	3	2	8	9	7	9
6	2	8	3	3	8	7
6	3	4	6	5	9	2
2	3	7	7	4	10	1
						50

Notice the check down at the lower right corner to make sure that my frequency analysis "read" all of the observations. Then we "compress" this breakdown a little to make sure we have more than 5 observations per interval.

	Observed	Poisson Probability	Expected
0,1,2	7	0.103	5.148
3	7	0.125	6.247
4	9	0.165	8.246
5,6	8	0.327	16.370
7	9	0.116	5.780
8,9,10	10	0.164	8.210
n	50		50

computation of the test statistics

	Observed	Poisson Probability	Expected	Chi-Square
0,1,2	7	0.103	5.148	0.666
3	7	0.125	6.247	0.091
4	9	0.165	8.246	0.069
5,6	8	0.327	16.370	4.279
7	9	0.116	5.780	1.794
8,9,10	10	0.164	8.210	0.390
n =	50		50	7.290
df =	4			0.2202

Further the degrees-of-freedom is determined to be n - k - 1 = 6 - 1 - 1 = 4, i.e., n = 6 intervals, k = 1 parameter required computation before the test could be run.

Draw a conclusion

The quantity in green is the p-value our

P² = 7.290. This is the probability of randomly being more extreme than 7.290 in the

P² with 4 degrees-of-freedom.

Since this p-value = 0.2202 is larger than 0.05 we cannot reject the null hypothesis in favor of the alternative. If we consult the tables in the text we find that the critical

P² (for = 0.05) to be 9.488.

P² and the fit to a poisson distribution

Some statistical points to note about the Poisson distribution

Like the binomial distribution it is assumed that events are independent of one another.

: is generally unknown and must be estimated by a sample mean.

Testing a spatial (and temporal) distribution

Often we wish to discover if spatial and temporal patterns are random. If they are not then a pattern must be regular or clumped (aggregated), in which case there is probably an interesting biological process at work.

As an example let us assume that we are interested in the spatial arrangement of buzzard Buteo buteo nests. We could set up two hypotheses:

Ho : Buzzard nests are randomly distributed

H₁ : Buzzard nests are not randomly distributed

We could test the H_o by collecting data from a number of 4km² squares. In each square we record the number of nests. (The size of the square is determined with respect to the biology of the organism) The results are tabulated in a frequency table. In this example 60 squares were assessed.

	X(No.of Nests)	Observed Frequency	No.of Nests
	0	4	0
	1	22	22
	2	15	30
	3	10	30
	4	7	28
	5+	2	10
Sum =		60	120

Average number of nests per square = 120 / 60 = 2.00 (the sample mean x)

the mean of the sample (x), is our best estimate

Substituting this value into the Poisson equation enables us to calculate the probability of observing 0, 1, 2 etc. nests per square. The steps are summarized below.

Assume that nests are randomly distributed with a mean of 2.00 nests per square.

Use the Poisson equation to find P(0), P(1), P(2) etc., nests per square.

Convert these probabilities to expected numbers of squares by multiplying the probabilities by the number of surveyed squares.

For example, suppose that P(1) = 0.25 (25% chance of finding 1 nest per square) we would then expect to find that 25% of the surveyed squares would contain one nest as long as the number of nests per square was random with a mean of 2.00 nests per square.

In this case 60 x 0.25 = 15.

Results

X(No. of Nests)	P(X)	P(X)x60
0	0.1353	8.12
1	0.2707	16.24
2	0.2707	16.24
3	0.1804	10.83
4	0.0902	5.41
5+	0.0526	3.16

Note that the simplest way of finding P(5+), if not using tables, is by subtraction, ie 1 - 0.9474, where 0.9474 is 0.1353 + 0.2707 +0.2707 + 0.1804 + 0.0902.

In order to determine if the nests are randomly distributed we need to find out if the differences between what we observed and what we would expect, given a random distribution, are significant.

If these differences are significant we can conclude that the pattern of buzzard nests is not random.

It is also possible that the pattern could be random and any differences between observed and expected frequencies may have arisen by chance.

In order to decide between these alternatives we must use a statistical test which allows us to compare the observed and expected frequencies and determine if there is a significant difference between these two sets of frequencies.

The intermediate calculations needed for this test are shown below

X	observed	expected	obs-exp(O-E)	(O-E)²/E
0	4	8.12	-4.12	2.09
1	22	16.24	5.76	2.04
2	15	16.24	-1.24	0.09
3	10	10.83	-0.83	0.06
4	7	5.41	1.59	0.46
5+	2	3.16	-1.16	0.42

Using the data above P² = 2.09 + 2.04 + 0.09 + 0.06 + 0.46 + 0.42 = 5.181.

Normally this would be the end of our calculation of P², but there is a common complication.

Because of a bias that it introduces, we should not use expected frequencies < 5 in the calculation of P² .

When they occur they should be amalgamated with the next value (above or below as appropriate).

In this example, 3.16 (E(5+)) is added to 5.41 (E(4)) to give 8.57 (E(4+)). Since we have combined the expected frequencies we must also combine the observed, i.e. 7 + 2 = 9.

Using this correction

X	observed	expected	obs-exp(O-E)	(O-E)²/E
0	4	8.12	-4.12	2.09
1	22	16.24	5.76	2.04
2	15	16.24	-1.24	0.09
3	10	10.83	-0.83	0.06
4+	9	8.57	0.43	0.02

Our value, 4.31, is smaller than 7.815 consequently we fail to reject H_o and say that the distribution of nests is not significantly different from a random distribution with a mean of 2.00.

One problem with this approach is that it is very dependent on the scale of the study. If we had worked with 1km² squares we would probably have come to a different conclusion. Scale effects are an important topic in all spatial analyses.

the material for the poisson example is derived from http://149.170.199.144/resdesgn/poisson.htm

Chi squared and the normal distribution

Does the following data set accurately "fit" the normal distribution with a mean of 0 and a standard deviation of 1?

0.464	0.060	1.486	1.022	1.394
0.906	1.179	-1.501	-0.609	1.372
-0.482	-1.376	-1.010	0.005	1.393
-1.787	-0.105	-1.339	1.041	0.279
-1.805	-1.186	0.658	-0.439	-1.399

This naturally induces the following hypothesis testing steps:

1) The null and alternative hypotheses:

H₀: The sample is drawn from a population distributed as N[0,1]

H_A: Not.

2) Select the test statistic and sampling distribution:

The first step in this process is to reorder the data in increasing order, i.e., as in the next table:

-1.805	-1.339	-0.439	0.464	1.179
-1.787	-1.186	-0.105	0.658	1.372
-1.501	-1.010	0.005	0.900	1.393
-1.399	-0.690	0.060	1.022	1.394
-1.376	-0.482	0.279	1.041	1.486

From these data one can begin to develop potential candidates for partitioning the range of both the observed and the expected frequency distributions. The intervals listed in the following table represents a reasonable first pass at such a partition.

Index	Interval	Interval Midpoint	Observed Frequency
1	(-4,-1.5)	-	3
2	[-1.5,-1.0)	-1.25	5
3	[-1.0,-0.5)	-0.75	1
4	[-0.5,0.0)	-0.25	3
5	[0.0,0.5)	0.25	4
6	[0.5,1.0)	0.75	2
7	[1.0,4)	-	7

The process of developing partitions illustrates one of the most significant limitations concerning the use of the P² goodness-of-fit tests.

The use of any nonparametric tests is critically dependent upon the number of data points, and the assumptions of the

P² goodness-of-fit test are:

All expected frequencies are at least 1.

At most 20% of the expected frequencies are less than 5.

We are casual in our respecification of our intervals, compressing the interval specification depicted in the table above to the version presented here:

Index	Interval	Interval Midpoint	Observed Frequency
1	(-4,-1.0)	-	8
2	[-1.0,0.0)	-0.5	4
3	[0.0,1.0)	0.5	6
4	[1.0,4)	-	7

Next we are required to determine the expected frequency of each interval in this partition.

further computation yields the following results:

Index	Interval	Interval Midpoint	Observed Frequency	Expected Frequency
1	(-4,-1.0)	-	8	3.9675
2	[-1.0,0.0)	-0.5	4	8.5325
3	[0.0,1.0)	0.5	6	8.5325
4	[1.0,4)	-	7	3.9675

A note for clarification is in order at this point.

The rule specifying that the frequency must be greater than or equal to 5 causes some problems at this point.

Clearly, in the above example, 2 of the expected frequencies are less than 5.

Since the sizes or number of intervals within a partition is at the analysts` discretion, these intervals must be combined further, as indicated below:

Index	Interval	Interval Midpoint	Observed Frequency	Expected Frequency
1	(-4,0.0)	-	12	12.5
2	[0.0,4)	-	13	12.5

But the difficulty here is that now our test statistic is incredibly small, i.e.,

Given our discretion on the interval specification, we compromise to determine the following:

Index	Interval	Interval Midpoint	Observed Frequency	Expected Frequency
1	(-4,-0.83)	-	8	5.082
2	[-0.83,0.0)	-0.425	4	7.418
3	[0.0,0.83)	0.425	5	7.418
4	[0.83,4)	-	8	5.082

Given this information, we then can ---

3) Determine the rejection region:

We needed to know the specifics of the interval structure that we were going to use for our test since we needed to know the degrees-of-freedom for determining our rejection and non-rejection region. In this example n = 4 and k = 0, so n - k - 1 = 4 - 0 - 1 = 3.

Reject if: P² 7.815

4 Compute the test statistic value:

= 1.676 + 1.575 + .788 + 1.676 = 5.715

5 Draw a conclusion: The data do not support the rejection of the null hypothesis that the data set is drawn from the standard normal distribution.

Normal distribution example taken from:www.som.clarkson.edu/~cmosier/simulation/Random_Numbers/Testing/Chi_Squared_gof/chi_gof.html

Degrees of freedom and P²

the degrees of freedom for the P² test is n-k-1

where n is the number of classes,

k is the number of restrictions in addition to the one imposed by the table total or the mean

the textbook shows it as n-r but you will always have to lose 1 degree of freedom because you will know the table total or mean so its an equivalent formula

in most situations k=0 so df=n-1

but when using P² for goodness of fit k may take on other values

in using the Poisson distribution k=1

for a normal distribution k=1 because of the mean and the variance

but if the data is already in standardized form as in the example k=0

if P² is being used as a test for independence in a contingency table df=(r-1)(c-1) where r is the number of rows and c is the number of columns