Friday, 16 March 2018

Chi-Square Test for independence - Question 18 (A publisher is interested...)

Question 18
A publisher is interested in determining which of three book covers is most attractive. He interviews 400 people in each of three states (California, Illinois and New York), and asks each person which of the covers he or she prefers. The number of preferences for each cover is as follows:

Cover          California   Illinois   New York   Total
First Cover            81         60        182     323
Second Cover           78         93         95     266
Third Cover           241        247        123     611
Total                 400        400        400    1200
Table 1

Do these data indicate that there are regional differences in people's preferences concerning these covers? Use the 0.05 level of significance.


Solution Steps
As usual, we need to understand the problem and decide which test to carry out.

In this case, since the question asks whether there are regional differences in people's preferences, we need to test whether one categorical variable (cover preference) depends on another (state). So the appropriate test is the Chi-Square test for independence.
That is the hypothesis we are going to test.

Step 1: State the null and alternate hypothesis
H0: there are no regional differences in people's cover preferences (preference is independent of state)
Ha: there are regional differences in people's cover preferences (preference depends on state)

Since we are going to use Excel to simplify solving this problem, I have transferred the table to MS Excel. This is shown in Table 1. You can get the completed Excel sheet from here
 

Step 2: Calculate the Expected values
In this case the totals have already been calculated for us, so we can go ahead and calculate the expected values.
The expected value for each cell is calculated by multiplying the row total by the column total and dividing by the grand total.

For example, the expected value for the first cell (First Cover in California, observed count 81) is given by

E = (row total * column total) / grand total = (323 * 400) / 1200 = 107.667

I have calculated the rest of the values, and the results are given in Table 2.

Table 2
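If you would rather check the Excel figures in code, the expected values can be reproduced with a few lines of Python. This is only a sketch: the variable names (observed, expected and so on) are my own and are not part of the Excel sheet.

```python
# Observed counts from Table 1: rows are covers, columns are states
# (California, Illinois, New York).
observed = [
    [81, 60, 182],    # First Cover
    [78, 93, 95],     # Second Cover
    [241, 247, 123],  # Third Cover
]

row_totals = [sum(row) for row in observed]        # 323, 266, 611
col_totals = [sum(col) for col in zip(*observed)]  # 400, 400, 400
grand_total = sum(row_totals)                      # 1200

# Expected count for each cell = row total * column total / grand total
expected = [
    [r * c / grand_total for c in col_totals]
    for r in row_totals
]

for row in expected:
    print([round(e, 3) for e in row])
```

Because every state has the same column total (400), each row ends up with the same expected value in all three columns.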





Step 4: Calculate Squared Difference (O - E)^2
Here O is the observed value from Table 1 and E is the corresponding expected value from Table 2. For the first cell:
O = 81
E = 107.667

(O - E)^2 = (81 - 107.667)^2 = 711.11 (computed with the unrounded value of E)

I have calculated the rest of the values, and the results are shown in Table 3.


Table 3: Squared Differences
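The squared differences can be sketched in Python as well. The list names below are assumptions made for illustration; the expected value for each cell is recomputed inline from the row total, column total and grand total.

```python
# Observed counts from Table 1 (columns: California, Illinois, New York)
observed = [
    [81, 60, 182],    # First Cover
    [78, 93, 95],     # Second Cover
    [241, 247, 123],  # Third Cover
]
row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

# (O - E)^2 for every cell, with E = row total * column total / grand total
squared_diff = [
    [(o - r * c / grand_total) ** 2 for o, c in zip(row, col_totals)]
    for row, r in zip(observed, row_totals)
]

for row in squared_diff:
    print([round(v, 3) for v in row])
```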

Step 5: Divide by Expected Value
This is the squared difference calculated in Step 4 divided by the corresponding expected value. For the first cell it would be

(O - E)^2 / E = 711.11 / 107.667 = 6.605

I have done this for all the values, and the results are tabulated in Table 4.
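As a rough check on this step, the division can be wrapped in a small helper that works on any table of counts. The function name chi_square_terms is hypothetical; it simply applies (O - E)^2 / E cell by cell.

```python
def chi_square_terms(observed):
    """Return (O - E)^2 / E for every cell of a table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    terms = []
    for row, r in zip(observed, row_totals):
        terms_row = []
        for o, c in zip(row, col_totals):
            e = r * c / grand_total          # expected count for this cell
            terms_row.append((o - e) ** 2 / e)
        terms.append(terms_row)
    return terms


# Observed counts from Table 1 (columns: California, Illinois, New York)
observed = [
    [81, 60, 182],    # First Cover
    [78, 93, 95],     # Second Cover
    [241, 247, 123],  # Third Cover
]

terms = chi_square_terms(observed)
for row in terms:
    print([round(t, 3) for t in row])
```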


Step 6: Calculate the Test Statistic
This is simply the sum of all the values in the last table

= 6.605 + 21.103 + 51.32 + 1.283 + 0.212 + 0.452 + 6.843 + 9.22 + 31.95
= 128.988
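The addition is easy to get wrong by hand, so here is a short Python check that sums the nine rounded values listed above. It assumes nothing beyond those nine numbers.

```python
import math

# The nine (O - E)^2 / E values, rounded to three decimals, row by row
terms = [6.605, 21.103, 51.32, 1.283, 0.212, 0.452, 6.843, 9.22, 31.95]

# fsum avoids accumulating floating-point error while adding
chi_square = math.fsum(terms)
print(round(chi_square, 3))  # 128.988
```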

Step 7: Look up the critical Value from Chi-Square table
Get Statistical table from here
First we calculate the degrees of freedom
df = (rows - 1) * (columns - 1) = (3-1) * (3-1) = 4
alpha = 0.05

The critical value from the table of the Chi-Square distribution is written as

K0.05, 4 = 9.488
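If you do not have a statistical table to hand, the critical value can be approximated in plain Python. The sketch below relies on the fact that, for an even number of degrees of freedom, the upper-tail probability of the Chi-Square distribution has a simple closed form, and then finds the cut-off by bisection. Both function names are my own, and the approach does not work for odd df. If SciPy is installed, scipy.stats.chi2.ppf(0.95, 4) should give the same figure.

```python
import math


def chi2_sf_even_df(x, df):
    """Upper-tail probability P(X > x) for a chi-square variable with even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form only holds for even, positive df")
    half = x / 2.0
    series = sum(half ** i / math.factorial(i) for i in range(df // 2))
    return math.exp(-half) * series


def chi2_critical_even_df(alpha, df, upper=1000.0):
    """Find x with P(X > x) = alpha by bisection (even df only)."""
    lo, hi = 0.0, upper
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if chi2_sf_even_df(mid, df) > alpha:
            lo = mid   # tail still too large, move right
        else:
            hi = mid   # tail too small, move left
    return (lo + hi) / 2.0


df = (3 - 1) * (3 - 1)   # (rows - 1) * (columns - 1)
alpha = 0.05
critical = chi2_critical_even_df(alpha, df)
print(round(critical, 3))
```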

Step 8: State your conclusion
Since the calculated value of the test statistic is greater than the critical value, we reject the null hypothesis and conclude that there are regional differences in people's preferences concerning these covers.
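The decision rule can also be written out in a few lines of Python. The two hard-coded numbers are assumptions taken from recomputing the test from Table 1 (statistic about 128.988, critical value 9.488 for alpha = 0.05 and df = 4). As an optional cross-check, the tail probability of the statistic is computed with a closed form that is valid for df = 4 only.

```python
import math

alpha = 0.05
chi_square = 128.988   # test statistic recomputed from Table 1
critical = 9.488       # chi-square table value for alpha = 0.05, df = 4

# Decision rule: reject H0 when the statistic exceeds the critical value
reject_h0 = chi_square > critical

# Upper-tail probability of the statistic; this formula holds for df = 4 only
p_value = math.exp(-chi_square / 2) * (1 + chi_square / 2)

print("Reject H0" if reject_h0 else "Fail to reject H0")
print("p-value below alpha:", p_value < alpha)
```

Both checks point the same way: the observed preferences differ between the states by far more than chance alone would explain.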

The complete tables are shown below; you can also download them for free.