Test of Normality
Use chi-square to determine if the variable shown in the frequency distribution is normally distributed. Use α=0.05.
Boundaries 89.5−104.5104.5−119.5119.5−134.5134.5−149.5149.5−164.5164.5−179.5Frequency 24627226124Total =200
H0 : The variable is normally distributed.
H1 : The variable is not normally distributed.
First find the mean and standard deviation of the variable. (Note: s is used to approximate σ.)
Boundaries 89.5−104.5104.5−119.5119.5−134.5134.5−149.5149.5−164.5164.5−179.5f24627226124200Xm97112127142157172f⋅Xm2,3286,9449,1443,6921,88468824,680f⋅Xm2225,816777,7281,161,288524,264295,788118,3363,103,220
Xˉs=20024,680=123.4=200(199)200(3,103,220)−24,6802=290=17.03
Next find the area under the standard normal distribution, using z values and Table E for each class.
The z score for x=104.5 is found as
z=17.03104.5−123.4=−1.11
The area for z<−1.11 is 0.1335.
The z score for 119.5 is found as
z=17.03119.5−123.4=−0.23
The area for −1.11<z<−0.23 is 0.4090−0.1335=0.2755.
The z score for 134.5 is found as
z=17.03134.5−123.4=0.65
The area for −0.23<z<0.65 is 0.7422−0.4090=0.3332.
The z score for 149.5 is found as
z=17.03149.5−123.4=1.53
The area for 0.65<z<1.53 is 0.9370−0.7422=0.1948.
The z score for 164.5 is found as
z=17.03164.5−123.4=2.41
The area for 1.53<z<2.41 is 0.9920−0.9370=0.0550.
The area for z>2.41 is 1.0000−0.9920=0.0080.
Find the expected frequencies for each class by multiplying the area by 200 . The expected frequencies are found by
0.1335⋅200=26.70.2755⋅200=55.10.3332⋅200=66.640.1948⋅200=38.960.0550⋅200=11.00.0080⋅200=1.6
Note: Since the expected frequency for the last category is less than 5, it can be combined with the previous category.
The table looks like this.
OE2426.76255.17266.642638.961612.6
Finally, find the chi-square test value using the formula χ2=∑E(O−E)2.
χ2==26.7(24−26.7)2+55.1(62−55.1)2+66.64(72−66.64)2+38.96(26−38.96)2+12.6(16−12.6)26.797
The critical value in this test has the degrees of freedom equal to the number of categories minus 3 since 1 degree of freedom is lost for each parameter that is estimated. In this case, the mean and standard deviation have been estimated, so 2 additional degrees of freedom are needed.
The C.V. with d.f. =5−3=2 and α=0.05 is 5.991, so the null hypothesis is rejected. Hence, the distribution can be considered not normally distributed. Note: At α=0.01, the C.V. =9.210 and the null hypothesis would not be rejected. Hence, we could consider that the variable is normally distributed at α=0.01. So it is important to decide which level of significance you want to use prior to conducting the test.