Question 11.4: Test of Normality Use chi-square to determine if the variabl......

Test of Normality

Use chi-square to determine if the variable shown in the frequency distribution is normally distributed. Use α=0.05\alpha=0.05.

Boundaries Frequency 89.5104.524104.5119.562119.5134.572134.5149.526149.5164.512164.5179.54Total =200\begin{array}{|r|r|} \hline \text{Boundaries }& \text{Frequency }\\ \hline 89.5-104.5 & 24 \\ 104.5-119.5 & 62 \\ 119.5-134.5 & 72 \\ 134.5-149.5 & 26 \\ 149.5-164.5 & 12 \\ 164.5-179.5 &4 \\ & \text{Total }=\overline{200}\\ \hline \end{array}

Step-by-Step
The 'Blue Check Mark' means that this solution was answered by an expert.
Learn more on how do we answer questions.

H0H_{0} : The variable is normally distributed.

H1H_{1} : The variable is not normally distributed.

First find the mean and standard deviation of the variable. (Note: ss is used to approximate σ\sigma.)

 Boundaries fXmfXmfXm289.5104.524972,328225,816104.5119.5621126,944777,728119.5134.5721279,1441,161,288134.5149.5261423,692524,264149.5164.5121571,884295,788164.5179.54172688118,33620024,6803,103,220\begin{array}{|r|c|r|r|r|} \hline \textbf{ Boundaries } & \boldsymbol{f} & \boldsymbol{X}_{\boldsymbol{m}} & \boldsymbol{f} \cdot \boldsymbol{X}_{\boldsymbol{m}} & \boldsymbol{f} \cdot \boldsymbol{X}_{\boldsymbol{m}}^{\mathbf{2}} \\ \hline 89.5-104.5 & 24 & 97 & 2,328 & 225,816 \\ 104.5-119.5 & 62 & 112 & 6,944 & 777,728 \\ 119.5-134.5 & 72 & 127 & 9,144 & 1,161,288 \\ 134.5-149.5 & 26 & 142 & 3,692 & 524,264 \\ 149.5-164.5 & 12 & 157 & 1,884 & 295,788 \\ 164.5-179.5 & 4 & 172 & 688 & 118,336 \\ &\overline{200}&&\overline{24,680} &\overline{3,103,220} \\ \hline \end{array}

Xˉ=24,680200=123.4s=200(3,103,220)24,6802200(199)=290=17.03\begin{aligned}\bar{X} & =\frac{24,680}{200}=123.4 \\s & =\sqrt{\frac{200(3,103,220)-24,680^{2}}{200(199)}}=\sqrt{290}=17.03\end{aligned}

Next find the area under the standard normal distribution, using zz values and Table E for each class.
The zz score for x=104.5x=104.5 is found as

z=104.5123.417.03=1.11z=\frac{104.5-123.4}{17.03}=-1.11

The area for z<1.11z<-1.11 is 0.13350.1335.

The zz score for 119.5119.5 is found as

z=119.5123.417.03=0.23z=\frac{119.5-123.4}{17.03}=-0.23

The area for 1.11<z<0.23-1.11<z<-0.23 is 0.40900.1335=0.27550.4090-0.1335=0.2755.

The zz score for 134.5134.5 is found as

z=134.5123.417.03=0.65z=\frac{134.5-123.4}{17.03}=0.65

The area for 0.23<z<0.65-0.23<z<0.65 is 0.74220.4090=0.33320.7422-0.4090=0.3332.

The zz score for 149.5149.5 is found as

z=149.5123.417.03=1.53z=\frac{149.5-123.4}{17.03}=1.53

The area for 0.65<z<1.530.65<z<1.53 is 0.93700.7422=0.19480.9370-0.7422=0.1948.

The zz score for 164.5164.5 is found as

z=164.5123.417.03=2.41z=\frac{164.5-123.4}{17.03}=2.41

The area for 1.53<z<2.411.53<z<2.41 is 0.99200.9370=0.05500.9920-0.9370=0.0550.

The area for z>2.41z>2.41 is 1.00000.9920=0.00801.0000-0.9920=0.0080.

Find the expected frequencies for each class by multiplying the area by 200 . The expected frequencies are found by

0.1335200=26.70.2755200=55.10.3332200=66.640.1948200=38.960.0550200=11.00.0080200=1.6\begin{aligned}& 0.1335 \cdot 200=26.7 \\& 0.2755 \cdot 200=55.1 \\& 0.3332 \cdot 200=66.64 \\& 0.1948 \cdot 200=38.96 \\& 0.0550 \cdot 200=11.0 \\& 0.0080 \cdot 200=1.6\end{aligned}

Note: Since the expected frequency for the last category is less than 5, it can be combined with the previous category.

The table looks like this.

O2462722616E26.755.166.6438.9612.6\begin{array}{|l|l|l|l|l|l|} \hline \mathbf{O} & 24 & 62 & 72 & 26 & 16 \\ \hline \boldsymbol{E} & 26.7& 55.1 & 66.64& 38.96 &12.6 \\ \hline \end{array}

Finally, find the chi-square test value using the formula χ2=(OE)2E\chi^{2}=\sum \frac{(O-E)^{2}}{E}.

χ2=(2426.7)226.7+(6255.1)255.1+(7266.64)266.64+(2638.96)238.96+(1612.6)212.6=6.797\begin{aligned}\chi^{2}= & \frac{(24-26.7)^{2}}{26.7}+\frac{(62-55.1)^{2}}{55.1}+\frac{(72-66.64)^{2}}{66.64}+\frac{(26-38.96)^{2}}{38.96} \\& +\frac{(16-12.6)^{2}}{12.6} \\= & 6.797\end{aligned}

The critical value in this test has the degrees of freedom equal to the number of categories minus 3 since 1 degree of freedom is lost for each parameter that is estimated. In this case, the mean and standard deviation have been estimated, so 2 additional degrees of freedom are needed.

The C.V. with d.f. =53=2=5-3=2 and α=0.05\alpha=0.05 is 5.9915.991, so the null hypothesis is rejected. Hence, the distribution can be considered not normally distributed. Note: At α=0.01\alpha=0.01, the C.V. =9.210=9.210 and the null hypothesis would not be rejected. Hence, we could consider that the variable is normally distributed at α=0.01\alpha=0.01. So it is important to decide which level of significance you want to use prior to conducting the test.

Related Answered Questions