xlsx>
This data set of size n = 17 contains observations from a small assembly plant. The predictors
are
x1=average monthly temperature of the plant (degrees Fahrenheit),
x2=amount of production (in pounds), and the response is
Y =the monthly water usage (in gallons).
(1) Perform the stepwise regression analysis by hand, that is, following Slides 20-30 in
Chapter 5.
In your stepwise regression, set
Alpha-to-Enter: 0.15 ; Alpha-to-Remove: 0.30
In each step, you need to clearly state if a variable has been entered or removed,
and explain
- why a variable has been entered/removed, or
- why no variable qualifies to enter or leave.
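The enter/remove rule stated above can be sketched in a few lines of Python. This is only an illustration under assumptions: the function names `entry_decision` and `removal_decision` are hypothetical, and the strict inequalities are one reasonable reading of the rule, not a statement of how MINITAB implements it.

```python
ALPHA_TO_ENTER = 0.15
ALPHA_TO_REMOVE = 0.30

def entry_decision(candidate_p_values, alpha_enter=ALPHA_TO_ENTER):
    """Return the candidate with the smallest p value if it is below
    alpha_enter; return None when no candidate qualifies to enter."""
    if not candidate_p_values:
        return None
    best = min(candidate_p_values, key=candidate_p_values.get)
    return best if candidate_p_values[best] < alpha_enter else None

def removal_decision(in_model_p_values, alpha_remove=ALPHA_TO_REMOVE):
    """Return the in-model variable with the largest p value if it is above
    alpha_remove; return None when every variable stays."""
    if not in_model_p_values:
        return None
    worst = max(in_model_p_values, key=in_model_p_values.get)
    return worst if in_model_p_values[worst] > alpha_remove else None

# p values of the five one-predictor models, read from the appendix tables
# (1.1)-(1.5); the assignment of tables to x1..x5 assumes they are in order.
step1_p = {"x1": 0.266, "x2": 0.007, "x3": 0.735, "x4": 0.099, "x5": 0.802}
print(entry_decision(step1_p))
```

Each step of the procedure then amounts to one call to `entry_decision` on the variables outside the model, followed by one call to `removal_decision` on the variables already in it.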
Step 1:
Based on (1.1), (1.2), (1.3), (1.4), (1.5),
x2 is chosen to enter the model, since
it has the highest F value, 9.91 (equivalently the largest |t| value, 3.15, and the lowest p value);
the p value is 0.007, less than the Alpha to Enter 0.15.
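The Step 1 comparison can be reproduced from the sums of squares in the appendix. The sketch below assumes that tables (1.1)-(1.5) correspond to x1-x5 in order (only (1.5) keeps its label in the appendix), and uses the fact that with a single predictor the ANOVA F statistic is the square of that predictor's t statistic.

```python
import math

RESID_DF = 15  # residual degrees of freedom in every one-predictor model

# (regression SS, residual error SS) from tables (1.1)-(1.5)
step1_tables = {
    "x1": (260702, 2931930),
    "x2": (1270172, 1922459),
    "x3": (25190, 3167442),
    "x4": (545213, 2647418),
    "x5": (13749, 3178883),
}

def f_statistic(ss_reg, ss_err, df_reg=1, df_err=RESID_DF):
    """Overall F = MS(regression) / MS(residual error)."""
    return (ss_reg / df_reg) / (ss_err / df_err)

f_values = {name: f_statistic(*ss) for name, ss in step1_tables.items()}
best = max(f_values, key=f_values.get)

for name, f in f_values.items():
    # With one predictor, |t| for the slope equals sqrt(F).
    print(f"{name}: F = {f:.2f}, |t| = {math.sqrt(f):.2f}")
print("largest F:", best)
```

The largest F belongs to x2, which agrees with the choice made in Step 1.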
Step 2:
Based on (2.1), (2.2), (2.3), (2.4),
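x4 is chosen to enter the model since
it adds the most to the model that already contains x2: its partial t value is -2.41 (partial F = 563098/97097 = 5.80), the largest in magnitude among the candidates;
the corresponding p value is about 0.03, less than the Alpha to Enter 0.15.
(The F value 9.44 and p value 0.003 in (2.3) test the two-predictor model as a whole, not x4 alone.)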
No variable is removed in this step, since
In the model with x2 and x4, x2 is still significant with a high t value 3.64,
which corresponds to a low p value of 0.003. The p value is less than the Alpha to
Remove 0.30, so x2 stays in the model.
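The Step 2 checks can be worked out from the appendix tables with a partial F statistic: the extra regression sum of squares from adding one variable, divided by the residual mean square of the larger model. The sketch below is an illustration only; the helper name `partial_f` is hypothetical, and the use of table (1.4) for x4 alone assumes the one-predictor tables are in order x1-x5.

```python
import math

SS_X2 = 1270172   # regression SS of Y on x2 alone, table (1.2)
SS_X4 = 545213    # regression SS of Y on x4 alone, table (1.4)
RESID_DF = 14     # residual degrees of freedom in every two-predictor model

# candidate: (regression SS, residual error SS) of the model {x2, candidate},
# from tables (2.1)-(2.4)
step2_tables = {
    "x1": (1559525, 1633106),
    "x3": (1348259, 1844373),
    "x4": (1833271, 1359361),
    "x5": (1270243, 1922389),
}

def partial_f(ss_reg_full, ss_reg_reduced, ss_err_full, df_err):
    """F for one added variable: extra regression SS over the full-model MSE."""
    return (ss_reg_full - ss_reg_reduced) / (ss_err_full / df_err)

# Entry check: what each candidate adds once x2 is in the model.
entry_f = {
    name: partial_f(reg, SS_X2, err, RESID_DF)
    for name, (reg, err) in step2_tables.items()
}
best = max(entry_f, key=entry_f.get)

# Removal check: what x2 contributes once x4 is in the model.
reg_24, err_24 = step2_tables["x4"]
removal_f_x2 = partial_f(reg_24, SS_X4, err_24, RESID_DF)

for name, f in entry_f.items():
    print(f"{name} given x2: partial F = {f:.2f}, |t| = {math.sqrt(f):.2f}")
print("best candidate:", best)
print(f"x2 given x4: |t| = {math.sqrt(removal_f_x2):.2f}")
```

The removal check reproduces the t value 3.64 quoted for x2, which supports the reading that each variable's own t (or partial F) statistic, and not the overall F of the model, drives the enter/remove decisions.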
(2) Suppose that you have performed the best subset regression analysis on the dataset, and
obtained the following outputs from MINITAB:
APPENDIX
(1.1) Regression of Y on X1
Analysis of Variance
Source DF SS MS F P
Regression 1 260702 260702 1.33 0.266
Residual Error 15 2931930 195462
Total 16 3192632
(1.2) Regression of Y on X2
Analysis of Variance
Source DF SS MS F P
Regression 1 1270172 1270172 9.91 0.007
Residual Error 15 1922459 128164
Total 16 3192632
(1.3) Regression of Y on X3
Analysis of Variance
Source DF SS MS F P
Regression 1 25190 25190 0.12 0.735
Residual Error 15 3167442 211163
Total 16 3192632
(1.4) Regression of Y on X4
Analysis of Variance
Source DF SS MS F P
Regression 1 545213 545213 3.09 0.099
Residual Error 15 2647418 176495
Total 16 3192632
(1.5) The regression equation is Y = 3352 - 1.07 X5
Analysis of Variance
Source DF SS MS F P
Regression 1 13749 13749 0.06 0.802
Residual Error 15 3178883 211926
Total 16 3192632
(2.1) Regression of Y on X2 and X1
Analysis of Variance
Source DF SS MS F P
Regression 2 1559525 779763 6.68 0.009
Residual Error 14 1633106 116650
Total 16 3192632
Source DF Seq SS
X2 1 1270172
X1 1 289353
(2.2) Regression of Y on X2 and X3
Analysis of Variance
Source DF SS MS F P
Regression 2 1348259 674129 5.12 0.021
Residual Error 14 1844373 131741
Total 16 3192632
Source DF Seq SS
X2 1 1270172
X3 1 78087
(2.3) The regression equation is Y = 4601 + 0.203 X2 - 21.6 X4
Analysis of Variance
Source DF SS MS F P
Regression 2 1833271 916635 9.44 0.003
Residual Error 14 1359361 97097
Total 16 3192632
Source DF Seq SS
X2 1 1270172
X4 1 563098
(2.4) Regression of Y on X2 and X5
Analysis of Variance
Source DF SS MS F P
Regression 2 1270243 635121 4.63 0.029
Residual Error 14 1922389 137313
Total 16 3192632
Source DF Seq SS
X2 1 1270172
X5 1 71
(3.1) Regression of Y on X2, X4 and X1
Analysis of Variance
Source DF SS MS F P
Regression 3 2017440 672480 7.44 0.004
Residual Error 13 1175191 90399
Total 16 3192632
Source DF Seq SS
X2 1 1270172
X4 1 563098
X1 1 184170
(3.2) The regression equation is Y = 6307 + 0.218 X2 - 23.5 X4 - 71.4 X3
Analysis of Variance
Source DF SS MS F P
Regression 3 2001031 667010 7.28 0.004
Residual Error 13 1191601 91662
Total 16 3192632
Source DF Seq SS
X2 1 1270172
X4 1 563098
X3 1 167760
(3.3) Regression of Y on X2, X4 and X5
Analysis of Variance
Source DF SS MS F P
Regression 3 1843427 614476 5.92 0.009
Residual Error 13 1349205 103785
Total 16 3192632
Source DF Seq SS
X2 1 1270172
X4 1 563098
X5 1 10156
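The three-predictor tables above allow the same entry check to be run for a third step, starting from the model with x2 and x4. The sketch below is an illustration under assumptions, not part of the original solution: the helpers `t_pdf` and `two_sided_p` are hypothetical names, and the p value is obtained by numerically integrating the t density with Simpson's rule so that only the standard library is needed.

```python
import math

ALPHA_TO_ENTER = 0.15
SS_X2_X4 = 1833271   # regression SS of the model {x2, x4}, table (2.3)
RESID_DF = 13        # residual degrees of freedom in every three-predictor model

# candidate: (regression SS, residual error SS) of the model {x2, x4, candidate}
step3_tables = {
    "x1": (2017440, 1175191),
    "x3": (2001031, 1191601),
    "x5": (1843427, 1349205),
}

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)

def two_sided_p(t, df, n=2000):
    """P(|T| > |t|), integrating the density on [0, |t|] by Simpson's rule
    (n must be even)."""
    t = abs(t)
    h = t / n
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 1 - 2 * (total * h / 3)

results = {}
for name, (reg, err) in step3_tables.items():
    f = (reg - SS_X2_X4) / (err / RESID_DF)      # partial F for the candidate
    results[name] = (f, two_sided_p(math.sqrt(f), RESID_DF))

best = min(results, key=lambda k: results[k][1])
for name, (f, p) in results.items():
    print(f"{name} given x2, x4: partial F = {f:.2f}, p = {p:.3f}")
print("enters:", best if results[best][1] < ALPHA_TO_ENTER else None)
```

Under these assumptions the strongest candidate is x1, with a p value between 0.15 and 0.20, so no variable meets the Alpha to Enter of 0.15 and the procedure would stop with x2 and x4 in the model.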