9. MLR Hypothesis Testing
SLR Review
Recall our earlier F-tests for anova T tests with respect to correlation and our parameters
Where
MLR
Overall Test
where df numerator =
(remember
RR =
Addition of a Group of Variables
R = reduced , C = complete
RR =
Example Comparing Means
Bilirubin is formed in the liver, where haemoglobin and other haemoproteines are decomposed into bile pigments. Bilirubin is partly reabsorbed by the intestine, and returns to the liver. If the liver has suffered degeneration, if the decomposition of haemoglobin is elevated, or if the gall bladder has been destroyed, bilirubin can accumulate to high levels in the blood, leading to jaundice. Blood samples were taken from three young men at one-week intervals, and the concentration of bilirubin in the serum was measured. The measured concentrations are shown in the following Table.
A=c(14,20,23,27,27);
B=c(20,27,32,34,34);
C=c(32,41,41,55,55);
cbind(mean(A), mean(B),mean(C));
## [,1] [,2] [,3]
## [1,] 22.2 29.4 44.8
cbind(var(A), var(B), var(C));
## [,1] [,2] [,3]
## [1,] 29.7 35.8 100.2
Is there sufficient evidence to indicate a difference in mean bilirubin concentration for the
three young men? Fit appropriate linear model(s) to the data and test at the 0.05
level of significance. Provide conclusion in the context of this problem.
Complete model:
where
Reduced Model:
Hypothesis Test
our
Example 2 Testing Means
Rhree brands of batteries are under study. It is suspected that the average life (in
weeks) of the three brands is different. Five batteries of each brand are tested with the
following results.
One Two Three
Sample means 95.2 79.4 100.4
One Two Three
Sample variances 11.2 14.8 20.8
Answer each of the following questions by fitting an appropriate multiple linear regression
model and assuming that the errors are independent and Normally distributed with mean
0 and variance σ2. You have to show all your work to get full credit. Simplify final answers and round to 4 decimal places where appropriate.
i) Find
and numerical value.
Target:
For these types of questions
CI
For
steps are the same but
Estimator
General Testing of Parameters
Example Individual Variables
In a small-scale study of the relation between degree of brand liking (Y ) and moisture content (X1) and sweetness (X2) of the product, the following results were obtained (data are coded):
X1 X2 Y
1 4 2 64
2 4 4 73
3 4 2 61
4 4 4 76
5 6 2 72
6 6 4 80
7 6 2 71
8 6 4 83
9 8 2 83
10 8 4 89
11 8 2 86
12 8 4 93
13 10 2 88
14 10 4 95
15 10 2 94
16 10 4 100
A first-order model for mean brand liking,
modA=lm(y~x1 + x2);
summary(modA);
Estimate Std. Error t value
(Intercept) 37.650 2.9961032 12.566323
x1 4.425 0.3011197 14.695153
x2 4.375 0.6733241 6.497614
modA=lm(y~x1 + x2);
anova(modA);
Df Sum Sq Mean Sq F value
x1 1 1566.45 1566.45 215.947
x2 1 306.25 306.25 42.219
Residuals 13 94.30 7.25
i) Is moisture content
Test stat