3. Linear Models
"All models are wrong but some are useful" - George Box
Notation
Using
If not invertible no unique solution
Compare to knn bias-variance and Validation
TSS:=
Basically the training error if we used
A small
Proof
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.938889 0.311908 9.422 <2e-16 ***
TV 0.045765 0.001395 32.809 <2e-16 ***
radio 0.188530 0.008611 21.893 <2e-16 ***
newspaper -0.001037 0.005871 -0.177 0.86
Coefficients: information about
(Intercept): information about ˆb1, the intercept
TV: information the coefficient of predictor TV
Estimate: value of
t-value: value of t-test statistic. Pr(> |t|): p-value of test for t-test. This is a test for
H0 :
Residual Output Summary
Residuals:
Residual standard error:
on 196 degrees of freedom
Multiple
F-statistic:
Key Definitions
- Residuals:
-
Residual standard error:
Scaled square root of training error (formula above). -
Multiple
:
Proportion of variance explained by the model.
F-Test
Tests the null hypothesis:
Interpretation: none of the predictors have a linear effect on the outcome.
further information in Regression Analysis
Interactions
no longer linear