Regression AnalysisRegression analysis “fits” or derives a model that describes the variation of a response (“dependent”) variable as a function of one or more predictor (or “independent”) variables. The general regression model is one of several that share the same basic conceptual model data = systematic component + irregular component where the
systematic component is predictable or explainable by the predictor
variables, and is represented by the regression model, while the irregular
component is regarded as “noise” or prediction errors The specific bivarite, linear, regression model is where
There are several alternative ways of writing the regression equation or model ·
true model, no error: ·
true model, with error: ·
true model, no subscripts: ·
true model, with error: ·
estimated model ·
estimated model Other variables and quantities The are a number of other quantities that are important in the analysis, including: ·
the “fitted” or predicted values of the
response variable · the residuals or prediction errors · the sums of squared deviations and their cross products · and the residual sum of squares Fitting the regression equation (i.e. estimating parameters) The
regression equation is “fitted” by choosing the values of The
specific values of The
“goodness of fit” of the regression equation, or a measure of the strength of
the relationship between Where
An F-statistic that can be used to test the null hypothsis that the relationship between is not significant is The
denominator of this expression, Another
measure of the strength of the relationship between the response and
predictor variable is the “explained variance” (a proportion, but sometimes
expressed as a percentage), also known as the “coefficient of determination,”
or Significance of the regression coefficients There are a number of other quantities that are useful in interpreting a regression equation. These include standard errors for the slope and intercept Using these standard errors, t-statistics that can be used to test hypotheses about the regression coefficients can be constructed where
|