Multiple Linear Regression

The regression procedure is used when you are interested in describing the linear relationship between the independent variables and a dependent variable.

  1. Click Tools > Data Analysis.... If you don't have a Data Analysis option under Tools, see step #3 of "How do I get started with Excel?".
  2. In the window titled Data Analysis select Regression and click OK.
  3. A new window titled Regression should appear. This window has many options. Below is a brief explanation of each:

     


     

    Residuals ( Optional )


    Residuals are used to check the regression assumptions. Excel will output residual information when doing a regression analysis. The following is a short explanation about residual output.

    • Click Residuals to output the residual values. The residual value is simply the difference between the observed value and the fitted value.
    • Click Standardized Residuals to output the standardized residual values. Standardized residual values, unlike residual values, are invariant to the scale of measurement. Standardized residuals can be used to check for outliers in the data. If the standardized residual value is above 3 or below -3, the observation is a potential outlier.
    • Click Residual Plots to return the residual scatterplot(s). The scatterplot(s) allow us to visually check the regression assumptions. If the scatterplot(s) have any significant pattern, the regression assumptions are being violated.
    • Ignore Line Fit Plots when doing multiple regression.

       
    • Normal Probability Plots are used to check the normality assumption of the error term (or residuals). Click Normal Probability Plots to output the normal probability plot.
       


     

  4. Click OK. The test results and outputted scatterplots will be placed onto your spreadsheet.



Note:


An example, perform a regression analysis with Y (dependent variable) on X1 and X2 (the independent variables). Use = 0.05 for the analysis. Return the residuals, standardized residuals, and the normal probability plot. Place the output into cell $E$3.

 

Regression
graphic