# Excel Regression Analysis Output Explained

Probability and Statistics > Excel for statistics > Excel Regression Analysis Output Explained

Previous article: Excel 2013 Regression Analysis How To

Watch the video or read the steps below:

## Excel Regression Analysis Output Explained

In the previous article, I explained how to perform Excel regression analysis. After you’ve gone through the steps, Excel will spit out your results, which will look something like this:

## Excel Regression Analysis Output Explained: Multiple Regression

Here’s a breakdown of what each piece of information in the output means:

## EXCEL REGRESSION ANALYSIS OUTPUT PART ONE: REGRESSION STATISTICS

These are the “Goodness of Fit” measures. They tell you how well the calculated linear regression equation fits your data.

1. Multiple R. This is the correlation coefficient. It tells you how strong the linear relationship is. For example, a value of 1 means a perfect positive relationship and a value of zero means no relationship at all. It is the square root of r squared (see #2).
2. R squared. This is r2, the Coefficient of Determination. It tells you how many points fall on the regression line. for example, 80% means that 80% of the variation of y-values around the mean are explained by the x-values. In other words, 80% of the values fit the model.
3. Adjusted R square. The adjusted R-square adjusts for the number of terms in a model. You’ll want to use this instead of #2 if you have more than one x variable.
4. Standard Error of the regression: An estimate of the standard deviation of the error μ. This is not the same as the standard error in descriptive statistics! The standard error of the regression is the precision that the regression coefficient is measured; if the coefficient is large compared to the standard error, then the coefficient is probably different from 0.
5. Observations. Number of observations in the sample.

## EXCEL REGRESSION ANALYSIS OUTPUT EXPLAINED PART TWO: ANOVA

1. SS = Sum of Squares.
2. Regression MS = Regression SS / Regression degrees of freedom.
3. Residual MS = mean squared error (Residual SS / Residual degrees of freedom).
4. F: Overall F test for the null hypothesis.
5. Significance F: The significance associated P-Value.

The second part of output you get in Excel is rarely used, compared to the regression output above. It splits the sum of squares into individual components (see: Residual sum of squares), so it can be harder to use the statistics in any meaningful way. If you’re just doing basic linear regression (and have no desire to delve into individual components) then you can skip this section of the output.
For example, to calculate R2 from this table, you would use the following formula:
R2 = 1 – residual sum of squares (SS Residual) / Total sum of squares (SS Total).
In the above table, residual sum of squares = 0.0366 and the total sum of squares is 0.75, so:
R2 = 1 – 0.0366/0.75=0.9817

## EXCEL REGRESSION ANALYSIS PART THREE: INTERPRET REGRESSION COEFFICIENTS

This section of the table gives you very specific information about the components you chose to put into your data analysis. Therefore the first column (in this case, House / Square Feet) will say something different, according to what data you put into the worksheet. For example, it might say “height”, “income” or whatever variables you chose.

The columns are:

1. Coefficient: Gives you the least squares estimate.
2. Standard Error: the least squares estimate of the standard error.
3. T Statistic: The T Statistic for the null hypothesis vs. the alternate hypothesis.
4. P Value: Gives you the p-value for the hypothesis test.
5. Lower 95%: The lower boundary for the confidence interval.
6. Upper 95%: The upper boundary for the confidence interval.

The most useful part of this section is that it gives you the linear regression equation:
y = mx + b.
y = slope * x + intercept.
For the above table, the equation would be approximately:
y = 3.14 – 0.65X1 + 0.024X2.

Reference:: http://cameron.econ.ucdavis.edu/excel/ex61multipleregression.html

If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.

Excel Regression Analysis Output Explained was last modified: October 15th, 2017 by

# 37 thoughts on “Excel Regression Analysis Output Explained”

1. Anon

It also introduces additional errors, particularly;
“… and the total sum of squares is 1.6050, so:
R2 = 1 – 0.3950 – 1.6050 = 0.8025.”

“… and the total sum of squares is 2, so:
R2 = 1 – 0.3950 / 2 = 0.8025.”

2. Andale Post author

Thanks for spotting the error with the sum of squares. It’s now fixed.
I added credit to the article.

Regards,
S

3. Irfan

Hi stepahnie

I have more than 2 variables. my variable is 6. pls tell me how to calculate regresson eqution for more varaibles. I am in urgent need.

Thanks

Irfan

4. c

also, while i can’t see your original data, it appears there are two variables, so the equation should read:
y = 3.14 – 0.65 X1 + 0.024 X2
where X1 = House
and X2 = sqft

5. Hemali Bhimajiyani

What we interpret about the significance F while interpreting the regression output from Excel ??

6. Andale Post author

7. Hans Strasburger

Hi Stefanie,
in your video tutorial above you say
“The coefficient of determination tells you how many points, percentage wise, fall on the regression line.“
This is absolutely not the case!!! :-(
The number of points on the regression line is, in fact, *unrelated* to R². Not a single point can be on the regression line and still R² can be close to 1! Conversely, 99% of all points can be exactly on the line; with only one point far off the resulting R² will be very low.
R² is the percentage of explained variance, i.e. the percentage of variance of y that stems from the regression line. For a visualization, draw, for each data point, a vertical line to the regression line; also draw a horizontal line for the mean of y. For each vertical line, take the section between the horizontal line and the regression line. The sum of squares of these sections are the explained variance.
Cheers,
Hans
Another visualization is that

8. Andale Post author

Hi, Hans,
Thanks for your response. I was trying to word it for beginning statistics students who don’t have a clue what variance on a regression line means. In other words, in simple terms. I do agree that the wording as it is may be misleading. I think it would be better stated as “The coefficient of determination gives you an idea of how many points fall on the regression line.“ For example, if ALL the points WERE on the line, that would have a perfect coefficient of determination, right? And if the dots were scattered to the wind (with respect to the line), then there would be an insignificant CoD.

9. Sue

Very good information.
Told me everything I need to know about multiple regression analysis output.

Suggestion: Do you have any articles explained the t-test output or ANOVA output?

10. Pallavi

I am learning to use MLRA to study variation of wavelength upon some solvent parameters. But when I increase the number of independent variables there appears #NUM! in the in the F, Significance F and P value column. And also the predicted and experimental values remain the same giving R square value exactly equal to 1. I am not a statistics student and I am puzzled. If someone can help and mail me regarding this. I shall be highly obliged.
Regards
Pallavi

11. Andale Post author

Check your inputs. Something, somewhere on the worksheet (i.e. a non-numerical value) is causing that #NUM to appear.

I have 10 responses to be worked out from 5 input variables.
I have a database for 18 runs.
Pl tell me how to proceed for regression analysis.
Also I want to prepare mathematical equations for 10 output responses.

13. Mharge

Hi! How will I know if there is a significant difference? Like for instance, I got 0.402 as my significance F. What does it mean?

14. Ian

Below are the results for a 3rd order polynomial regression and a logarithmic regression using the same data:
Data (very small sample only to illustrate variable position in Excel)
X1 X2 X3 Y
14.8 26 36.7 20.8
14.5 26 116.4 22.998
Polynomial = Linest(y,x^{1,2,3,,true}) shift + ctrl + enter
2.635E-09 0.0561 -1.4218 25.584 (1st row of stats output)
Logarithmic = Linest(y, Ln(x),,true) shift + ctrl + enter
8.384 112.62 -28.17 -293.56
Can you assist me to obtain the relevant equations for the regression models please?
Ian

15. Andale Post author

Hi, Patrice,
There’s a lot of information in the output! Can you give me an idea of where you would like to see more of an explanation?

16. grace

pls i want to calculate for t-test,f-test and ordinary least square method, how will i do it in excel

17. Sonu

hi Stefanie,
I did analysis part wit help of youtube but not knowing how to give inference wit the result tat i got from regression analysis in excel.could u pls help me wat this result actually mean and how should be referred in analysis. pls tel me where should I send a snapshot of my result

18. Nancy

Hi Stephani,

I have a question about my accounting case study.
There is a question in my case study: explain the main feature of the regression analysis output and, making use of output, explain the relevance to cost estimation of each of the following: correlation coefficient, R square, standard error, 95% confidence limits and residuals.
As this is a first time that I solve a case study, I do not know how to explain this by words and from which perspective to explain this.

Nancy

19. Andale Post author

Hi Nancy,
That’s a huge question…you’re probably looking at a page or more for an answer. I don’t know what your teacher is looking for when they say “explain the main feature”. Perhaps the model? I’m afraid that cost estimation is a bit outside my knowledge area too. You may want to read correlation coefficient, R square, standard error, confidence level and residuals to see how they apply to your particular cost analysis.

20. Gaurav Vazirani

Hi,
Is the standard error same as the “standard deviation of the sampling distribution of the sampling mean”?

21. Andale Post author

Almost. The standard error of the mean is the standard deviation of the sampling distribution of the mean.

22. Anton AtoZ

Read thru your articles (ANOVA, F TEST, T TEST, P VALUE, REGRESSION, HYPHOTHESIS TEST, SOME MORE) in this web.

Nothing more I can say than thank you so much to clear up many questions in my head for analizyng my quality improvement project with excel.

You explained the definition, understanding, usage in the very short, simple yet clear way for common person like me.