residuals

Assumptions of Linear Models are about Errors, not the Response Variable

March 19th, 2024 by Karen Grace-Martin

I recently received a great question in a comment about whether the assumptions of normality, constant variance, and independence in linear models are about the errors, ε_i, or the response variable, Y_i.

The asker had a situation where Y, the response, was not normally distributed, but the residuals were.

Quick Answer: It’s just the errors.
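The asker's situation is easy to reproduce. The sketch below is only an illustration with simulated data, not anything from the original question: the binary predictor `x`, the coefficients, and the seed are all assumptions. It builds a response that is clearly bimodal (so not normal) even though the errors are normal, then compares excess kurtosis for Y and for the residuals.

```python
import numpy as np

# Hypothetical simulated data: a binary predictor with a large effect.
rng = np.random.default_rng(0)
n = 2000
x = rng.integers(0, 2, size=n)            # group indicator, 0 or 1
errors = rng.normal(0.0, 1.0, size=n)     # normally distributed errors
y = 1.0 + 10.0 * x + errors               # Y is a mix of two far-apart normals

# Ordinary least squares fit with an intercept.
X = np.column_stack([np.ones(n), x])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
residuals = y - X @ beta

def excess_kurtosis(v):
    """Sample excess kurtosis; close to 0 for normally distributed data."""
    z = (v - v.mean()) / v.std()
    return float(np.mean(z ** 4) - 3.0)

kurt_y = excess_kurtosis(y)               # strongly negative: Y is bimodal
kurt_resid = excess_kurtosis(residuals)   # near zero: residuals look normal

print(f"excess kurtosis of Y:         {kurt_y:.2f}")
print(f"excess kurtosis of residuals: {kurt_resid:.2f}")
```

A histogram of `y` would show two separate humps, while a histogram of `residuals` would show a single bell shape, which is the pattern the asker described.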

In fact, if you look at any (good) statistics textbook on linear models, you’ll see the assumptions stated just below the model: (more…)


One of the Many Advantages to Running Confirmatory Factor Analysis with a Structural Equation Model

February 23rd, 2020 by Jeff Meyer

Based on questions I’ve been asked by clients, most analysts prefer using the factor analysis procedures in their general statistical software to run a confirmatory factor analysis.

While this can work in some situations, you’re losing out on some key information you’d get from a structural equation model. This article highlights one of these.

(more…)


Same Statistical Models, Different (and Confusing) Output Terms

January 7th, 2020 by Jeff Meyer

Learning how to analyze data can be frustrating at times. Why do statistical software companies have to add to our confusion?

I do not have a good answer to that question. What I will do is show examples. In upcoming blog posts, I will explain what each output term means and how it is used in a model.

We will focus on ANOVA and linear regression models using SPSS and Stata software. As you will see, the biggest differences are not across software, but across procedures in the same software.

(more…)


Member Training: The Anatomy of an ANOVA Table

December 31st, 2019 by Jeff Meyer

Our analysis of linear regression focuses on parameter estimates, t statistics, p-values and confidence intervals. Rarely in regression do we see a discussion of the sums of squares, mean squares, and F statistic given in the ANOVA table above the coefficients and p-values.

And yet, they tell you a lot about your model and your data. Understanding the parts of the table and what they tell you is important for anyone running any regression or ANOVA model.
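As a rough illustration of what sits in that table, the sketch below rebuilds its usual pieces for a simple regression on simulated data. The data, seed, and variable names are assumptions made for the example, and the layout follows the common model/residual/total split rather than any one package's output.

```python
import numpy as np

# Hypothetical simulated data for a one-predictor regression.
rng = np.random.default_rng(2)
n = 100
x = rng.normal(size=n)
y = 3.0 + 1.5 * x + rng.normal(size=n)

# Ordinary least squares fit with an intercept.
X = np.column_stack([np.ones(n), x])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
fitted = X @ beta

# Sums of squares: total variation splits into model and residual parts.
ss_total = np.sum((y - y.mean()) ** 2)
ss_model = np.sum((fitted - y.mean()) ** 2)
ss_resid = np.sum((y - fitted) ** 2)

# Degrees of freedom and mean squares.
df_model = X.shape[1] - 1        # predictors, excluding the intercept
df_resid = n - X.shape[1]        # observations minus estimated parameters
ms_model = ss_model / df_model
ms_resid = ss_resid / df_resid

# The F statistic compares explained to unexplained variation.
f_stat = ms_model / ms_resid
r_squared = ss_model / ss_total

print(f"Model     SS={ss_model:9.2f}  df={df_model:3d}  MS={ms_model:8.2f}  F={f_stat:.2f}")
print(f"Residual  SS={ss_resid:9.2f}  df={df_resid:3d}  MS={ms_resid:8.2f}")
print(f"Total     SS={ss_total:9.2f}  df={n - 1:3d}  R-squared={r_squared:.3f}")
```

The residual mean square here is the estimate of the error variance, which is one reason the table says something about the data as well as the model.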

(more…)


Member Training: The Multi-Faceted World of Residuals

July 1st, 2017 by Karen Grace-Martin

Most analysts’ primary focus with residuals is checking distributional assumptions. The errors that the residuals estimate must be independent and identically distributed (i.i.d.) with a mean of zero and constant variance.

Residuals can also give us insight into the quality of our models.

In this webinar, we’ll review and compare what residuals are in linear regression, ANOVA, and generalized linear models. Jeff will cover:

Which residuals — standardized, studentized, Pearson, deviance, etc. — we use and why
How to determine if distributional assumptions have been met
How to use graphs to discover issues like non-linearity, omitted variables, and heteroskedasticity
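To make the first item in the list above concrete, here is a minimal sketch of one residual type for a linear regression: raw residuals rescaled by their estimated standard deviation using each point's leverage. It is not taken from the webinar; the simulated data and names are assumptions, and software packages differ on whether they call this version "standardized" or "internally studentized".

```python
import numpy as np

# Hypothetical simulated data for a one-predictor regression.
rng = np.random.default_rng(1)
n = 200
x = rng.uniform(0, 10, size=n)
y = 2.0 + 0.5 * x + rng.normal(0, 1, size=n)

# Ordinary least squares fit with an intercept.
X = np.column_stack([np.ones(n), x])
p = X.shape[1]
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
raw = y - X @ beta                       # raw residuals

# Leverage: diagonal of the hat matrix X (X'X)^-1 X'.
XtX_inv = np.linalg.inv(X.T @ X)
leverage = np.einsum("ij,jk,ik->i", X, XtX_inv, X)

# Residual variance estimate, then rescale each residual by its own
# estimated standard deviation, sqrt(s^2 * (1 - h_ii)).
s2 = raw @ raw / (n - p)
studentized = raw / np.sqrt(s2 * (1.0 - leverage))

print(f"sum of leverages: {leverage.sum():.3f} (equals the number of parameters)")
print(f"largest |rescaled residual|: {np.abs(studentized).max():.2f}")
```

Plotting `studentized` against the fitted values is one common graph for spotting non-linearity or non-constant variance.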

Knowing how to piece this information together will improve your statistical modeling skills.

Note: This training is an exclusive benefit to members of the Statistically Speaking Membership Program and part of the Stat’s Amore Trainings Series. Each Stat’s Amore Training is approximately 90 minutes long.

(more…)


Incorporating Graphs in Regression Diagnostics with Stata

May 24th, 2016 by Jeff Meyer

You put a lot of work into preparing and cleaning your data. Running the model is the moment of excitement.

You look at your tables and interpret the results. But first you remember that one or more variables had a few outliers. Did these outliers impact your results? (more…)
