OptinMon 30 - Four Critical Steps in Building Linear Regression Models

Same Statistical Models, Different (and Confusing) Output Terms

January 7th, 2020 by

Learning how to analyze data can be frustrating at times. Why do statistical software companies have to add to our confusion?Stage 2

I do not have a good answer to that question. What I will do is show examples. In upcoming blog posts, I will explain what each output means and how they are used in a model.

We will focus on ANOVA and linear regression models using SPSS and Stata software. As you will see, the biggest differences are not across software, but across procedures in the same software.

(more…)


What is Multicollinearity? A Visual Description

November 20th, 2019 by

Multicollinearity is one of those terms in statistics that is often defined in one of two ways:

1. Very mathematical terms that make no sense — I mean, what is a linear combination anyway?

2. Completely oversimplified in order to avoid the mathematical terms — it’s a high correlation, right?

So what is it really? In English?

(more…)


Recoding a Variable from a Survey Question to Use in a Statistical Model

March 18th, 2019 by

Survey questions are often structured without regard for ease of use within a statistical model.Stage 2

Take for example a survey done by the Centers for Disease Control (CDC) regarding child births in the U.S. One of the variables in the data set is “interval since last pregnancy”. Here is a histogram of the results.

(more…)


A Strategy for Converting a Continuous to a Categorical Predictor

February 18th, 2019 by

At times it is necessary to convert a continuous predictor into a categorical predictor.  For example, income per household is shown below.Stage 2

This data is censored, all family income above $155,000 is stated as $155,000. A further explanation about censored and truncated data can be found here. It would be incorrect to use this variable as a continuous predictor due to its censoring.

(more…)


A Useful Graph for Interpreting Interactions between Continuous Variables

February 11th, 2019 by

What’s a good method for interpreting the results of a model with two continuous predictors and their interaction?Stage 2

Let’s start by looking at a model without an interaction.  In the model below, we regress a subject’s hip size on their weight and height. Height and weight are centered at their means.

(more…)


Descriptives Before Model Building

January 28th, 2019 by

Stage 2One approach to model building is to use all predictors that make theoretical sense in the first model. For example, a first model for determining birth weight could include mother’s age, education, marital status, race, weight gain during pregnancy and gestation period.

The main effects of this model show that a mother’s education level and marital status are insignificant.
(more…)