Regression models

What’s in a Name? Moderation and Interaction, Independent and Predictor Variables

April 14th, 2014 by

One of the most confusing things about statistical analysis is the different vocabulary used for the same, or nearly-but-not-quite-the-same, concepts.

stage 1

Sometimes this happens just because the same analysis was developed separately within different fields and named twice.

So people in different fields use different terms for the same statistical concept.  Try to collaborate with a colleague in a different field and you may find yourself awed by the crazy statistics they’re insisting on.

Other times, there is a level of detail that is implied by one term that isn’t true of the wider, more generic term.  This level of detail is often about how the role of variables or effects affects the interpretation of output. (more…)


Anatomy of a Normal Probability Plot

April 7th, 2014 by

Stage 2A normal probability plot is extremely useful for testing normality assumptions.  It’s more precise than a histogram, which can’t pick up subtle deviations, and doesn’t suffer from too much or too little power, as do tests of normality.

There are two versions of normal probability plots: Q-Q and P-P.  I’ll start with the Q-Q.   (more…)


Member Training: Multicollinearity

March 1st, 2014 by

Multicollinearity isn’t an assumption of regression models; it’s a data issue.

And while it can be seriously problematic, more often it’s just a nuisance.

In this webinar, we’ll discuss:

  • What multicollinearity is and isn’t
  • What it does to your model and estimates
  • How to detect it
  • What to do about it, depending on how serious it is

Note: This training is an exclusive benefit to members of the Statistically Speaking Membership Program and part of the Stat’s Amore Trainings Series. Each Stat’s Amore Training is approximately 90 minutes long.

(more…)


Five Common Relationships Among Three Variables in a Statistical Model

February 7th, 2014 by

In a statistical model–any statistical model–there is generally one way that a predictor X and a response Y can relate:Stage 2

This relationship can take on different forms, of course, like a line or a curve, but there’s really only one relationship here to measure.

Usually the point is to model the predictive ability, the effect, of X on Y.

In other words, there is a clear response variable*, although not necessarily a causal relationship. We could have switched the direction of the arrow to indicate that Y predicts X or used a two-headed arrow to show a correlation, with no direction, but that’s a whole other story.

For our purposes, Y is the response variable and X the predictor.

But a third variable–another predictor–can relate to X and Y in a number of different ways.  How this predictor relates to X and Y changes how we interpret the relationship between X and Y. (more…)


Member Training: Discrete Time Event History Analysis

February 1st, 2014 by

What is the relationship between predictors and whether and when an event will occur?

This is what event history (a.k.a., survival) analysis tests.

There are many flavors of Event History Analysis, though, depending on how time is measured, whether events can repeat, etc.

In this webinar, we discussed many of the issues involved in measuring time, including censoring, and introduce one specific type of event history model: the logistic model for discrete time events.


Note: This training is an exclusive benefit to members of the Statistically Speaking Membership Program and part of the Stat’s Amore Trainings Series. Each Stat’s Amore Training is approximately 90 minutes long.

(more…)


Member Training: Interactions in ANOVA and Regression Models, Part 2

January 1st, 2014 by

In this follow-up to December’s webinar, we’ll finish up our discussion of interactions.

There is something about interactions that is incredibly confusing.

An interaction between two predictor variables means that one predictor variable affects a  third variable differently at different values of the other predictor.

How you understand that interaction depends on many things, including:

  • Whether one, or both, of the predictor variables is categorical or numerical
  • How each of those variables is coded (specifically, whether each categorical variable is dummy or effect coded and whether numerical variables are centered)
  • Whether it’s a two-way or three-way interaction
  • Whether there is a directionality to the interaction (moderation) or not

Sometimes you need to get pretty sophisticated in your coding, in the output you ask for, and in writing out regression equations.

In this webinar, we’ll examine how to put together and break apart output to understand what your interaction is telling you.


Note: This training is an exclusive benefit to members of the Statistically Speaking Membership Program and part of the Stat’s Amore Trainings Series. Each Stat’s Amore Training is approximately 90 minutes long.

Not a Member? Join!

About the Instructor

Karen Grace-Martin helps statistics practitioners gain an intuitive understanding of how statistics is applied to real data in research studies.

She has guided and trained researchers through their statistical analysis for over 15 years as a statistical consultant at Cornell University and through The Analysis Factor. She has master’s degrees in both applied statistics and social psychology and is an expert in SPSS and SAS.

Not a Member Yet?
It’s never too early to set yourself up for successful analysis with support and training from expert statisticians.

Just head over and sign up for Statistically Speaking.

You'll get access to this training webinar, 130+ other stats trainings, a pathway to work through the trainings that you need — plus the expert guidance you need to build statistical skill with live Q&A sessions and an ask-a-mentor forum.