The Analysis Factor Statwise Newsletter
Volume 1, Issue 1
July, 2008
In This Issue

A Note from Karen

Featured Article:
The 3 Stages of Mastering Statistical Analysis

Resource of the Month

What's New

About Us

 
Quick Links

Our Website

More About Us

You received this email because you subscribed to The Analysis Factor's list community. To change your subscription, see the link at end of this email. If your email is having trouble with the format, click here for a web version.

Please forward this to anyone you know who might benefit. If you received this from a friend, sign up for this ezine now!
A Note from Karen

Karen Grace-MartinDear %$firstname$%,

Wow, what a month. There I was, raring to go with fall workshops and lots of new projects, when WHAM! Just as the kids got back to school, I got the flu. Swine flu? Probably. They're not testing anymore. It really knocked me flat. I've been slowly recovering from that one, and just got a cold last week. But after sleeping for much of the weekend, I think I'm finally coming ot of it.

So, first, I appreciate the patience of everyone who participated in the Assumptions of the GLM workshop, which occurred while I still had the flu. I managed not to postpone it, but I certainly wasn't my usual energetic self. Already some of the participants have signed up for the upcoming workshop, so I'm taking that to mean it was still useful. The other consequence was I realized I needed to slow...down...a...bit. I moved back the second workshop of the semester: Running Regressions and ANOVAs in SPSS GLM.

It was supposed to start today, but instead will start on November 3rd. This is an important workshop, and I really wanted to make sure I gave it the time it needed to be really useful--it's a skill that anyone who uses SPSS will need over and over again. SPSS is on the surface easy to use, but there are a lot of options and defaults that can affect your analysis that the manuals do NOT explain well. Join us in this workshop if you use SPSS and I promise running your linear models will be much easier.

Happy analyzing,
Karen

Featured Article: 5 Steps for Calculating Sample Size

Like any applied skill, mastering statistical analysis requires:

1. building a body of knowledge

2. adeptness of the tools of the trade (aka software package)

3. practice applying the knowledge and using the tools in a realistic, meaningful context.

If you think of other high-level skills you've mastered in your life--teaching, survey design, programming, sailing, landscaping, anything--you'll realize the same three requrements apply.

These three requirements need to be developed over time--over many years to attain mastery. And they need to be developed together. Having more background knowledge improves understanding of how the tools work, and helps the practice go better. Likewise, practice in a real context (not perfect textbook examples) makes the knowledge make more sense, and improves skills with the tools.

I don't know if this is true of other applied skills, but from what I've seen over many years of working with researchers as they master statistical analysis, the journey seems to have 3 stages. Within each stage, developing all 3 requirements--knowledge, tools, and experience--to a level of mastery sets you up well for the next stage.

Knowing what stage you're in can help you figure out where to put your energy, time, and resources to progress forward.

Stage 1. Mastering the Basic Approach

At this stage, knowledge-building focuses on the basic concepts and vocabulary--hypothesis tests and sampling--up through basic multiple regression with continuous predictors and simple factorial ANOVAs. It usually requires 2-3 statistics classes to master the knowlege in this stage.

Mastery of software usually includes a good working ability to enter and manipulate data, and run descriptive and inferential statistics, as listed above. At this stage, most researchers use a menu-based software program, like SPSS, Minitab, or JMP, but could include software with steeper learning curves, like SAS, Stata, or R.

To master this basic level, a researcher needs experience with running the data analysis for a few research projects--an honor's or master's thesis is usually the first, and many dissertations give a really solid foundation at this level.

Stage 2: Mastering Linear Models

Exactly what this stage entails will depend on your field and the specific type of research you do. But usually the focus is on statistical modeling. The beauty of statistical models (they are beautiful, no?) is they all have the same core structure. There is always a response variable, a set of predictors, an estimate of the nature of their relationship, and a residual. The details vary, but if you can master one basic type of modeling, any other is a step or two away.

So wheras the first stage took you up through basic regression and ANOVA, this stage is about mastering the entirety of linear modeling. Topics will include dummy variables, interactions, polynomial effects, random effects, model building, model fit, etc. To truly master this stage means a thorough understanding of how ANOVA and regression fit together into the General Linear Model, and to be able to fluently move from one to the other.

It will also include other methods that are used in your field. These could include structural equation modeling, multivariate techniques, survival analysis, or complex survey techniqes, among others.

In software, the same programs I mentioned in stage one work well here. But they need to be approached with a higher level of skill--SPSS users should use syntax as well as menus. The methods used in the software will, of course, be more sophisticated, and you should have not just a working knowledge, but real understanding of the program's defaults, vocabulary, and what each bit of output means.

In this stage, for a number of reasons I've written about, I often recommend that you master one, and become conversant in a second statistical software package. You want another option there in your back pocket when you need it (and you will need it).

Most researchers move well into this stage with their dissertation. While they learn much, most don't master it with that single project. To really master linear modeling requires experience with different data sets, models, and research questions, and it can take years to gain experience with a variety of models.

It's not uncommon for even seasoned researchers with strong quantitative skills to have knowledge gaps in this area. It's hard not to unless you've worked on many dozens of models.

Even 10 years ago, most researchers could stop here. But with the enormous capacity of computing power has come the availability of increasingly sophisticated statistical techniqes. These techniques account for issues that we previously had to gloss over with the GLM. Because sophisticated techniques now have widespread availability, journal editors and grant issuers no longer let you get away with glossing anything over.

Stage 3: Beyond Linear Models

The knowlege base in stage 3 includes truly sophisticated statistical methodology, such as generalized linear models for categorical and discrete response variables, multilevel models, generalized linear mixed models, modern techniques for missing data, robust regression models, nonlinear models, among many, many others.

I've said it before and I'll say it again--do everything you can to master linear models before moving on to these techniques. Many are extensions of the general linear model, so if you're still struggling with interpreting interactions in a linear model, it will be doubly hard to interpret interactions that involve odds ratios.

At this point your old faithful software package may fail you. No statistical software package can do everything, and this is why you want an extra one in your repertoire. Stata, R (or SPlus) and SAS are all quite comprehensive, and SPSS is one step behind. It has made impressive inroads into high-level techniques in recent years, but still cannot do all that the others do. JMP and Minitab just aren't contenders at this level.

The other thing to remember as you emerge into this stage is you can't master all of it. No one can. Things branch out widely at this point, and you just can't learn all of it. But you don't need to either. You may need to master two or three, but hopefully not all at once. And if you can confidently implement linear models, you are in an excellent position to take on any of its extensions.

Resource of the Month

Interpreting Regression Coefficients Ebook. A thorough, intuitive understanding of regression output at your fingertips, whenever you need it. This ebook contains the transcript, slides, examples, and instructions for interpreting coefficients of intercepts, correlated, centered, and polynomial predictors; standardized coefficients, dummy- and effect-coded variables, and interactions between two continuous predictors or between a continuous covariate and a binary or multicategory dummy variable. If you need to understand regression coefficients better, check it out.

What's New

1. Free Webinar at 1pm on October 28th: Running Repeated Measures as a Mixed Model.

There are two ways to run repeated measures data--a multivariate approach in which each repeated response is a unique variable, and a univariate, mixed-model approach in which all responses are a single variable, and the effects of the respeated measurements are directly estimated. This webinar will give you an overview of both approaches, with examples.

2. Online workshop at 12pm on November 3, 10 & 17: Learn how to run regressions and ANOVAs in SPSS GLM accurately and efficiently

The beauty of SPSS GLM is its versatility--it can run any number of linear models. But the dark side of versatility is dozens of options that may or may not apply to your analysis. This workshop will clear all that up. We will work through two examples, and as we go, we will discuss the relevancy, meaning, and how to use each of the buttons and options in SPSS GLM, both in the menus and in syntax.

About Us

What is The Analysis Factor? The Analysis Factor is the difference between knowing about statistics and knowing how to use statistics. It acknowledges that statistical analysis is an applied skill. It requires learning how to use statistical tools within the context of a researcher’s own data, and supports that learning.

The Analysis Factor, the organization, offers statistical consulting, projects, resources, and learning programs that empower social science researchers to become confident, able, and skilled statistical practitioners. Our aim is to make your journey acquiring the applied skills of statistical analysis easier and more pleasant.

Karen Grace-Martin, the founder, spent seven years as a statistical consultant at Cornell University. While there, she learned that being a great statistical advisor is not only about having excellent statistical skills, but about understanding the pressures and issues researchers face, about fabulous customer service, and about communicating technical ideas at a level each client understands. 

You can learn more about Karen Grace-Martin and The Analysis Factor at analysisfactor.com.

Please forward this newsletter to colleagues who you think would find it useful. Your recommendation is how we grow.

If you received this email from a friend or colleague, click here to subscribe to this newsletter.

Need to change your email address? See below for details.

No longer wish to receive this newsletter? See below to cancel.