guest contributer

Structural Equation Modeling: What is a Latent Variable?

March 7th, 2016 by guest contributer

By Manolo Romero Escobar

What is a latent variable?

“The many, as we say, are seen but not known, and the ideas are known but not seen” (Plato, The Republic)

My favourite image to explain the relationship between latent and observed variables comes from the “Myth of the Cave” from Plato’s The Republic. In this myth a group of people are constrained to face a wall. The only things they see are shadows of objects that pass in front of a fire (more…)

6 comments

Member Training: Inter-Rater Reliability

March 1st, 2016 by guest contributer

In many fields, the only way to measure a construct of interest is to have someone produce ratings:

radiologists’ ratings of disease presence or absence on an X-ray
researchers rate the amount of bullying occurring in an observed classroom
coders sort qualitative responses into different response categories

It’s well established in research that multiple raters need to rate the same stimuli to ensure ratings are accurate. There are a number of ways to measure the agreement among raters using measures of reliability. These differ depending on a host of details, including: the number of raters; whether ratings are nominal, ordinal, or numerical; and whether one rating can be considered a “Gold Standard.”

In this webinar, we will discuss these and other issues in measures of inter and intra rater reliability, the many variations of the Kappa statistic, and Intraclass correlations.

Note: This training is an exclusive benefit to members of the Statistically Speaking Membership Program and part of the Stat’s Amore Trainings Series. Each Stat’s Amore Training is approximately 90 minutes long.

Not a Member? Join!

About the Instructor

Audrey Schnell is a statistical consultant and trainer at The Analysis Factor.

Audrey first realized her love for research and, in particular, data analysis in a career move from clinical psychology to research in dementia. As the field of genetic epidemiology and statistical genetics blossomed, Audrey moved into this emerging field and analyzed data on a wide variety of common diseases believed to have a strong genetic component including hypertension, diabetes and psychiatric disorders. She helped develop software to analyze genetic data and taught classes in the US and Europe.

Audrey has worked for Case Western Reserve University, Cedars-Sinai, University of California at San Francisco and Johns Hopkins. Audrey has a Master’s Degree in Clinical Psychology and a Ph.D. in Epidemiology and Biostatistics.

Not a Member Yet?

It’s never too early to set yourself up for successful analysis with support and training from expert statisticians.

Just head over and sign up for Statistically Speaking.

You'll get access to this training webinar, 130+ other stats trainings, a pathway to work through the trainings that you need — plus the expert guidance you need to build statistical skill with live Q&A sessions and an ask-a-mentor forum.

No comments yet

Member Training: (Just About) Everything You Need To Know Before Starting a Survey

February 1st, 2016 by guest contributer

This webinar, presented by Yasamin Miller, will cover broadly survey design and planning.

It will outline the advantages and disadvantages of the various data collection modes, types of samples available to target your population, how to obtain a representative sample, and how to avoid the pitfalls of bad questionnaire design.

(more…)

No comments yet

Why do I need to have knowledge of multiple regression to understand SEM?

January 27th, 2016 by guest contributer

by Manolo Romero Escobar

General Linear Model (GLM) is a tool used to understand and analyse linear relationships among variables. It is an umbrella term for many techniques that are taught in most statistics courses: ANOVA, multiple regression, etc.

In its simplest form it describes the relationship between two variables, “y” (dependent variable, outcome, and so on) and “x” (independent variable, predictor, etc). These variables could be both categorical (how many?), both continuous (how much?) or one of each.

Moreover, there can be more than one variable on each side of the relationship. One convention is to use capital letters to refer to multiple variables. Thus Y would mean multiple dependent variables and X would mean multiple independent variables. The most known equation that represents a GLM is: (more…)

2 comments

Member Training: A Gentle Introduction to Propensity Score Adjustments and Analysis

December 1st, 2015 by guest contributer

So you can’t randomize people into THAT condition? Now what?

Let’s say you’re investigating the impact of smoking on social outcomes like depression, poverty, or quality of life. Your IRB, with good reason, won’t allow random assignment of smoking status to your participants.

But how can you begin to overcome the self selected nature of smoking among the study participants? What if self-selection is driving differences in outcomes? Well, one way is to use propensity score matching and analysis as a framework for your investigation.

The propensity score is the probability of group assignment conditional on observed baseline characteristics. In this way, the propensity score is a balancing score: conditional on the propensity score, the distribution of observed baseline covariates will be similar between treated and untreated subjects.

In this webinar, we’ll describe broadly what this method is and discuss different matching methods that can be used to create balanced samples of “treated” and “non-treated” participants. Finally, we’ll discuss some specific software resources that can be found to perform these analyses.

(more…)

No comments yet

Member Training: Mixture Models in Longitudinal Data Analysis

November 2nd, 2015 by guest contributer

This webinar will present the steps to apply a type of latent class analysis on longitudinal data commonly known as growth mixture model (GMM). This family of models is a natural extension of the latent variable model. GMM combines longitudinal data analysis and Latent Class Analysis to extract the probabilities of each case to belong to latent trajectories with different model parameters. A brief (not exhaustive) list of steps to prepare, analyze and interpret GMM will be presented. A published case will be described to exemplify an application of GMM and its complexity.

Finally, an alternative approach to GMM will be presented where the longitudinal model approach is linear mixed effects (also known as hierarchical linear model or multilevel modeling). The idea is the same as in GMM using growth curve modeling, mainly that the latent class membership specifies specific unobserved trajectories. These models are equivalent to GMM and are sometimes referred to heterogeneous linear mixed effects, underlining the idea that the sample may not belong to one single homogeneous population, but potentially to a mixture of distributions.

(more…)

No comments yet