Survival data models provide interpretation of data representing the time until an event occurs. In many situations, the event is death, but it can also represent the time to other bad events such as cancer relapse or failure of a medical device. It can also be used to denote time to positive events such as pregnancy. Often patients are lost to follow-up prior to death, but you can still use the information about them while they were in your study to better estimate the survival probability over time.
This is done using the Kaplan-Meier curve, an approach developed by (more…)
By Manolo Romero Escobar
If you already know the principles of general linear modeling (GLM) you are on the right path to understand Structural Equation Modeling (SEM).
As you could see from my previous post, SEM offers the flexibility of adding paths between predictors in a way that would take you several GLM models and still leave you with unanswered questions.
It also helps you use latent variables (as you will see in future posts).
GLM is just one of the pieces of the puzzle to fit SEM to your data. You also need to have an understanding of:
(more…)
By Manolo Romero Escobar
What is a latent variable?
“The many, as we say, are seen but not known, and the ideas are known but not seen” (Plato, The Republic)
My favourite image to explain the relationship between latent and observed variables comes from the “Myth of the Cave” from Plato’s The Republic. In this myth a group of people are constrained to face a wall. The only things they see are shadows of objects that pass in front of a fire (more…)
In many fields, the only way to measure a construct of interest is to have someone produce ratings:
- radiologists’ ratings of disease presence or absence on an X-ray
- researchers rate the amount of bullying occurring in an observed classroom
- coders sort qualitative responses into different response categories
It’s well established in research that multiple raters need to rate the same stimuli to ensure ratings are accurate. There are a number of ways to measure the agreement among raters using measures of reliability. These differ depending on a host of details, including: the number of raters; whether ratings are nominal, ordinal, or numerical; and whether one rating can be considered a “Gold Standard.”
In this webinar, we will discuss these and other issues in measures of inter and intra rater reliability, the many variations of the Kappa statistic, and Intraclass correlations.
Note: This training is an exclusive benefit to members of the Statistically Speaking Membership Program and part of the Stat’s Amore Trainings Series. Each Stat’s Amore Training is approximately 90 minutes long.
About the Instructor
Audrey Schnell is a statistical consultant and trainer at The Analysis Factor.
Audrey first realized her love for research and, in particular, data analysis in a career move from clinical psychology to research in dementia. As the field of genetic epidemiology and statistical genetics blossomed, Audrey moved into this emerging field and analyzed data on a wide variety of common diseases believed to have a strong genetic component including hypertension, diabetes and psychiatric disorders. She helped develop software to analyze genetic data and taught classes in the US and Europe.
Audrey has worked for Case Western Reserve University, Cedars-Sinai, University of California at San Francisco and Johns Hopkins. Audrey has a Master’s Degree in Clinical Psychology and a Ph.D. in Epidemiology and Biostatistics.
Not a Member Yet?
It’s never too early to set yourself up for successful analysis with support and training from expert statisticians.
Just head over and sign up for Statistically Speaking.
You'll get access to this training webinar, 130+ other stats trainings, a pathway to work through the trainings that you need — plus the expert guidance you need to build statistical skill with live Q&A sessions and an ask-a-mentor forum.
This webinar, presented by Yasamin Miller, will cover broadly survey design and planning.
It will outline the advantages and disadvantages of the various data collection modes, types of samples available to target your population, how to obtain a representative sample, and how to avoid the pitfalls of bad questionnaire design.
Note: This training is an exclusive benefit to members of the Statistically Speaking Membership Program and part of the Stat’s Amore Trainings Series. Each Stat’s Amore Training is approximately 90 minutes long.
(more…)
by Manolo Romero Escobar
General Linear Model (GLM) is a tool used to understand and analyse linear relationships among variables. It is an umbrella term for many techniques that are taught in most statistics courses: ANOVA, multiple regression, etc.
In its simplest form it describes the relationship between two variables, “y” (dependent variable, outcome, and so on) and “x” (independent variable, predictor, etc). These variables could be both categorical (how many?), both continuous (how much?) or one of each.
Moreover, there can be more than one variable on each side of the relationship. One convention is to use capital letters to refer to multiple variables. Thus Y would mean multiple dependent variables and X would mean multiple independent variables. The most known equation that represents a GLM is: (more…)