Missing data is a common problem in data analysis. One of the successful approaches is k-Nearest Neighbor (kNN), a simple approach that leverages known information to impute unknown values with a relatively high degree of accuracy. (more…)
Missing data is a common problem in data analysis. One of the successful approaches is k-Nearest Neighbor (kNN), a simple approach that leverages known information to impute unknown values with a relatively high degree of accuracy. (more…)
As it has been said a picture is worth a thousand words and so it is with graphics too. A well constructed graph can summarize information collected from tens to hundreds or even thousands of data points. But not every graph has the same power to convey complex information clearly. (more…)
One component often overlooked in the ‘Define & Design’ phase of a study, is writing the analysis plan. The statistical analysis plan integrates a lot of information about the study including the research question, study design, variables and data used, and the type of statistical analysis that will be conducted.
Statistical contrasts are a tool for testing specific hypotheses and model effects, particularly comparing specific group means.
What are goodness of fit statistics? Is the definition the same for all types of statistical model? Do we run the same tests for all types of statistic model?
Before you can write a data analysis plan, you have to choose the best statistical test or model. You have to integrate a lot of information about your research question, your design, your variables, and the data itself.