Hi all,
I hope anyone can help me and save my life!
I have been trying to gather as much information as possible to decide which analysis method I should use for my collected data, but I can’t make a decision.
In particular, I am struggling to decide whether I should see my data as non-normal or normal, then I can decided which statistical method I can adopt.
For my thesis, I am currently working on analyses with the collected primary data (n=110) based on a cross-sectional, non-probability and self-reported survey for a theory-testing purpose.
All constructs are reflective measures and multi-item scales (between 3 to 6 items) across 2 IVs, 2 DVs, 2 moderators. .
1 IV is a higher-order construct (3 dimensions), and another IV is unidimensional (5 items).
All of them are seven points Likert scale that is previously developed and validated in other papers
Positive sides:
After the preliminary analysis, reliability (Cronbach’s a >.7), correlation, and regression analyses (OLS) show a good sign and are mostly in line with the conceptual and theoretical models from the role model papers.
Negative sides:
· The results of EFA (to check the unidimensionality) and CFA analysis (to test the model) were horrible. The EFA results were messy as per image below. Many of them are cross-loaded or loaded on unmeaningful components.
· KMO is >.850, suggesting that a sample size is not an issue?
· Basically, many OLS assumptions are violated because of leptokurtic distribution, heteroscedastic, and non-random sampling.
· Normality tests (e.g. Shapiro–Wilk test ) says all of them are nonnormally distributed. Besides, across all constructs, the Skewness and Kurtosis values are around -2 and +6.5, respectively. However, although I am aware of the threshold line, +- 2 for BOTH skewness and Kurtosis, I can chose to follow the Hair’s (2010) guideline, which suggests that +- 2 for skewness and +- 7 for Kurtosis for data to be considered as “normal”.
· Lastly, Common method bias is checked through Harman's single factor test (34% of total variance)
Based on this information, I have questions.
(1) Which regression model should I use? (OLS, PLS and so on). Please provide me with some justifications of choice (e.g. Sample size).
(2) Then, which statistical tool do you recommend? ( I am currently using SPSS and AMOS, but I can use R, SmartPLS or MPlus if needed.
(3) Should I consider my data as normal or non-normal?
Thank you so much for your help in advance!!
Ted