I am new to statistics and currently trying to conducting data analysis for my pilot study and I am confused about the procedures. Overall, I have around 150 respondents. Attached there is a model that I am trying to analyse.
The affective variable is a latent factor that is presented in 4 dimensions based on the degree of extrinsic and intrinsic influence (28 items). It affects learning phenomenon that influences students' achievements in language learning. The phenomenon might have receptive and productive nature and therefore there were two scales (8 and 6 items respectively) to measure each. Finally, the achievements are represented by two separate lexical tests with two separate overall scores and one variable that is accounted to measure students' aspiration to practice their language (8 items). All items were assessed by respondents by using a 7-point Likert scale, except for the lexical tests which were measured by overall score.
All the instruments I have been using were validated before in a different context. However, since I have been conducting study in a new context, I have decided to conduct EFA to ensure validity and reliability of scales. For affective variable, in order to understand whether each of 4 different dimensions were represented separately, I have conducted EFA. It was successful and after deleting several factors that had cross-loadings on other dimensions, all 4 dimensions were represented separately with minimum of 3 variables per item.
Along with that, I have checked reliability and conducted EFA for the items that were used to measure productive and receptive nature of phenomenon to ensure that they do not have any cross loadings.
During the next step I used AMOS. I have inserted all items for 4 dimensions of affective variable, all items that represented receptive and productive items for measuring the learning phenomenon and one item that was used to measure one language learning outcome. For measurement model, Chi-square was 460.648, df=309 with probability level =.000., the data for model fit was the following CFI=.930, RMSEA=.058 with PCLOSE=.114, however AGFI=.784, GFI=.823 and NFI=.816 were low. All factor loadings of items were above .60. Being new to statistics, I believe that it might be due to a small sample size and a number of variables. So, now I am uncertain about my following actions.
I need to check my hypotheses by checking structural model, but I am hesitating about further procedures. The first option that I have: to get factor scores for all latent factors and then to conduct path analysis. The second option: to use Partial Least squares in PLSmart since it can deal with formative and not only reflective scales and to measure my structural model for all factors at once.
I apologise If I have described wrong procedures but I would really appreciate any suggestions and feedback. Sorry for a long text.