I have two datasets in two files: a. survey data from wave 1 in 2010 2 b. survey data from wave 2 in 2015. The survey questions are unchanged. The two samples are not matched though are from the same population. All answers are in 'yes/no' format. The samples are fairly large (over 10k per wave with a participation rate of over 75% for each wave).
Some survey questions ask about whether participants provide certain evidence based practices, for example CBT.
My questions are:
a. I would like to test whether the proportion of participants providing CBT increased significantly from wave 1 to wave 2 or if there is no difference.
b. Assuming there is a difference, I would also like to see if there are other surveyed factors associated with increase in provision of evidence based practice, for example is there a significantly larger increase in proportion of participants providing CBT at public sites vs private sites.
I use SPSS ver 23.
1. How would I merge the two data files correctly without matching case identifiers?
2. What test can I use on the combined data to answer each of the questions above?