I am in a small disagreement with my statistician and would appreciate some additional perspective. This statistical question is regarding a cohort of participants with cancer. I am interested in investigating whether participants who have changed a behaviour illustrate different sociodemographic/treatment associations, compared to those participants who have not illustrated a behaviour change. To do this, I have created a datapoint from two survey questions I collected.
>Thus, for Data point 3, if a difference does not exist (i.e. 'no'), then a participant has consistently reported 'no' across both Data point 1 and 2, or alternatively, 'yes' across both Data point 1 and 2. If a difference does exist ('yes'), then the participant has at least 1 'yes' and 1 'no', across Data point 1 and 2, regardless of order.
I have created this (data point 3) as I specifically wish to compare these groups (participants who illustrate behaviour change vs. those that do not) against sociodemographic and treatment variables (e.g. Age =65). I believe these populations may differ based on other literature.
My statistician believes that this requires a McNemar test and that this is paired data. I disagree as I feel I am not comparing paired data, i.e. not directly comparing how many men say yes vs no between 2 timepoints.
Instead, my belief is that I have created a new data point (datapoint 3) from the original paired data (data point 1 and 2). Data point 3 is itself a new item which stands alone, and is thus not paired. Ergo, in this instance, when I compare in a 2x2 (e.g. behaviour change status vs age 65) a chi-square test is fine.
Your thoughts? Thank you in advance for reading, and any advice/perspective. Apologies for the length of my query.