I have trained a machine learning model to predict the outcome of a therapy based on some pretreatment information. Outcome is based on a questionnaire that is administered both before (pre) the treatment started and after (post) it ended.
As seen in other similar paper and challenges, I used as outcome of the machine learning model the post-pre change score. It should be also of primary importance to clinicians. Then I reconstruct the post score = observed pre score + predicted post-pre change score.
If I calculate the (nested cross-validated, outer loop) r-squared with the classical formula (not the square of the correlation coefficient) I obtain r-squared post score < r-squared post-pre change score. Nothing strange as to calculate them, the numerator is the same but the denominator is different, with the variance of the post score smaller than the variance of the post-pre change score.
However, I’m quite confused about which of the two, either the r-squared post score or r-squared post-pre change score, is the most useful metric to judge how good my model would be in practice.
Thank you,
Massi