My goal is to predict which depressed patients respond well to a specific treatment. Let's assume I have 2 groups, one has received the treatment ('active group'), the other not ('control group'). It's relatively easy to build a machine learning model which can predict symptom change over time, but how do I identify the factors which predict a positive response to the treatment? Just looking at symptom improvement is not enough, as depressed patients might improve over time without treatment. My guess is that I have to look into interaction features. Any ideas or suggested readings specifically for applying machine learning algorithms to experimental studies would be highly welcome.