I submitted an article about a mediated moderation model to an international journal. I carried out the simple slopes graphics to explain the role of moderator in the relationship between X and Y. A referee suggested to add a simple slope test.
If I understand it correctly, you have a continuous response variable Y and a continuous predictor X and a dichotomous variable M. You want to find out whether M affects the regression of Y on X. Is that right? If not, please ignore.
Given that this is so, you first create a binary variable I which is =0 for the one category of M and =1 for the other category of M. You then also create an interaction term IX (I times X). You then fit a (multiple) regression of Y on I, X and IX, say Y=a+bI+cX+d(IX). This means that Y=a+cX for the I=0 category and Y=(a+b)+(c+d)X for the I=1 category. If b and d are both not significant, then one model will suffice for both categories, i.e. M does not affect the relationship between X and Y; if b is significant then you have different intercepts; if d is significant then you have different slopes.
If I understand it correctly, you have a continuous response variable Y and a continuous predictor X and a dichotomous variable M. You want to find out whether M affects the regression of Y on X. Is that right? If not, please ignore.
Given that this is so, you first create a binary variable I which is =0 for the one category of M and =1 for the other category of M. You then also create an interaction term IX (I times X). You then fit a (multiple) regression of Y on I, X and IX, say Y=a+bI+cX+d(IX). This means that Y=a+cX for the I=0 category and Y=(a+b)+(c+d)X for the I=1 category. If b and d are both not significant, then one model will suffice for both categories, i.e. M does not affect the relationship between X and Y; if b is significant then you have different intercepts; if d is significant then you have different slopes.
Another interesting paper: http://mlrv.ua.edu/2013/vol39_1/Robinson%20et%20al..pdf
It is easier than it first appeared. You have to run two independent regressions (one per category in the moderator) and then compute a t-test with the difference between slopes in the numerator and the pooled standard deviation from both groups in the denominator. I have learnt a very interesting point for my own research. Thanks!
Maybe I'm missing something here, but the suggestion Francois Steffens gave you seems to me complete and absolutely correct. What the need for simple slope test when you have one categorical variable (binary, to be worst)?? The interaction term significance is the test you need (I suppose you have test the significance of the interaction term estimate). You can interpret it as a change in slope of the predictor form group 0 to 1. As simple as that.
If you want to give them some formal test (F, df, eta and so on) you could perform a simple hierarchical regression (model comparison or whatever name do you want to gave it). First model, main effect model (so, y=b0 +b1*X + b2*M+error). Second model add the interaction ( y=b0 +b1*X + b2*M+b3*M*X+ error) end test for R^2 increase (performed automatically in canned software like SPSS and similar). You will have an F with 1 degree of freedom and an effect site (difference in R^2 of the two models). I think it is as good as it gets (all other analyses are completely redundant and not that precise).
I presented both delta R^2, that is significant, and the interaction term significance (significant), but referee explicitly requested simple slope tests, that I did not know.
I will follow the suggestion of Olga, because I think that is easier one sample - one model. I will perform this analysis and I will update you about the referee response.
Please check DeShon & Alexander (1996). This is my guess but I believe that the reviewers asked for the simple slope tests because of the heterogeneous errors in the two groups on moderator. This is one situation where the simple slope test has more power than moderated multiple regression (MMR). But, if MMR showed a significant interaction effect, you would get the same results. Maybe you can conduct the analysis to make sure you get the same results and explain why you would get the same results anyway based on this article: With a less powerful test, you get a significant interaction effect.