I have data on the number of counselling sessions attended by music students in higher education (counts) over a period of 15 years. I also have data on their gender (male/female), level (undergraduate/postgraduate), main instrument (strings; keyboard; wind, brass and percussion; singers) and nationality (UK/EU/Overseas). I don't have a specific hypothesis (the data already existed), so I want to explore whether the number of sessions attended is predicted by any of the other variables. Do I enter all of them at the same time in one regression analysis and report just the significant ones?

I already know that I can either use multiple regression analysis (as my dependent variable is a count), or use logistic regression analysis after applying a median split to the dependent variable, so that instead of counts I have a binary variable of "low number of sessions" versus "high number of sessions".
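To make the median split concrete, here is a minimal sketch of what I have in mind. The session counts are invented purely for illustration (they are not my data), `median_split` is a hypothetical helper name, and treating values equal to the median as "low" is my own assumption about how to handle ties.

```python
from statistics import median

# Hypothetical session counts, invented purely for illustration
sessions = [0, 1, 1, 2, 3, 3, 4, 6, 9, 15]

def median_split(counts):
    """Return the median and a 'low'/'high' label for each count.

    Assumption: counts equal to the median are labelled 'low'.
    """
    cutoff = median(counts)
    labels = ["high" if c > cutoff else "low" for c in counts]
    return cutoff, labels

cutoff, labels = median_split(sessions)
print("median:", cutoff)
print("labels:", labels)
```

One thing the sketch makes visible: after the split, a student with 4 sessions and a student with 15 sessions both end up as "high", so the binary variable discards the distinction between them.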

However, I'm not sure whether to look at all independent variables together, or separately. I don't have any hypothesis or model that I want to promote. I'm just exploring the data (although I am interested in whether the undergraduate/postgraduate variable might predict the number of sessions attended...)

Alternatively, as I am only using SPSS, I could perhaps investigate the association between counts (although I'd need to transform them into a categorical variable) and each of the independent variables through chi-square tests for association?
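For reference, this is roughly the calculation I mean by a chi-square test for association, sketched for a 2x2 table of level (undergraduate/postgraduate) against low/high number of sessions. The cell counts are made up for illustration, `chi_square_2x2` is a hypothetical helper, and the sketch computes the plain Pearson statistic without a continuity correction, so it may not match every figure SPSS reports.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square test of association for a 2x2 table.

    Returns (chi2, p). No continuity correction is applied.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    # Upper-tail probability of a chi-square variable with 1 degree of freedom
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p

# Hypothetical counts: rows = undergraduate, postgraduate; columns = low, high
table = [[30, 20],
         [15, 35]]

chi2, p = chi_square_2x2(table)
print(f"chi2 = {chi2:.3f}, p = {p:.4f}")
```

Each test of this kind looks at one independent variable at a time, so it would not adjust for the other variables in the way a single regression with all of them would.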

Thanks so much!
