I am having a difficult time modeling a binary dataset that my team has gathered. It is a dataset of different prescriptions per each patient and an outcome of a certain event.

The goal is to determine if certain prescriptions lead to decreased rates of alerts.

However, there are two main problems:

The data is fully binary without much overlap or interaction between columns

There is a lot of noise in the data, and the chance of the alert occurring is due to chance.

I have run the data through different algorithms to determine any trends or factors that may be important, but have not been very successful.

I was thinking of doing a simple bayesian analysis to determine if each setting has an impact on the outcome of the alerts, but would love to be able to involve many features if possible.

Is there anything I am missing or that I could try to determine the influence of different treatments?

More Christian Randolph Lynn's questions See All
Similar questions and discussions