12 March 2017 6 1K Report

I am working on a project that involves a population-based data set, to study the impact of a clinical variable on the mortality of patients with certain cancer. In all similar studies, age, which is normally a continuous variable, was categorized into "age groups" according to each study's methodological reasons. I have also reviewed a lot of studies that show that "discretization" of continuous variables when creating predictive models is better than using them as continuous variables because it decreases noise and thus increases the accuracy of prediction. I was told by one of my senior co-authors that I should redo the analysis after going back and using "age" as a continuous variable. Was anybody confronted with a similar issue in his research? If so, what is the most solid evidence that you've got to defend categorizing age, or any other continuous variable in clinical or epidemiological data? 

More Amr Ebied's questions See All
Similar questions and discussions