05 November 2019 7 5K Report

I am currently building multi-variable linear regression model for water consumption prediction.

Y is monthly household water consumption. I have demographics data for each household for my predictor variable Xi. They are:

  • individual age ,
  • gender,
  • race,
  • marital status,
  • building age,
  • No. of rooms.

I have separated the total Number of people into three categories:

  • Number of adult,
  • Number of children,
  • Number of elderly.

then I build the model:

consumption of household∼Number of adult+Number of children+Number of elderly+Number of female+Number of unmarried+building age+No. of rooms (ignored the coefficients).

My confusion is that the Number of adult + Number of elderly + Number of children = Total Number of People. Should I include Number of female and Number of unmarried into the model also? but if not including them, the model will lose those information right? Please help me to clarify those doubts. Great appreciation in advance!

More Yang Kewen's questions See All
Similar questions and discussions