I have a data set, shown in the attached image, on peak expiratory flow rate (PEFR) at five locations in the city (Johnsganj, Ashoka, Katra, Alopi and Rambagh) with varying levels of pollution. The locations are coded as dummy variables. PEFR at each location was measured with a peak flow meter, and individual health parameters such as BP, height and weight were recorded.
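To make the arrangement concrete, here is a minimal sketch of the layout I have in mind: one row per surveyed individual, with the location expanded into 0/1 dummy columns. The column names (`height_cm`, `weight_kg`, `pefr`), the choice of Johnsganj as the reference category and the two records are all placeholders for illustration, not values from my survey.

```python
# Sketch of a cross-sectional layout: one row per individual,
# location encoded as 0/1 dummies with one reference category left out.
LOCATIONS = ["Johnsganj", "Ashoka", "Katra", "Alopi", "Rambagh"]
REFERENCE = "Johnsganj"  # assumed baseline; any location could play this role


def location_dummies(location):
    """Return 0/1 indicators for every location except the reference."""
    if location not in LOCATIONS:
        raise ValueError(f"unknown location: {location}")
    return {
        f"loc_{name}": int(name == location)
        for name in LOCATIONS
        if name != REFERENCE
    }


# Made-up placeholder records, NOT survey data.
records = [
    {"id": 1, "location": "Katra", "height_cm": 165.0, "weight_kg": 60.0, "pefr": 400.0},
    {"id": 2, "location": "Johnsganj", "height_cm": 172.0, "weight_kg": 68.0, "pefr": 450.0},
]

# Replace the text label with its dummy columns, keeping the other fields.
rows = []
for record in records:
    row = {key: value for key, value in record.items() if key != "location"}
    row.update(location_dummies(record["location"]))
    rows.append(row)

for row in rows:
    print(row)
```

With five locations only four dummy columns are kept. An individual from the reference location has all four set to 0, which avoids perfect collinearity with the intercept if the columns are later used in a regression.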

My questions are:

1. Is this the right arrangement of data for cross-sectional data analysis?

2. If not, how should the data be arranged?

3. What preliminary steps are needed to make the data ready for cross-sectional data analysis?

4. How do I fit the best regression model for this data set?
