I am using a huge dataset to access the relationship of socio-economic and bio-geophysical factors with deforestation in a tropical forest. Most of the variables are non normally distributed. I was wondering if one can give some suggestion on the analysis that are currently used for this purpose.