10 October 2019 6 9K Report

Hi everyone,

I have some issues to determine how to pick correctly the right distance calculation method in R.

I have two datasets : one containing habitat variables (worldclim data, so including negative values) and another one containing values for several phenotypic traits. For each dataset I would like to create a distance matrix in order to perform some Mantel tests between the two matrices (phenotype vs habitat). However, there are so many distance methods that I am a bit lost on which one should be picked. In addition I have some zeros in the habitat distance matrix (replicates which have the same abiotic data) which might restrict the use of some methods.

For phenotype information I created a distance matrix in R using several methods : bray-Curtis, Euclidean, Manhattan, Canberra, Minkowski, maximum --> it doesn't change the conclusion of the Mantel test whatever matrix I use. However, for the habitat matrix results vary. Since there are negative values I guess it is an issue for the Bray-Curtis when not transforming the data (R gives me a warning about misleading calculated values). From what I read, Euclidean distance seems to be used quite a lot for abiotic data. Is there any other method which would be more appropriate ?

If anyone could help me I would be very grateful !

Thanks,

Lola

More Lola Toomey's questions See All
Similar questions and discussions