I would like to compare the performances of 9 algorithms (those implemented into the biomod2 package of R) with two different databases. More in detail, I'm using a national forest inventory (approximately 7000 points) that I group in presence/absence using two methods:

1) Selection of the forest category (e.g. presence if = 'Beech', absence if not = 'Beech'

2) Selection of the points where the relative contribution of the species, in terms of basal area (BA), is higher than 1% (i.e. the point is a presence if = BA of Beech >= 1%, absence if less)

What I would like to do, repeating this procedure for three testing species, is to asses if and which method is superior (1 or 2), for which studied species and, in case, which algorithm is more suitable for the situation (e.g. RF with method 1 and species 1, ANN with method 2 and species 3 and so on..). The index of goodness of fit is TSS (True Skill Statistic), not AUC, calculated using a cross validation procedure with a random 75%-25% partition and repeated 50 times for each algorithm.

Any suggestions (possibly in R)? ANOVA? Linear Mixed Models? 

Many thanks, maurizio

Similar questions and discussions