When performing non-linear correlation, I have been using AIC to perform a preliminary selection of what models could be a potential good fit for my correlations.
I was toying around with the idea of statistically removing outliers from the data based on robust non-linear regression (been using graphpad prism for this) for each model, independently. Essentially, a point could be an outlier in model X but not model Y so it would only be removed for model X, creating a new model (Xo)
However, once you remove the outlier and refit the data, the value of AIC is going to change both because the model "fits better" but also because the actual correlation data changed, making it impossible to distinguish both components and preventing comparison of the AIC of data where no outliers where removed.
Is there some sort of mathematical way that I could determine whether model X, Y or Xo is the best fit for my data?
Essentially, some of my data might be correlated with an unknown model, and some might not. Some of the data might have outliers that would disqualify one model, but I cannot be sure of that.
I know people are just going to say to plot the data and see, but I am trying to do a lot of correlations (over thousands) and I cannot plot every single graph, so having some sort of value that I could use as a first selection criteria prior to checking the residuals would be much appreciated!
Edit:
I have included a very crude drawing of what I am trying to understand. Model 1 works best if not outliers are removed, but if the one outlier is removed, the best fitting model changes completely (model2). How to show which of these 2 choices is the best fit?