Hello there, I am conducting listening experiments for the perception of various of noises. The test persons can listen to a reference noise and compare it with 20 other noises using a unipolar likert scale (more annoying (-5) - similarly annoying (0) - less annoying (+5), in total of 11 scales). For the analysis I used box plots in order to find the outliers. The persons who had the highest numbers of outliers where removed from the analysis, raising the coefficient of determination from 0,9692 to 0,9734, which is a good sign. How can I furthermore decide what data are to be removed in order to improve my model? I also checked if the test persons used the whole range of the scales or how long they needed for the test. How can I check if those criteria matter on the consistensy of their answer? Should I remove all the answers of a test person which is overall suspicious or can I remove just the supsicious single ones and replace them?
Thank you in advance, and I am glad to hear your answers!