When removing trials whose RT falls more than 3 SDs from the mean in a within-subject design, should the mean and SD be computed over the whole sample, per participant, per condition, or per condition within each participant? Are there any principles for choosing among these?
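To make the four options concrete, here is a minimal Python sketch of what I mean. The field names `subject`, `condition`, and `rt`, and the function `trim_3sd`, are made up for illustration; the only thing that changes between the options is which fields define the group over which the mean and SD are computed. It is not meant as a recommendation of any one option.

```python
from collections import defaultdict
from statistics import mean, stdev


def trim_3sd(trials, keys=(), k=3.0):
    """Keep trials whose RT lies within k SDs of the mean of their group.

    trials: list of dicts with hypothetical fields 'subject', 'condition', 'rt'.
    keys:   fields that define the group used for the mean and SD:
            ()                       -> whole sample
            ('subject',)             -> each participant
            ('condition',)           -> each condition
            ('subject', 'condition') -> each condition of each participant
    """
    # Collect the RTs belonging to each group.
    groups = defaultdict(list)
    for t in trials:
        groups[tuple(t[f] for f in keys)].append(t["rt"])

    # Compute the lower and upper cutoff for each group.
    bounds = {}
    for g, rts in groups.items():
        m = mean(rts)
        # A group with a single trial has no defined SD; treat it as 0 (assumption).
        s = stdev(rts) if len(rts) > 1 else 0.0
        bounds[g] = (m - k * s, m + k * s)

    # Keep only the trials inside their own group's cutoffs.
    kept = []
    for t in trials:
        lo, hi = bounds[tuple(t[f] for f in keys)]
        if lo <= t["rt"] <= hi:
            kept.append(t)
    return kept
```

For example, `trim_3sd(trials)` uses the whole sample, while `trim_3sd(trials, keys=("subject", "condition"))` uses every condition of every participant. The same trial can be kept under one grouping and removed under another, which is why I am asking which one is appropriate.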