What data are to be removed from a quantitative analysis regarding noise perception?

Blaine Tomkins

You should always decide which method you're going to use to detect and/or remove outliers prior to analysis, and there should always be a theoretical justification for removing participants or data. Inconsistent responses are not sufficient grounds for removing participant data unless you included some kind of manipulation check in the experiment that a participant failed.

On another note, the coefficient of determination seems remarkably high. I wonder if the model is overfit to the data?

David Morse

Hello Ifigenia,

There's no assurance that people who choose extreme response options are wrong, biased, malicious, or inattentive to the task: only that they appear, relative to others in your sample, to be different. As well, the tendency to avoid extreme responses is a bias that has long been recognized in both the survey and scaling literature.

I agree with Blaine Tomkins that criteria for declaring a case as unsuitable for inclusion in analysis should have been declared a priori. Doing so and subsequently checking to see that model fit is improved sounds like "cherry picking," and in all likelihood would yield an overly optimistic estimate of model-data fit or S-N ratio (in other words, overfitting).

Good luck with your work.

Ifigenia Aslanidou

Thank you very much for both answers! David Morse Blaine Tomkins

I forgot to mention that the R-squared of 0.9734 corresponds to the normalized mean model (using the mean evaluation of each sound as input). The normalized R-squared was found to be 0.7107.

Martin Schmettow

First, to repeat what Blaine Tomkins said, because it is very important: Never remove outliers to "improve" your model! This can be considered fraud.

If you must identify outliers, never use boxplots! They only work for symmetric distributions. Rating scales normally produce asymmetric distributions (as do all other measures, strictly speaking).
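To make the boxplot point concrete, here is a small sketch. The ratings are invented for illustration (a hypothetical 1-to-5 scale where most people answer at the low end), and the 1.5 x IQR fence is the usual boxplot convention, not something taken from the original data.

```python
from statistics import quantiles

# Hypothetical, right-skewed ratings on a 1-5 scale (not real data).
ratings = [1] * 10 + [2] * 6 + [3] * 2 + [5] * 2

# Quartiles as used by a standard boxplot (default "exclusive" method).
q1, _median, q3 = quantiles(ratings, n=4)
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Points a boxplot would draw as "outliers".
flagged = [r for r in ratings if r < lower_fence or r > upper_fence]
print("fences:", lower_fence, upper_fence, "flagged:", flagged)
```

In this made-up sample the rule flags the ratings at the top of the scale, although they are perfectly valid answers. The fences come from the skewed bulk of the data, not from any evidence that these responses are faulty.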

Your outliers could be extreme responses of extreme responders. If you have repeated measures, you can estimate a multi-level model. The participant-level intercepts correct for response style (to some extent). The residuals of the model can be used for outlier detection (with care!).
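A rough sketch of the idea, with loud caveats: this is not a multi-level model. It is a simplified fixed-effects stand-in (rating = grand mean + participant effect + item effect + residual) that only works for a complete, balanced participants-by-items table. The data and the function name `additive_residuals` are hypothetical. A real analysis would fit a proper mixed model with a dedicated package.

```python
from statistics import mean

def additive_residuals(ratings):
    """Residuals of a two-way additive fit.

    ratings[p][i] is the rating of participant p for item i.
    Assumes a complete, balanced table (every participant rated every item).
    """
    n_items = len(ratings[0])
    grand = mean(v for row in ratings for v in row)
    participant_means = [mean(row) for row in ratings]
    item_means = [mean(row[i] for row in ratings) for i in range(n_items)]
    # Subtracting the participant mean removes each person's overall
    # tendency to rate high or low (a crude response-style correction).
    return [
        [y - participant_means[p] - item_means[i] + grand
         for i, y in enumerate(row)]
        for p, row in enumerate(ratings)
    ]

# Hypothetical ratings: 3 participants x 3 items.
ratings = [
    [2, 4, 6],
    [3, 5, 7],  # same pattern as participant 0, shifted up by 1
    [2, 4, 9],  # last rating departs from the common pattern
]
residuals = additive_residuals(ratings)
for row in residuals:
    print([round(r, 2) for r in row])
```

A participant who simply rates everything one point higher gets the same residuals as before, so a consistently "extreme" responder is not flagged for that alone. Only ratings that depart from what the participant and the item together predict show up as large residuals, and even those should be inspected rather than deleted.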

Something else you can do is estimate the same model with and without the outliers, to determine how much influence these points have. This is called an analysis of influential points (or leverage analysis) and is fully legitimate if you report it. It is a way to show that the overall trend is not just an artifact created by a few extreme observations.
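As a minimal illustration of that with-and-without comparison, the sketch below refits a simple least-squares line once per observation, leaving that observation out, and reports how much the slope moves. The data and the helper names (`ols_slope_intercept`, `leave_one_out_slope_changes`) are made up for the example. The original model is not known here, so a one-predictor regression stands in for it.

```python
from statistics import mean

def ols_slope_intercept(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx

def leave_one_out_slope_changes(x, y):
    """Slope of the full fit, plus the change in slope when each point is dropped."""
    full_slope, _ = ols_slope_intercept(x, y)
    changes = []
    for k in range(len(x)):
        slope_k, _ = ols_slope_intercept(x[:k] + x[k + 1:], y[:k] + y[k + 1:])
        changes.append(slope_k - full_slope)
    return full_slope, changes

# Hypothetical data: five points on a line plus one extreme observation.
x = [1, 2, 3, 4, 5, 6]
y = [1, 2, 3, 4, 5, 12]

full_slope, changes = leave_one_out_slope_changes(x, y)
print("slope with all points:", round(full_slope, 3))
for k, change in enumerate(changes):
    print(f"dropping point {k}: slope changes by {change:+.3f}")
```

If both fits are reported, readers can see for themselves whether the trend depends on a handful of observations. Nothing is silently removed.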

Martin Schmettow Thanks for taking the time to reply!

I didn't quite understand the paragraph "If you have... response style." Could you clarify that for me?

I describe this in my book:

https://schmettow.github.io/New_Stats/glm.html#rating-scales