One way to detect outliers in a dataset is the Z-score. However, I can't find any reference that gives a cut-off level. Could you please tell me the most reliable and widely used cut-off level among scholars?
If there were a single such cut score which unambiguously separated genuine cases from outliers (or, "outright liars"), I believe we all would have heard of it!
There isn't one; you simply have to make a judgment call. No matter what you select as a "critical" z-score threshold, do understand that there will inevitably be instances of false positive ("outlier") and false negative ("non-outlier") cases involved. As well, the shape of the distribution matters.
If a distribution is normal in shape, then cases having a z-score magnitude of 2.58 or more would occur less than 1% of the time. A threshold of +/-2.81 would represent a value beyond which no more than 0.5% of cases would fall. Assuming a normal distribution, I don't think many people would consider a z-score threshold of +/-3 too aggressive a choice for flagging cases as outliers.
However, for non-normal distributions, these thresholds would under-estimate the number/proportion of false positives for outlying cases. As well, if your research aim is to sort cases from overlapping distributions (e.g., mixture models), then a different approach should be applied.
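To make the thresholds above concrete, here is a minimal sketch of flagging cases by z-score; the data, the seed, and the threshold of 3 are all made up for illustration:

```python
import numpy as np

# Simulated data: 1000 roughly normal values plus two artificial outliers
rng = np.random.default_rng(0)
data = rng.normal(loc=50, scale=10, size=1000)
data = np.append(data, [120.0, -15.0])

# Standardize, then flag cases whose |z| exceeds the chosen threshold
z = (data - data.mean()) / data.std(ddof=1)
threshold = 3.0
outliers = data[np.abs(z) > threshold]
print(outliers)
```

Note that with 1000 genuinely normal points and a threshold of 3, a couple of perfectly legitimate cases will typically be flagged as well, which is exactly the false-positive issue described above.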
If you are looking to identify outlier data, drawing a simple box plot or using the Grubbs' test or the ROUT method can be helpful.
Most statistical software packages can run these methods as well.
Some of them (Grubbs' test or Dixon's Q test) are formal statistical tests and therefore yield a probability value (p-value), so you do not have to choose a cut-off yourself.
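Grubbs' test is short enough to sketch by hand if your software lacks it. The helper below is a hypothetical implementation of the two-sided test (the `grubbs_test` name and the sample data are made up), using the standard critical-value formula based on the t distribution:

```python
import numpy as np
from scipy import stats

def grubbs_test(x, alpha=0.05):
    """Two-sided Grubbs' test for a single outlier."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    mean, sd = x.mean(), x.std(ddof=1)
    G = np.max(np.abs(x - mean)) / sd                # Grubbs statistic
    t = stats.t.ppf(1 - alpha / (2 * n), n - 2)      # critical t value
    G_crit = ((n - 1) / np.sqrt(n)) * np.sqrt(t**2 / (n - 2 + t**2))
    return G, G_crit, G > G_crit

sample = [9.8, 10.1, 10.0, 9.9, 10.2, 15.0]          # 15.0 is the suspect point
G, G_crit, is_outlier = grubbs_test(sample)
```

Keep in mind that Grubbs' test assumes the underlying (non-outlier) data are normally distributed, and it tests for one outlier at a time.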
I agree that there is no formal cut-off value for outliers. Instead, I would plot the distribution of the Z-scores and look for values that are "detached" from the rest of the distribution (which is literally what "outliers" means).
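One rough way to operationalize "detached" without plotting is to sort the z-scores and look at the gaps between consecutive values; a detached point sits after an unusually large gap. A sketch with made-up data:

```python
import numpy as np

# 200 roughly normal values plus one clearly detached point
rng = np.random.default_rng(1)
x = np.append(rng.normal(size=200), [8.0])
z = (x - x.mean()) / x.std(ddof=1)

# Sort the z-scores and find the value sitting after the largest gap
z_sorted = np.sort(z)
gaps = np.diff(z_sorted)
value_after_largest_gap = z_sorted[np.argmax(gaps) + 1]
print(value_after_largest_gap)
```

This is only a heuristic (the largest gap may fall in the bulk of a small sample), but it mirrors what the eye does on a plot.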
Since z-scores go out to infinity, there is no single "cutoff". A lot depends on your risk. Traditionally, two standard deviations (z-score of < -2 or > +2) is used; that represents a spread of about 95% of the data IF your data are normal. Chebyshev's inequality gives you a worst case of 75% (1 - 1/z²) within two standard deviations, for any distribution. Statistical Process Control uses 3 standard deviations. The "Six Sigma" folks use 6. I once heard a Boeing (aircraft) presentation where they were using 9 for certain aircraft manufacturing tests.
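The contrast between the normal-theory coverage and Chebyshev's worst-case guarantee is easy to tabulate; a small sketch for a few of the thresholds mentioned above:

```python
import math

for k in (2, 3, 6):
    # Coverage within k standard deviations if the data are exactly normal
    normal_cov = math.erf(k / math.sqrt(2))
    # Chebyshev's guaranteed minimum coverage for ANY distribution
    chebyshev = 1 - 1 / k**2
    print(f"k={k}: normal {normal_cov:.5f}, Chebyshev >= {chebyshev:.5f}")
```

At k = 2 the gap is dramatic (about 95% vs a guaranteed 75%), which is why the shape of the distribution matters so much when choosing a threshold.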
BOTTOM LINE - it all depends upon your risk level and to some extent the distribution of the data (Normal vs ???). There is no "ANSWER" to your question.
The approach you adopt, the sensitivity of the subject, the dispersion of the data, the number of outliers at different cut-offs, and so on determine the suitable cut-off. Sometimes, to avoid losing a significant percentage of the data, we use wide bands. Sometimes, when the distribution is close to normal, narrower bands are preferable.