What is the best method to detect outliers in this data set?

Sukumaran Clementswami

First, the Shapiro-Wilk test results for your data:

Plant 1: Mean: 16.75, Standard Deviation: 6.18, Shapiro-Wilk Test Statistic: 0.696, p-value: 0.0103 (indicating a non-normal distribution)
Plant 2: Mean: 12.00, Standard Deviation: 8.04, Shapiro-Wilk Test Statistic: 0.724, p-value: 0.0213 (indicating a non-normal distribution)
Plant 3: Mean: 13.50, Standard Deviation: 5.57, Shapiro-Wilk Test Statistic: 0.957, p-value: 0.7593 (consistent with a normal distribution)
Plant 4: Mean: 13.50, Standard Deviation: 5.45, Shapiro-Wilk Test Statistic: 0.893, p-value: 0.3948 (consistent with a normal distribution)
Plant 5: Mean: 10.50, Standard Deviation: 5.00, Shapiro-Wilk Test Statistic: 0.982, p-value: 0.9109 (consistent with a normal distribution)

Note:

Plants 3, 4, and 5 do not have any outliers, and their p-values (greater than 0.05) give no evidence against a normal distribution.
Plant 1 has an outlier with a value of 26, and Plant 2 has an outlier with a value of 24. These extreme values are the likely reason that Plants 1 and 2 fail the normality test (p-values less than 0.05).
Because Plants 1 and 2 deviate from normality, the IQR method, which relies on quartiles rather than on the mean and standard deviation, is the more robust choice for detecting outliers in these plants.
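
As a rough illustration of the IQR rule, here is a minimal Python sketch. The numbers in `sample` are made up for the example and are not the plant measurements from the question; the function name `iqr_outliers` and the multiplier `k=1.5` (the usual Tukey fence) are my own choices. Quartile conventions differ between software packages, so the fences may shift slightly depending on the tool you use.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return the values outside the fences [Q1 - k*IQR, Q3 + k*IQR]."""
    # "inclusive" treats the data as the full population of interest;
    # other quartile conventions can give slightly different fences.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical measurements, NOT the data from the question.
sample = [12, 14, 15, 15, 16, 17, 18, 40]
print(iqr_outliers(sample))
```

To apply it to your own data, call `iqr_outliers` once per plant with that plant's measurements.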

Chebyshev’s inequality is conservative: because its bound must hold for any distribution, it gives broader ranges in which values are expected to fall, so it tends to identify fewer outliers than the IQR method. It is also less specific and often less practical for detecting outliers in continuous datasets.
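
To see how conservative a Chebyshev-style cutoff can be, here is a small Python sketch. It assumes the common practice of flagging values more than k sample standard deviations from the sample mean; the data are hypothetical (not the plant measurements), and the function names and the choices k = 2 and k = 3 are mine. Chebyshev’s inequality itself only states that at most 1/k² of a distribution lies more than k standard deviations from its mean.

```python
from statistics import mean, stdev


def chebyshev_max_fraction_outside(k):
    """Upper bound on the share of values beyond k standard deviations."""
    return 1.0 / k**2


def chebyshev_outliers(values, k=3.0):
    """Flag values more than k sample standard deviations from the mean."""
    m, s = mean(values), stdev(values)
    return [v for v in values if abs(v - m) > k * s]


# Hypothetical measurements, NOT the data from the question.
sample = [12, 14, 15, 15, 16, 17, 18, 40]
print(chebyshev_max_fraction_outside(3.0))  # bound on the share beyond 3 SD
print(chebyshev_outliers(sample, k=3.0))    # strict cutoff
print(chebyshev_outliers(sample, k=2.0))    # looser cutoff
```

On this sample the value 40 is about 2.4 standard deviations from the mean, so it is missed at k = 3 and only flagged at k = 2. With very small samples this is unavoidable: for n values, no point can lie more than (n - 1)/√n sample standard deviations from the sample mean, which is about 2.47 for n = 8.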

Use Chebyshev’s inequality when you need a distribution-agnostic bound, and the IQR method as a straightforward approach for everyday outlier screening. Combining both methods can provide a more comprehensive view of outliers.