How we can prove we have outliers in the results?

More Eghbal Rahimikia's questions See All

How can I prevent TMB from turning yellow?

I am working on cerium oxide and after adding TMB, the color of the solution changes to green and then yellow after a few minutes. How can I prevent this process? I want the color of the solution...

27 December 2021 3,656 0 View

Can we use negative values as inputs or outputs in data envelopment analysis(DEA) in a financial case?

I'm comparing some financial companies using data envelopment analysis(DEA) using their financial statements. Suppose that one of the outputs is income or profit. In some companies we have...

16 April 2016 9,837 7 View

How implement sampling methods (for unbalanced data) in k fold cross-validation?

Suppose that we have a unbalanced data-set for a binary classification problem and we want use 10-fold cross validation for training and testing fitted model. * Is this correct that we only use...

20 February 2016 9,397 3 View

Any advice in using Data Envelopment Analysis(DEA) by 6 months financial reports data (or yearly reports)? Which one is true?

I want use 6-months financial data of some companies financial reports to create an DEA model. Suppose that we have these financial reports: 6-2013 12-2013 6-2014 12-2014 6-2015 12-2015 We can use...

30 December 2015 8,786 12 View

What are data envelopment analysis (DEA) alternatives to calculate efficiency?

We can calculate efficiency using parametric (econometric models) and non-parametric models. one of the well-known non-parametric models to calculate efficiency (for example banking efficiency in...

14 September 2015 1,469 16 View

How can I find the pricing of garbage collection in different municipals of a city?

How we can price(calculate cost) garbage collection in different municipals of a city? These municipals in a city are out-sourcing all garbage collections using contracts with different companies....

26 July 2015 9,805 3 View

Is there any suggestion for a good reference book or manual for Verilog?

I know VHDL and did some projects, I am familiar with the hardware design concepts. What i need is a reference book in which i can get the rules in Verilogsomething like It is not possible to...

08 February 2015 2,759 15 View

Absorption coefficient of methane?

Hello, Can anyone provide me with the absorption coefficient of methane gas at 7.7 um? Any reference?

06 August 2024 980 5 View

How to determine method detection limit in an analytical method?

I know the difference between instrumental LOD and method LOD but my query is - in case of any sample whose concentration is zero or not detected by the instrumental LOD, is it possible to get...

24 July 2024 6,592 5 View

How to test multivariate outlier in STATA?

Hey all, I need help testing for multivariate outliers using STATA for my master thesis. The literature recommends the Minimum Covariance Determinant (MCD) (Verardi & Dehon, 2010). I found the...

22 July 2024 8,821 2 View

How to determine LOD values?

I am performing fluorescence experiments using a ligand to detect metal ions. I want to determine the Lowest Detection Limit (LOD) using the formula LOD = 3σ / K. However, I'm uncertain about...

19 July 2024 1,086 1 View

Can any of you suggest how to find the detection limit.?

I am working on flourescence sensing of heavy metals using quantum dots.Can any of you suggest how to find the detection limit. Whether it is calculated from formula or we can get from instrument?

19 July 2024 7,411 3 View

Which is the best approach for anomaly detection in scanned image data set?

Anomaly detection in scanned image data set

18 July 2024 3,578 3 View

How to label synapses in over-fixed mice brain sections (40 um) via immunohistochemistry (IHC)?

We have mice brains that were over-fixed due to old PFA used during perfusion. Thus, the synapses are no longer being labeled by the Synaptophysin (SY38) mAB. which works perfectly every other...

17 July 2024 7,767 3 View

How do you decide an optimal AC amplitude for non-faradaic electrochemical impedance spectroscopy?

I work with planar interdigitated electrodes (IDEs). The finger width and the spacing between the fingers are 5 micron. The surface of the IDEs is coated by a non-conductive polymer layer...

16 July 2024 5,031 1 View

Why flow rate of MFC for pure NH3 gas decreases with time?

We have been using MFC to control the flow rate of pure ammonia gas. The MFC is placed in an oven at 50 deg. C to avoid blockage due to NH3. However, after a few days of usage, the MFC is unable...

11 July 2024 5,796 1 View

What are the criteria that must be retained for the development and validation of a qualitative NMR method ?

Dear Community, I would like to develop and validate a qualitative NMR method for the analysis of a specific category of chemicals, and my question is the following: What are the criteria that I...

08 July 2024 1,402 4 View

Subhash Chandra

Run your model BOTH with and without the observations you think are outliers (whatever criterion you use for identify outliers). If you get more or less the same results, don't remove outliers. Note that removal of observations, for whatever reason, reduces your sample size, and tends to weaken the inferences you draw.

Stam Nicolis

There isn't *any* general way of stating what constitutes an ``outlier''-it assumes prior knowledge of what the signal is expected to be and what the model is expected to deliver and the only issue that can be meaningfully addressed is the distribution of the deviation between the two, under the assumptions. In particular, it isn't true that this deviation has a Gaussian distribution-it can and must be reconstructed from its moments and the relations between moments quantify what the distribution is. So the average value need not be a good approximation to the typical value, for instance.

Chandra Sekhar

Univariate -> boxplot. outside of 1.5 times inter-quartile range is an outlier.

Bivariate -> scatterplot with confidence ellipse. outside of, say, 95% confidence ellipse is an outlier.

Multivariate -> Mahalanobis D2 distance.

Otherwise:

Run a logistic regression (on Y=IsOutlier) to see if there are any systematic patterns.

Remove ones that you can demonstrate they are not representative of any sub-population.

Mariano Pierantozzi

Outliers...

What is outliers? If we're talking in a statistical point of view outliers are the same you cut in your work, but if we're talking in an experimental point of view perhaps outliers are the true data... So I think that remove outliers are a very delicate procedure.

mariano

Ali H Abuzaid

Removing outliers is not a recommended approach is statistical analysis, unless we justify the removal of such values.

The detection of outliers depends in both the nature of data and model you are looking for. One may use median instead of mean or use a robust model.

One common way to prove the necessity for removing outliers is to run your model with and without suspected outliers, if you find a significant difference in the results of two models then these outliers are also influential. Then you either use robust models or remove such points.