Normality can be checked with a goodness-of-fit test, e.g., the Kolmogorov-Smirnov test. When the data are not normally distributed, a non-linear transformation (e.g., a log transformation) may fix the issue.
I highly recommend the Kolmogorov-Smirnov or Shapiro-Wilk test: use the former if your sample size is greater than 30 and the latter if it is under 30.
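A minimal sketch of how either test might be run in practice, using SciPy (the simulated `data` sample and the branching at n = 30 simply illustrate the advice above):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
data = rng.lognormal(mean=0.0, sigma=0.5, size=100)  # illustrative skewed sample

if len(data) > 30:
    # K-S test against a normal with the sample's own mean and std.
    # Caveat: estimating parameters from the same data makes the standard
    # K-S p-value conservative (the Lilliefors correction addresses this).
    stat, p = stats.kstest(data, "norm", args=(data.mean(), data.std(ddof=1)))
    test_name = "Kolmogorov-Smirnov"
else:
    stat, p = stats.shapiro(data)
    test_name = "Shapiro-Wilk"

print(f"{test_name}: statistic={stat:.3f}, p-value={p:.4f}")
```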
I think the confusion many people have with normality and regression is this: it is best if the y_i are close to "normally" distributed, but that refers to the conditional distribution of y given the predictors for the ith case, i.e., y | predicted-y, where

y = predicted-y + epsilon.
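A minimal sketch of that distinction, with simulated data (the skewed predictor, the OLS fit via numpy.polyfit, and the Shapiro-Wilk check are my own illustrative choices): the unconditional y-values can fail a normality check even when the residuals epsilon pass it.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
x = rng.lognormal(sigma=1.0, size=200)         # skewed predictor
y = 2.0 + 3.0 * x + rng.normal(0, 1, 200)      # normal errors, so y is skewed

# Simple OLS fit: y = b0 + b1*x + epsilon
b1, b0 = np.polyfit(x, y, 1)
residuals = y - (b0 + b1 * x)

print("raw y:     p =", stats.shapiro(y).pvalue)          # typically tiny: y is skewed
print("residuals: p =", stats.shapiro(residuals).pvalue)  # typically large: eps ~ normal
```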
In addition, though it is inconvenient for the Gauss-Markov theorem, the sigma of epsilon for each case i naturally tends to become larger as predicted-y grows. (See https://www.researchgate.net/publication/320853387_Essential_Heteroscedasticity, bounded as justified by Ken Brewer.)
The unconditional distribution of the y-values, and of all the x-values as well, can be anything. With establishment survey data, for example, the data are generally highly skewed. The residuals can still be (close to) "normally" distributed, even though the variance increases with larger predictions. Weighted least squares (WLS) regression is then used. Note that OLS regression is just a special case of WLS regression, with equal weights that are often unrealistic. (See "When Would Heteroscedasticity in Regression Occur?", preprint, June 2021, J. Knaub, https://www.researchgate.net/publication/352134279_When_Would_Heteroscedasticity_in_Regression_Occur.)
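Here is a brief, hypothetical WLS sketch using statsmodels; the choice weights = 1/x (i.e., error variance assumed proportional to x, so sigma grows with the prediction) is one common assumption used for illustration, not something taken from the cited preprint.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
x = rng.uniform(1, 50, 300)
# Error spread grows with the prediction: sigma_i proportional to sqrt(x_i)
y = 2.0 + 3.0 * x + rng.normal(0, 1.5 * np.sqrt(x))

X = sm.add_constant(x)
ols = sm.OLS(y, X).fit()                    # special case of WLS: all weights equal
wls = sm.WLS(y, X, weights=1.0 / x).fit()   # weights taken as inverse variance

print("OLS coefficients:", ols.params)
print("WLS coefficients:", wls.params)
```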
Your data are not normally distributed. As Tukey (1986) says, all assumptions are wrong, and this includes the normality assumption. Thus the K-S test is really just checking whether your sample size is large enough for the non-normality to be detected. Are you more interested in deciding how much the deviations from the assumptions may affect your conclusions, or in looking for approaches that are less influenced by these deviations?
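To illustrate the sample-size point with a hypothetical simulation (the mildly skewed lognormal population is my own choice, not from the source): the same fixed deviation from normality passes a K-S check at small n and gets flagged as n grows.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)

# The same mildly skewed distribution at every n; only the sample size changes.
for n in (30, 300, 3000, 30000):
    sample = rng.lognormal(mean=0.0, sigma=0.2, size=n)
    stat, p = stats.kstest(sample, "norm", args=(sample.mean(), sample.std(ddof=1)))
    print(f"n={n:>6}: K-S p-value = {p:.4f}")  # tends to shrink as n grows
```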