I have a dependent variable series that contains negative values. Is it fine to go ahead with regression analysis, or should I first convert the DV series into its absolute form?
Thanks, Prof. Jochen, for answering my question. In my research model the dependent variable is the correlation between capital markets, and the independent variables are macroeconomic indicators. I am wondering whether an unrestricted specification might affect my results.
There is no problem with negative values for a dependent variable in Multiple Regression. It would be misleading to change your negative and positive values to absolute values.
You explain that your DV is a correlation coefficient, so its values can only range from -1 to +1. Linear models such as regression, however, assume the response can in principle take values beyond those limits. You should therefore consider transforming your correlations to z-values before applying regression; this is called the Fisher transformation.
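As a minimal sketch (assuming numpy is available; the data and variable names are purely illustrative), this is what applying the Fisher transformation arctanh(r) to a vector of correlations before fitting a simple linear model could look like:

```python
import numpy as np

# Illustrative data: correlation coefficients (the DV) and one macro indicator (the IV)
r = np.array([0.35, -0.12, 0.58, 0.21, -0.40, 0.73])   # sample correlations, bounded in (-1, 1)
x = np.array([1.2,   0.4,   2.1,  0.9,  -0.3,  2.8])   # hypothetical macroeconomic indicator

# Fisher transformation: z = arctanh(r) = 0.5 * ln((1 + r) / (1 - r))
# maps (-1, 1) onto the whole real line, which suits a linear model better
z = np.arctanh(r)

# Ordinary least squares of z on x (with an intercept)
X = np.column_stack([np.ones_like(x), x])
beta, *_ = np.linalg.lstsq(X, z, rcond=None)
print("intercept and slope on the z-scale:", beta)

# Fitted values can be mapped back to the correlation scale with tanh
print("fitted correlations:", np.tanh(X @ beta))
```

Note that the back-transformation with tanh guarantees the fitted values stay inside (-1, 1), which the untransformed model cannot do.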
Thanks, Prof. Hume, for your answer. Is it appropriate to apply the Fisher transformation when the variables violate the normality assumption? The bivariate correlations in my data are not normally distributed.
Yes, it is quite okay to use a Fisher transformation on correlations when your sample of correlations is not normally distributed.
It is also quite okay to run regression with data that are not normally distributed: your coefficient estimates are still BLUE (Best Linear Unbiased Estimator). The standard errors of your coefficients may be biased in such situations, though, so be careful when interpreting significance levels for marginally significant regressors.
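One practical way to check whether marginally significant coefficients are sensitive to the usual normal-theory standard errors is a case-resampling bootstrap. The sketch below is purely illustrative (simulated data, numpy only, not the poster's model): it compares the conventional OLS standard error of a slope with its bootstrap spread under skewed errors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative data with clearly non-normal (centred exponential) errors
n = 60
x = rng.normal(size=n)
y = 1.0 + 0.5 * x + rng.exponential(scale=1.0, size=n) - 1.0

X = np.column_stack([np.ones(n), x])

def ols_slope(X, y):
    """Slope coefficient from an OLS fit with intercept."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta[1]

# Conventional (normal-theory) standard error of the slope
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ beta
sigma2 = resid @ resid / (n - 2)
se_conv = np.sqrt(sigma2 * np.linalg.inv(X.T @ X)[1, 1])

# Case-resampling bootstrap of the slope
boot = np.array([
    ols_slope(X[idx], y[idx])
    for idx in (rng.integers(0, n, size=n) for _ in range(2000))
])
print("conventional SE:", se_conv, " bootstrap SE:", boot.std(ddof=1))
```

If the two standard errors diverge noticeably, that is a hint to be cautious about borderline p-values from the conventional output.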
It does not make a difference. You can eliminate negative values by adding the same constant to the whole variable, keeping in mind that adding or subtracting a constant shifts the mean but does not affect the variation (SD and variance).
I do not understand what you are trying to model, exactly. What are you trying to model in the capital markets? What exactly is your endogenous variable?
I think Abdulrazak Charbaji is correct: when you add any constant to your variable, the correlation coefficient as well as the regression coefficients remain unchanged. If you are using a t-test for the significance of a regression coefficient, then normality is required; for a z-test, normality is not required.
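A quick numerical check of the last two points (an illustrative numpy sketch with simulated data, not anyone's actual dataset): adding a constant to a variable shifts its mean but leaves its standard deviation, its correlation with another variable, and the OLS slope unchanged.

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=100)
y = 2.0 + 0.8 * x + rng.normal(scale=0.5, size=100)

y_shift = y + 10.0  # add an arbitrary constant

print("mean:   ", y.mean(), "->", y_shift.mean())                            # changes
print("std dev:", y.std(ddof=1), y_shift.std(ddof=1))                        # unchanged
print("corr:   ", np.corrcoef(x, y)[0, 1], np.corrcoef(x, y_shift)[0, 1])    # unchanged

X = np.column_stack([np.ones_like(x), x])
slope = np.linalg.lstsq(X, y, rcond=None)[0][1]
slope_shift = np.linalg.lstsq(X, y_shift, rcond=None)[0][1]
print("OLS slope:", slope, slope_shift)   # unchanged; only the intercept moves
```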
Adding an arbitrary constant to your independent variables indeed would not introduce any difficulty. You should not take the absolute value of the variable unless the absolute value of the DV is what you are actually trying to predict or understand. In either case, neither of these 'solutions' is necessary or even helpful.
Hume suggests that 'It is also quite okay to run regression with data that are not normally distributed.' Indeed, this is true; in fact we would generally not expect the raw data to be normally distributed. We may, however, expect the residuals to be normally distributed. Even so, as Hume suggests, if the residuals are not normally distributed, the parameter estimates from ordinary least squares regression are still the best linear unbiased estimator (BLUE). 'Best' is a little subjective: what it means here is minimum variance among linear unbiased estimators. We do still require the residuals to be homoskedastic (and uncorrelated) for the estimator to remain BLUE. However, OLS is then no longer the maximum likelihood estimator, and the standard errors calculated in the usual manner are no longer exact.
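As a small illustration of the BLUE point (a simulation sketch, not a proof, with assumed values for the slope and error distribution), the following numpy code fits OLS repeatedly with homoskedastic but strongly non-normal errors; the average slope estimate sits close to the true value even though the errors are far from Gaussian.

```python
import numpy as np

rng = np.random.default_rng(2)
true_slope, n, reps = 0.5, 50, 5000
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])

slopes = np.empty(reps)
for i in range(reps):
    # centred exponential errors: homoskedastic but strongly skewed
    e = rng.exponential(scale=1.0, size=n) - 1.0
    y = 1.0 + true_slope * x + e
    slopes[i] = np.linalg.lstsq(X, y, rcond=None)[0][1]

print("true slope:", true_slope, " mean OLS estimate over simulations:", slopes.mean())
```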
For further information on basic regression methods, I would recommend Frank Harrell's book, 'Regression Modeling Strategies.'