I don't think it is a great idea to drop the insignificant coefficients. Look at the point estimates: if they are large in magnitude, you are better off not dropping them.
The point estimates are not large. I prefer to drop the insignificant coefficients, as I am estimating a variable that is not complete for all the observations.
Why not drop the insignificant variables from the regression? But you should be aware that the precision of predictions does not have much to do with the significance of the variables. See my papers:
“The Prediction Market for the Australian Football League”, in Vaughan Williams, L. (ed.), Prediction Markets, Routledge, 2011, pp. 221-234;
and
“The Regression Tournament: A Novel Approach to Prediction Model Assessment” (with Janez Sustersic), Journal of Prediction Markets, Vol. 5, No. 2, 2011, pp. 32-43.
Roughly speaking, a variable is statistically insignificant if zero is a plausible value for its true slope parameter. But whenever a variable is statistically insignificant, some non-zero (possibly small) values are also plausible for that parameter. I suggest that your first decision is to identify the variables that the diagnostic statistics lead you to believe have zero influence. Drop those from the model, and keep any that are statistically insignificant but that you believe may nevertheless have non-zero influence. Then re-estimate and predict, as in the sketch below. (Or take a Bayesian approach?)
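A minimal sketch of that workflow on simulated data (statsmodels and the variable names here are my own assumptions, not from this thread):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)   # believed to have real influence
x2 = rng.normal(size=n)   # weak effect, may come out "insignificant"
x3 = rng.normal(size=n)   # believed to have zero influence
y = 1.0 + 2.0 * x1 + 0.3 * x2 + rng.normal(scale=2.0, size=n)

X_full = sm.add_constant(np.column_stack([x1, x2, x3]))
full = sm.OLS(y, X_full).fit()
print(full.pvalues)   # x2 and x3 may both look insignificant here

# Drop only x3 (believed zero); keep x2 despite its p-value, then re-estimate.
X_kept = sm.add_constant(np.column_stack([x1, x2]))
kept = sm.OLS(y, X_kept).fit()
print(kept.predict(X_kept[:5]))   # predictions from the re-estimated model
```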
Beware Mina: from a mechanical point of view, if you simply set some of the estimated coefficients to zero without re-fitting, then the fitted values no longer average to mean(y), i.e. the residuals no longer have mean zero.
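A quick numerical check of this point (a sketch with simulated data, numpy only):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 500
x1 = rng.normal(size=n)
x2 = rng.normal(loc=3.0, size=n)          # non-zero mean makes the effect visible
X = np.column_stack([np.ones(n), x1, x2])
y = X @ np.array([1.0, 2.0, 0.2]) + rng.normal(size=n)

beta, *_ = np.linalg.lstsq(X, y, rcond=None)
print((y - X @ beta).mean())              # ~0: OLS residuals have mean zero

beta[2] = 0.0                             # "drop" a coefficient without re-fitting
print((y - X @ beta).mean())              # no longer zero (about 0.2 * mean(x2))

beta2, *_ = np.linalg.lstsq(X[:, :2], y, rcond=None)
print((y - X[:, :2] @ beta2).mean())      # re-fitting with an intercept restores ~0
```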
There is a chapter in this book (http://www.stat.columbia.edu/~gelman/arm/) where Prof. Gelman explains why keeping statistically insignificant covariates is sensible.
Because the relative "significance" of regression coefficients changes with the mix of predictors used, significance is not a good way to determine which predictors to use. (Also, if you change the sample size, then at a given "significance" level you would even change the number of predictors kept!) A small simulated illustration follows.
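To illustrate both caveats on simulated data (a sketch, assuming statsmodels; not from the thread): the same predictor's p-value moves when a correlated predictor enters the model, and it also moves with the sample size.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
for n in (50, 5000):
    x1 = rng.normal(size=n)
    x2 = 0.9 * x1 + 0.1 * rng.normal(size=n)   # nearly collinear with x1
    y = 1.0 + 0.5 * x1 + rng.normal(size=n)
    p_alone = sm.OLS(y, sm.add_constant(x1)).fit().pvalues[1]
    p_joint = sm.OLS(y, sm.add_constant(np.column_stack([x1, x2]))).fit().pvalues[1]
    print(n, p_alone, p_joint)   # x1's "significance" depends on the mix and on n
```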
If you compare models using a "graphical residual analysis," you can see which fits better. If the models being compared differ only by the presence or absence of a single predictor, you can see how much difference it makes, but ONLY for that set of predictors, and ONLY for that sample. Regarding the sample, "cross-validation" can be used to try to avoid fitting more closely to a particular sample than can be supported by the population or subpopulation to which the model is supposed to apply.
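For instance, a minimal cross-validation sketch (scikit-learn assumed; the data and the predictor split are hypothetical) comparing out-of-sample error with and without a single candidate predictor:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(3)
n = 300
X = rng.normal(size=(n, 3))                  # third column is a candidate predictor
y = 1.0 + 2.0 * X[:, 0] + 0.3 * X[:, 1] + rng.normal(size=n)

mse_full = -cross_val_score(LinearRegression(), X, y,
                            scoring="neg_mean_squared_error", cv=5).mean()
mse_reduced = -cross_val_score(LinearRegression(), X[:, :2], y,
                               scoring="neg_mean_squared_error", cv=5).mean()
print(mse_full, mse_reduced)   # compare predictive error across the two models
```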
To keep this visual (a picture being worth a thousand words ... or statistics), you might, as one option, do a graphical residual analysis with more than one model represented on the same scatterplot for a given sample, and then do the same on another scatterplot for another sample. Performances can then be compared between the models in each case; one way to draw this is sketched below.
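One possible way to draw that comparison (matplotlib and statsmodels assumed; simulated data, not from the thread):

```python
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(4)
n = 200
x1, x2 = rng.normal(size=n), rng.normal(size=n)
y = 1.0 + 2.0 * x1 + 0.5 * x2 + rng.normal(size=n)

m_full = sm.OLS(y, sm.add_constant(np.column_stack([x1, x2]))).fit()
m_small = sm.OLS(y, sm.add_constant(x1)).fit()

# Residuals from both candidate models on one scatterplot, against fitted values.
plt.scatter(m_full.fittedvalues, m_full.resid, label="with x2", alpha=0.6)
plt.scatter(m_small.fittedvalues, m_small.resid, label="without x2", alpha=0.6)
plt.axhline(0, color="grey")
plt.xlabel("fitted values"); plt.ylabel("residuals"); plt.legend()
plt.show()
```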
Further, please note that graphical residual analyses can also be used to study heteroscedasticity. Please see
https://www.researchgate.net/project/OLS-Regression-Should-Not-Be-a-Default-for-WLS-Regression, and the various updates there, in reverse chronological order.
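As a rough sketch of that theme (simulated data; the 1/x^2 weights are assumed known purely for illustration, which real applications would have to estimate): an OLS residual plot exposes the widening spread, and a WLS re-fit changes the standard errors.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
n = 300
x = rng.uniform(1, 10, size=n)
y = 1.0 + 2.0 * x + rng.normal(scale=x, size=n)   # error spread grows with x

X = sm.add_constant(x)
ols = sm.OLS(y, X).fit()
# Plotting ols.resid against x would show the characteristic widening "fan".
wls = sm.WLS(y, X, weights=1.0 / x**2).fit()      # weights proportional to 1/variance
print(ols.bse)   # OLS standard errors
print(wls.bse)   # WLS standard errors under the assumed weights
```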