Missing Value In A Classification Model?

Samer Sarsam

Hi Shivi,

There are several common approaches to dealing with missing data. One is to replace each missing value with a measure of central tendency for that attribute, e.g., the mean or median. Another is to use 0 to indicate that no value existed for that instance. Which one works better varies from one case to another, depending on the characteristics of the data.
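As a rough illustration of the options above, here is a minimal sketch using NumPy. It assumes a purely numeric feature matrix in which missing entries are encoded as NaN; the function name `impute_column_stat` and the toy data are made up for this example.

```python
import numpy as np

def impute_column_stat(X, strategy="mean"):
    """Fill NaN entries column by column with the mean, the median, or 0."""
    X = np.asarray(X, dtype=float)
    if strategy == "mean":
        fill = np.nanmean(X, axis=0)      # per-column mean, ignoring NaN
    elif strategy == "median":
        fill = np.nanmedian(X, axis=0)    # per-column median, ignoring NaN
    elif strategy == "zero":
        fill = np.zeros(X.shape[1])       # constant 0 for "no value existed"
    else:
        raise ValueError(f"unknown strategy: {strategy!r}")
    filled = X.copy()
    rows, cols = np.where(np.isnan(X))    # positions of the missing entries
    filled[rows, cols] = fill[cols]
    return filled

# Hypothetical toy data: two attributes, one missing value in each.
X = np.array([
    [1.0, 10.0],
    [np.nan, 20.0],
    [3.0, np.nan],
    [8.0, 60.0],
])
X_mean = impute_column_stat(X, "mean")
X_median = impute_column_stat(X, "median")
X_zero = impute_column_stat(X, "zero")
```

If the model is evaluated on held-out data, it is usually safer to compute the fill values on the training split only and reuse them on the test split.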

HTH.

Samer

Sergey Porotsky

In my opinion, it isn't fully correct to insert the mean value in place of a missing value. Some parameters may be strongly correlated, and in that case, to fill in a missing value of one parameter we should take into account the values of the other (not missing!) parameters and their correlations. So you should use the expression for the conditional mean of a multivariate normal distribution.
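A minimal sketch of that idea, under several assumptions that are not from the thread: the features are numeric and roughly jointly normal, missing entries are NaN, and the mean vector and covariance matrix are estimated from the fully observed rows only. For a row with observed part o and missing part m, the fill is mu_m + S_mo S_oo^-1 (x_o - mu_o). The function name `conditional_mean_impute` and the toy data are hypothetical.

```python
import numpy as np

def conditional_mean_impute(X):
    """Fill NaN entries with the conditional mean of a multivariate normal.

    Assumes at least two features and enough complete rows for the
    covariance of the observed features to be non-singular.
    """
    X = np.asarray(X, dtype=float)
    complete = X[~np.isnan(X).any(axis=1)]   # rows with no missing values
    mu = complete.mean(axis=0)               # estimated mean vector
    sigma = np.cov(complete, rowvar=False)   # estimated covariance matrix
    filled = X.copy()
    for i, row in enumerate(X):
        miss = np.isnan(row)
        if not miss.any():
            continue
        if miss.all():
            filled[i] = mu                   # nothing observed: fall back to the mean
            continue
        obs = ~miss
        s_oo = sigma[np.ix_(obs, obs)]       # covariance among observed features
        s_mo = sigma[np.ix_(miss, obs)]      # covariance of missing vs observed
        # E[x_m | x_o] = mu_m + S_mo S_oo^-1 (x_o - mu_o)
        filled[i, miss] = mu[miss] + s_mo @ np.linalg.solve(s_oo, row[obs] - mu[obs])
    return filled

# Hypothetical toy data in which the second feature is exactly 2 * first + 1.
X = np.array([
    [1.0, 3.0],
    [2.0, 5.0],
    [3.0, 7.0],
    [4.0, 9.0],
    [5.0, np.nan],
    [np.nan, 9.0],
])
X_filled = conditional_mean_impute(X)
```

Because the toy data lie exactly on a line, the filled values should land on that line as well (about 11 for the missing second feature and about 4 for the missing first feature), whereas plain mean imputation would give 6 and 2.5. With real data the estimate is only as good as the normality assumption and the number of complete rows.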

R.M. Kapila Tharanga Rathnayaka

This is a good reference for that:

https://www.yorksj.ac.uk/media/content-assets/schools/psychological-social-sciences/documents/How-to-enter-missing-data-in-SPSS.pdf

Shivi Bhatia

Thank you all for the above valuable answers.