In univariate analysis, it is advised to perform statistical imputation when there are missing data. What are the expected repercussions on the analysis?
It depends on whether the data are missing at random, i.e., whether the missingness is unrelated to the values themselves. If it is, the estimates will be unbiased with little loss of power. There are two main methods of performing imputation:
1. Multiple Imputation (MI) fills in estimates for the missing data; to capture the uncertainty in those estimates, MI generates the estimates multiple times. The result is several data sets that are identical for all of the observed values but have slightly different imputed values in each data set.
2. The second method is to analyze the full, incomplete data set using maximum likelihood estimation. This method does not actually impute any data; rather, it uses all of the available data to compute maximum likelihood estimates. The maximum likelihood estimate of a parameter is the value of the parameter that is most likely to have produced the observed data.
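The multiple-imputation idea above can be sketched with a toy example using only numpy: draw the missing values from a normal distribution fit to the observed data, repeat the analysis (here, estimating the mean) on each completed data set, and pool the results with Rubin's rules. The data values and the number of imputations are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: NaN marks missing observations (invented for illustration)
y = np.array([4.1, 3.8, np.nan, 5.0, 4.4, np.nan, 4.7, 3.9])
obs = y[~np.isnan(y)]
n_missing = int(np.isnan(y).sum())

M = 20  # number of imputed data sets
estimates, variances = [], []
for _ in range(M):
    completed = y.copy()
    # Draw imputations from a normal fit to the observed values,
    # so each completed data set differs slightly in the imputed cells
    completed[np.isnan(completed)] = rng.normal(obs.mean(), obs.std(ddof=1), n_missing)
    estimates.append(completed.mean())
    variances.append(completed.var(ddof=1) / len(completed))

# Rubin's rules: pool the M analyses into one estimate and one variance
q_bar = np.mean(estimates)              # pooled point estimate
u_bar = np.mean(variances)              # within-imputation variance
b = np.var(estimates, ddof=1)           # between-imputation variance
total_var = u_bar + (1 + 1 / M) * b
print(q_bar, total_var)
```

The between-imputation term `b` is exactly the uncertainty that single imputation throws away: the total variance is always at least the within-imputation variance.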
There are several ways data can be missing, but essentially: if a missing datum is an "ignorable" nonresponse, you might assume the mean of the collected data can stand in for it, but the imputed value should not count toward your variance estimate. If a missing datum is a "nonignorable" nonresponse, it cannot be considered to be generated by the same mechanism as the collected data, and using the mean of the collected data would bias the results.
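A quick way to see why imputed values should not count toward the variance estimate: filling in the observed mean leaves the point estimate unchanged under ignorable nonresponse, but the filled-in values add no spread, so the naive sample variance is deflated. A small sketch (data invented):

```python
import numpy as np

# Toy data with two missing values (invented for illustration)
y = np.array([2.0, 3.5, np.nan, 4.0, np.nan, 2.5, 3.0])
obs = y[~np.isnan(y)]

# Mean imputation: the overall mean is unchanged...
filled = np.where(np.isnan(y), obs.mean(), y)

# ...but the imputed cells sit exactly at the mean, shrinking the variance
print(obs.var(ddof=1), filled.var(ddof=1))
```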
If you have related auxiliary data, they might be used in a regression to "predict" the missing data.
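For instance, with an auxiliary variable x observed for every unit, a least-squares fit on the complete cases can "predict" the missing y values. A minimal sketch with invented data, using `np.polyfit` for the fit:

```python
import numpy as np

# Auxiliary variable x observed for all units; y partially missing (invented data)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.1, 3.9, np.nan, 8.2, np.nan, 12.1])

miss = np.isnan(y)

# Fit y ~ x on the complete cases only
slope, intercept = np.polyfit(x[~miss], y[~miss], 1)

# Predict the missing y from the observed auxiliary x
y_imp = y.copy()
y_imp[miss] = slope * x[miss] + intercept
print(y_imp)
```

Note that deterministic regression predictions share the variance problem of mean imputation; in practice a residual draw is often added to each prediction.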
You might research "response propensity" groups.
If you can stratify the data by like characteristics, you may reduce bias from nonignorable nonresponse.
The idea is that there may be reasons for nonresponse that tend to make the observed responses, say for continuous data, systematically larger (or smaller) than the nonresponses would have been had it been possible to reliably collect them.
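One common form of this stratification idea is post-stratification: compute the respondent mean within each stratum and weight the stratum means by their known population shares, so strata with heavier nonresponse are not underrepresented. A minimal sketch; the stratum names, shares, and values are all invented:

```python
import numpy as np

# Respondent values grouped by stratum; population shares assumed known
respondents = {"urban": [5.0, 6.0, 5.5], "rural": [2.0, 2.5]}
shares = {"urban": 0.5, "rural": 0.5}  # true population proportions

# Naive respondent mean overweights the stratum that responded more
all_values = np.concatenate([np.asarray(v) for v in respondents.values()])
naive_mean = all_values.mean()

# Post-stratified mean weights each stratum mean by its known share
strat_mean = sum(shares[s] * np.mean(v) for s, v in respondents.items())
print(naive_mean, strat_mean)
```

Here the urban stratum has three respondents to the rural stratum's two, so the naive mean (4.2) is pulled above the post-stratified mean (3.875).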
Cheers - Jim
PS - So the consequences you asked about are variance and bias considerations, which depend upon the type of nonresponse and upon the imputation procedure, of which there are several.
If you use a dedicated package (e.g. Solas), any remaining risk will be "empirical" - somewhat more so if you use "Bayesian imputation". But an imputation is never "real"; it is just the most theoretically appropriate value for estimating the missing one. If we carry out a posterior survey (i.e., if the missingness arises in a sampling experiment), we may find that the actual value is an extreme one (an outlier), which could be why it was missing in the first place.
I also suggest adding a dummy variable (yes/no) if you decide to impute a value. Include the dummy in your model; if it is non-significant, then you know that imputing did not have an important effect on your results.
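That check can be sketched with numpy alone: mean-impute the predictor, add a 0/1 dummy flagging the imputed rows, and inspect the dummy's fitted coefficient (with real data you would test its significance in your modeling package; everything below is invented for illustration).

```python
import numpy as np

rng = np.random.default_rng(1)

# Invented data: y depends linearly on x; some x values are lost
x = rng.normal(0, 1, 50)
y = 2.0 * x + rng.normal(0, 0.5, 50)
x_missing = x.copy()
x_missing[:10] = np.nan  # first 10 predictor values missing

# Mean-impute x and flag the imputed rows with a dummy
dummy = np.isnan(x_missing).astype(float)
x_imp = np.where(np.isnan(x_missing), np.nanmean(x_missing), x_missing)

# Design matrix: intercept, imputed x, and the imputation dummy
X = np.column_stack([np.ones_like(y), x_imp, dummy])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print(coef)  # [intercept, slope, dummy effect]
```

A dummy coefficient far from zero warns you that the imputed rows behave differently from the observed ones, i.e., that the imputation is affecting your conclusions.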
It is ABSOLUTELY critical to know if your data are missing not at random - a very hard lesson I learned early in my career. I recommend the Analysis of Messy Data series by Milliken and Johnson: Volume 1 (Designed Experiments), Volume 2 (Nonreplicated Experiments), Volume 3 (Analysis of Covariance). They seem to be available used, new, and as e-books.