Can anyone point me to a way to test for the normality of censored data in R? I have heard of the Cramér-von Mises statistic, but the paper is behind a paywall.
The site uses the package CvM2SL2Test; however, when I tried to install it, the installation failed. I do not know whether I made a mistake or whether the package is no longer available. I did not see a simple fix.
If this is a prelude to other analyses, then I would ask a few questions.
If you fail to reject the null hypothesis, is it appropriate to conclude that the null hypothesis is true?
Maybe: if you fail to reject the null hypothesis, is it safe to assume that the robustness of the statistical method that assumes normality is sufficient to overcome any minor departure from normality that is present but undetectable at the existing sample size?
Can you do it graphically? Yes, the approach is crude and you don't get p-values. But if you plot a histogram of the data over a normal curve, does the fit look reasonable? A Q-Q plot is another option. And if there are too few data points to make the determination, would you trust the results of any test statistic?
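As for the plots themselves, something like this in base R, with made-up lifespans purely for illustration:

    # Hypothetical lifespans in days, just to illustrate the two plots
    longevity <- c(612, 745, 698, 820, 560, 710, 775, 640, 690, 730)

    # Histogram with a fitted normal curve overlaid
    hist(longevity, freq = FALSE, xlab = "Lifespan (days)",
         main = "Longevity vs. fitted normal curve")
    curve(dnorm(x, mean = mean(longevity), sd = sd(longevity)),
          add = TRUE, lwd = 2)

    # Normal Q-Q plot: roughly straight points suggest approximate normality
    qqnorm(longevity)
    qqline(longevity)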
I want to show that normal distributions are reasonable approximations to rodent longevities. If they are, both the design and the analysis of experiments become much more efficient, because we do not have to use nonparametric methods such as the Cox model. (Yes, I know that technically Cox is "semiparametric.")
I think the key to all of this is to find a quantifiable value that defines "reasonable approximation," along with the scientific justification for that choice. Once that is done, the rest should be relatively easy. At what point do departures from normality stop biasing the statistical results enough to have economic, political, social, or legal consequences?
At this point, my reaction is to suggest that the question is unanswerable, especially with a single statistic, no matter how sophisticated. Still, sometimes it is good to tackle what seem to be unsolvable problems.
It is not clear to me what you are doing or why you need a distribution. At any rate, "normality" is usually not the "norm." You say, "I want to show that normal distributions are reasonable approximations to rodent longevities." But longevity sounds like a reliability problem to me, so if you are doing something that requires a distribution, don't you want the Weibull? Normality does not seem to be the case, and if you are "censoring," I expect that means you are cutting off the tail of your distribution, which means you should not expect any kind of classic fit unless you cut very little.
Or, by "censored data," do you just mean the end of the period of longevity?
Anyway, it sounds like a reliability problem, just like estimating the life of a light bulb.
Best wishes - Jim
PS - Beware of p-values. They do not stand alone. A type II error analysis, or something similar, is needed to account for effect size.
Regarding misused p-values, see the American Statistical Association's 2016 statement on p-values (Wasserstein & Lazar, The American Statistician) and its accompanying press release.
I'm not testing for "reliability" in an industrial sense. I'm comparing longevities of different treatment groups. If the normal distribution is a better fit to reality and the mathematics are a lot simpler, why use Weibull?
If you are comparing two groups, you could get a confidence interval for the difference in their means. With a large enough sample size, the normal distribution is fine for the standard errors of means used in confidence intervals. If the population distribution is anywhere close to "normal," then the sample size does not have to be very large for the distribution of that statistic (the mean) to look very "normal." (You might look at Chebyshev's inequality as a worst case.) The central limit theorem helps with means, if I understand your problem.
A confidence interval for your difference in means, like a p-value, is sample-size dependent, but it is more practically interpretable. If you have good "confidence" in an interval that does not include zero, then it gives you an idea of how different those means might be (an effect size).
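To make that concrete, a small sketch in R with simulated lifespans (all numbers invented):

    # Two hypothetical treatment groups of simulated lifespans (days)
    set.seed(1)
    control   <- rnorm(30, mean = 700, sd = 80)
    treatment <- rnorm(30, mean = 760, sd = 80)

    # Welch two-sample t-test; the confidence interval is for the
    # difference in mean longevity between the two groups
    res <- t.test(treatment, control)
    res$conf.int   # an interval excluding zero suggests a real difference
    res$estimate   # the two sample means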
If Dr. Robertson has already measured different longevities for different treatments, then it is not necessary to use normal distributions. If well-measured data are censored, then the analysis is "abnormal," because censoring affects the mean longevity of the particular group. Is it necessary to model the probabilities at all? In that case each treatment will have its own mean and its own distribution, clearly different from normal ones.
Normal distribution theory is, in my opinion, the wrong theory and practice. Some time ago, I gave you an alternative that only requires the minimum longevity, the maximum longevity, and the mean longevity for each treatment. It does not use any dispersion parameter, only the mean. Of course, it is only a proxy model: it fits the extreme values and the mean exactly, using the data plus the mean of the data.
Why do normal distributions have so many followers yet produce so many wrong results?
Is Dr. Chaves familiar with survival analysis? In real life, most experiments have a portion of longevity data lost to follow-up (right-censored). There may also be right truncation due to time limits: most grants only fund for two years, while rodents live longer. There will also typically be left truncation, since the rodents are a few weeks old when the experiment begins. Parametric survival analysis incorporates the censored and truncated data and fits known distributions; some distributions will fit better than others. Semiparametric models such as the Cox model lose power because they do not try to fit the hazard function to a curve when, in fact, the data may follow one. The assumption that the data cannot fit a known distribution is potentially a strong one.
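Here is a rough sketch of that idea with the survival package in R; the data are simulated and the two-year cutoff is just for illustration (handling left truncation would need extra machinery beyond this):

    # Simulated lifespans, right-censored at a two-year study limit
    library(survival)
    set.seed(42)
    true_time <- rweibull(100, shape = 4, scale = 750)  # hypothetical lifespans
    time   <- pmin(true_time, 730)                      # observe up to day 730
    status <- as.numeric(true_time <= 730)              # 1 = death observed

    # Fit several candidate distributions and compare by AIC (lower is better)
    dists <- c("weibull", "lognormal", "gaussian")
    fits  <- lapply(dists, function(d) survreg(Surv(time, status) ~ 1, dist = d))
    setNames(sapply(fits, AIC), dists)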
Hi Henry Robertson, you can read the article online through JSTOR: http://www.jstor.org/stable/2335622. Also see "Testing for Normality of Censored Data" by J. Anderson (DiVA), available at https://www.diva-portal.org/smash/get/diva2:816450/FULLTEXT01.pdf
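If you also want something runnable right away, one possible route is the fitdistrplus package, which fits distributions to censored data; this is only a sketch with invented numbers, and it gives a fitted model plus diagnostics rather than a formal test statistic:

    # Fit a normal distribution to right-censored data with fitdistrplus.
    # fitdistcens() wants a data frame with 'left' and 'right' bounds;
    # for a right-censored observation, right = NA.
    library(fitdistrplus)
    set.seed(7)
    x <- rnorm(50, mean = 700, sd = 80)              # hypothetical lifespans
    d <- data.frame(left  = pmin(x, 800),            # observed or cutoff value
                    right = ifelse(x > 800, NA, x))  # NA marks censoring

    fit_norm <- fitdistcens(d, "norm")
    summary(fit_norm)      # parameter estimates, log-likelihood, AIC/BIC
    cdfcompcens(fit_norm)  # empirical vs fitted CDF, adapted for censoring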