I need to perform a Monte Carlo simulation in the reliability field. I know how to generate random failure times according to the Weibull CDF F(t) = 1 - exp(-a*t^b), but how do I generate right-censored times, which occur when the last measurement takes place before the failure?
If you know when measurements will be censored, this is doable. It is best explained by an example:
Suppose we run a reliability test with 5 items for 2 years. Just generate 5 samples from the lifetime distribution (Weibull in your example). If any of these 5 samples exceeds 2 years, replace it with 2 years and mark it as a censored measurement.
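A minimal sketch of this fixed-duration (Type I censoring) recipe, assuming the questioner's parameterization F(t) = 1 - exp(-a*t^b); the parameter values, the sample size and the Python/NumPy implementation are my own illustrative choices, not something from the thread:

```python
# Type I censoring sketch: the test stops after a fixed duration, so any
# lifetime exceeding it is recorded as censored at that duration.
# Values of a, b, n_items and test_duration are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(seed=1)

def sample_weibull(n, a, b, rng):
    """Inverse-transform sampling from F(t) = 1 - exp(-a * t**b)."""
    u = rng.uniform(size=n)
    return (-np.log(1.0 - u) / a) ** (1.0 / b)

n_items = 5
test_duration = 2.0                      # years
lifetimes = sample_weibull(n_items, a=0.1, b=1.5, rng=rng)

censored = lifetimes > test_duration     # True where the item outlived the test
observed = np.where(censored, test_duration, lifetimes)
print(list(zip(observed.round(2), censored)))
```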
Dear Joachim, thank you very much for your answer. Unfortunately, I do not know the censoring times in advance. Example: I have a labeled array of times, each labeled either "failure" or "censored". By means of the least-squares method I obtained the parameters of the CDF of the failure times, and now I want to generate a similar array of failure/censored event times.
I have a bit of a problem understanding what exactly you want to do. Could you please state the application (e.g. Matlab) you are using to generate the samples?
Dear Hindolo and Ilya, thanks for your answers. For example, the input array of event times is:
41.1c, 77.8, 83.3c, 88.7c, 101.8, 105.9, 117, 126.9, 138.7, 148.9, 151.3c, 157.3, 163.8, 177.2c, 194.3c, 195.6c, 207, 215.3c, 217.4, 258.8c, where c indicates a censored event. We see that this is not the situation that Joachim mentioned. By means of least squares the following values of the Weibull parameters were obtained: beta = 3.4, theta = 190.
Now I want to generate another sample (of a different size!) containing both failures and censored events, according to these parameters. For failures it is obvious, but how do I generate the censored events?
One possibility: calculate t_failure = F^(-1)(rand) and t_censored = (1-F)^(-1)(rand), where F is the CDF of the failure time and F^(-1) and (1-F)^(-1) are the inverse functions of F and (1-F). Then calculate t_event = min(t_failure, t_censored).
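A minimal sketch of this rule, again assuming the parameterization F(t) = 1 - exp(-a*t^b) from the original question; the parameter values are illustrative assumptions:

```python
# Draw a failure time via the inverse CDF and a censoring time via the inverse
# survival function, then keep the smaller of the two as the observed event.
import numpy as np

rng = np.random.default_rng(seed=2)

def inv_cdf(u, a, b):
    """t = F^(-1)(u) for F(t) = 1 - exp(-a * t**b)."""
    return (-np.log(1.0 - u) / a) ** (1.0 / b)

def inv_survival(u, a, b):
    """t = (1-F)^(-1)(u), i.e. solve exp(-a * t**b) = u."""
    return (-np.log(u) / a) ** (1.0 / b)

a, b, n = 0.1, 1.5, 10
t_failure = inv_cdf(rng.uniform(size=n), a, b)
t_censor  = inv_survival(rng.uniform(size=n), a, b)

t_event  = np.minimum(t_failure, t_censor)
censored = t_censor < t_failure          # flag the events that were censored
print(list(zip(t_event.round(1), censored)))
```

Note that for a uniform random number U, (1-F)^(-1)(U) has the same distribution as F^(-1)(U), so under this rule the censoring time follows the same Weibull law as the failure time.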
Dear Sasha, regarding parameter estimation, you should use the maximum likelihood method.
See Gertsbakh, Reliability Theory with Applications to Preventive Maintenance, Springer, 2000, p. 54. As for the generation, you must know the random censoring mechanism. Right now, it is not clear to me how to "restore" it from the observed data.
Dear Sasha, thanks for the explicit explanation. However, may I ask why you need to sample the censored times? If your aim is survival analysis, the failure time distribution is enough; you do not need to sample the censored times. For instance, to determine the number of surviving elements at time t, simply generate n failure times, n being the total number of elements. The number of failure times exceeding t is the number of surviving elements.
There is a built-in Matlab function that computes Weibull parameters with one line of code; see http://www.mathworks.com/help/stats/wblfit.html for details. This function yielded (200.24, 3.0) as the Weibull parameter set for the data you supplied above.
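A small sketch of the survivor-counting idea from the previous post, using the (theta, beta) parameterization F(t) = 1 - exp(-(t/theta)^beta) with the least-squares estimates quoted earlier in the thread; the sample size and the choice of Python/NumPy are my own assumptions:

```python
# Estimate the number of survivors at time t from n simulated failure times
# and compare with the theoretical survival function.
import numpy as np

rng = np.random.default_rng(seed=3)

theta, beta, n = 190.0, 3.4, 1000
failure_times = theta * rng.weibull(beta, size=n)   # scale * standard Weibull

t = 150.0
survivors = np.count_nonzero(failure_times > t)
print(f"survivors at t={t}: {survivors} of {n} "
      f"(theory: {n * np.exp(-(t / theta) ** beta):.0f})")
```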
Answering the previous letter: about parameter estimation, all correct. But Sasha probably wants to understand the probabilistic mechanism governing the censoring process, and this is not that simple. In my view, there might be many such mechanisms which would produce similar-looking samples.
Dear Hindolo and Ilya, thanks for your answers. The reason for taking censored events into account is the following: to simulate different situations with the same input parameters and to analyse the sensitivity. So Ilya is absolutely right, I "want to understand the probabilistic mechanism governing the censoring process".
Regarding my answer (above): it is, certainly, only one of the possible expressions. You could also use others, e.g., select some value of K and calculate: if t_failure < K*t_censored, then t_event = t_failure; else t_event = t_censored.
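A minimal sketch of this K-rule, assuming (as in the earlier rule) that the candidate censoring time follows the same Weibull law as the failure time; K and the (theta, beta) values are illustrative assumptions:

```python
# K-rule sketch: an event is recorded as a failure only if the failure time is
# smaller than K times the candidate censoring time.
import numpy as np

rng = np.random.default_rng(seed=4)
theta, beta, K, n = 190.0, 3.4, 0.8, 20

t_failure = theta * rng.weibull(beta, size=n)   # F(t) = 1 - exp(-(t/theta)**beta)
t_censor  = theta * rng.weibull(beta, size=n)

failed  = t_failure < K * t_censor
t_event = np.where(failed, t_failure, t_censor)
print(np.round(t_event, 1))
print("fraction of failures:", failed.mean())
```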
The problem is that there might be many random mechanisms producing seemingly similar results. For example: if observation x_i ends in 1, 2, 3 or 4, the next observation is censored by a random variable Z ~ F_1(t); otherwise, the censoring variable is W ~ F_2(t).
Dear Ilya, what is the (possible) conclusion? Using different rules for censoring, we can get essentially different data sets of failures and censored events, but with the same values of the theta and beta parameters! It seems to me that in this case the MLE estimates for these data sets will also be essentially different for any (even very large) sample size. So these estimates will be asymptotically biased, in contrast to regular (failures-only) data sets?
Dear Ilya and Sergey, I have been following your interesting exchange regarding the censored times. Out of curiosity, what is the physical implication of generating the censored times in survival analysis? Cheers!
I fully agree with the reason mentioned by Alex and Ilya: "to understand the probabilistic mechanism governing the censoring process". Another (possible) reason is to generate a very large sample in order to obtain and analyse some "rare-event" estimates.
Some remarks about my answers above ("if t_failure < K*t_censored, t_event = t_failure; else t_event = t_censored..." and "Using different rules for censoring, we can get essentially..."). I have performed some numerical experiments: I generated samples of 100 to 10,000 events (failures + censored) according to some fixed values of the Weibull parameters, and then estimated the parameters by MLE. Short conclusion:
If K >= 1 (fraction of failures > 50%), the results are good.
If K < 0.5 (fraction of failures < 20%), the accuracy is bad for any sample size.
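A rough reconstruction of such an experiment (not the original code): generate events with the K-rule above, then fit (theta, beta) by maximizing the right-censored Weibull likelihood; all parameter values, the sample size and the SciPy-based fitting are my own assumptions:

```python
# Generate a right-censored Weibull sample via the K-rule and recover the
# parameters by maximum likelihood (failures contribute log f(t), censored
# events contribute log S(t)).
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(seed=5)

def generate_events(n, theta, beta, K, rng):
    t_failure = theta * rng.weibull(beta, size=n)
    t_censor  = theta * rng.weibull(beta, size=n)
    failed = t_failure < K * t_censor
    return np.where(failed, t_failure, t_censor), ~failed

def neg_log_lik(params, t, censored):
    log_theta, log_beta = params                 # log-scale keeps both positive
    theta, beta = np.exp(log_theta), np.exp(log_beta)
    z = (t / theta) ** beta
    log_f = np.log(beta / theta) + (beta - 1) * np.log(t / theta) - z
    return -(np.sum(log_f[~censored]) - np.sum(z[censored]))

theta_true, beta_true, K, n = 190.0, 3.4, 1.0, 1000
t, censored = generate_events(n, theta_true, beta_true, K, rng)

res = minimize(neg_log_lik, x0=np.log([np.mean(t), 1.0]),
               args=(t, censored), method="Nelder-Mead")
theta_hat, beta_hat = np.exp(res.x)
print(f"failure fraction = {1 - censored.mean():.2f}, "
      f"theta_hat = {theta_hat:.1f}, beta_hat = {beta_hat:.2f}")
```

Varying K and n in such a script is one way to check how the MLE accuracy depends on the fraction of failures in the sample.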