Why do we need to discretize continuous probability distributions?

More Sabina Shahin's questions See All

How can we deal with big decimal factorials in MATLAB?

i am working on some posterior distributions which involve big fractional values , i tried alot to figure out any possible way in Matlab ,but still i could not. any help and guidance in this...

06 July 2015 3,782 0 View

How can we use PCAs to compare two groups?

I want to compare a survey data for two groups using their corresponding PCAs but I have no clue that how can we use PCAs for comparison.

05 June 2014 2,484 6 View

How can we set a prior for scale parameter in the case of compound weibull distribution with two shape parameters?

Optimal bayesian parameter estimation of compound weibull distribution. Any good suggestion please?

05 June 2014 1,116 3 View

Can I use progressive left censoring for regressions models?

i want to research partially linear progressive censored regression models using heavy tailed distributions but I am unable to find some useful literature on progressive left censoring (i also...

31 December 2013 6,628 1 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Moinak Bhaduri

This might be motivated by the problem at hand. For instance, in reliability, failure data may be measured as discrete variables, like the number of shocks or cycles. Assigning a non zero prob to the number of shocks between 3.6 and 3.8 should be odd !!

There can be different ways of discretizing a continuous distribution, though, depending on the property we want to preserve.

Sabina Shahin

thank you Mr Moinak Bhaduri.

Jose A Fernandes

There is methods that allow you to avoid discretization e.g.:

https://www.researchgate.net/publication/223504164_Bayesian_classifiers_based_on_kernel_density_estimation_Flexible_classifiers?ev=prf_pub

However, discretization have advantages such as faster computation or more easy to show results to non-scientists:

https://www.researchgate.net/publication/215739042_Fish_recruitment_prediction_using_robust_supervised_classification_methods?ev=prf_pub

Article Bayesian classifiers based on kernel density estimation: Fle...

Article Fish recruitment prediction, using robust supervised classif...

Richard David Gill

We don't need to discretize. One can fit parametric models or smooth non-parametric curves if we need to using standard statistical principles e.g. maximum likelihood

Maria de Lourdes Centeno

Sometimes discretization of continuous random variables is very useful. An example is when you need to compute the distribution of a compound random variable. This is the case in actuarial science, where the aggregate claims are sum of individual claims, and the number of individual claims is itself a random variable. See for instance the book Loss Models, by Stuart A. Klugman, Harry H. Panjer, Gordon E. Willmot.

The following article can be very useful to answer you:

Chen, S., Pollino, C., 2012. Good practice in Bayesian network modelling. Environ.

Model. Softw. 37, 134e145.

Emiliano A. Valdez

This is an interesting question. For actuarial science and insurance, we also are sometimes interested in the opposite: continuitizing a discrete distribution. Such usually can be accomplished by introducing a so-called "jitters". Please refer to my paper with P. Shi on "Longitudinal Modeling of Insurance Claim Counts Using Jitters". Our purpose was mainly to be able to directly apply copulas to continuitized multivariate discrete random variables.

The discretization of continuous distributions has also been used in the actuarial literature. For example, in a life or mortality table, death rates are sometimes reported for age intervals.

Marek Wojciech Gutowski

I will side with Richard Gill. Don't do it, unless you really have to. This way you will certainly distort your real data. However:

- in physical sciences your "continuous" data are in fact discrete. This is because of finite resolution of independent variable, say temperature. Even the best temperature controler keeps it within some range. Even if you record the "momentarily values" of temperature, then you know it only up to the resolution of your thermometer (plus the inertia of a sensor ...). Of course, you can discretize such data using much wider "bins", but such a procedure will only work satisfactorily when the dependent variable in in fact temperature almost insensitive. Well, this example has (almost) no relation to probability distribution.

- in less precise sciences: we have to classify somehow things like "degree of angriness", say as "crazy", "extremely angry", "very angry", "angry", "moderately angry", "not angry at all". We simply don't know how to do better.

The unquestionable advantage of discretizing the probability distributions, those based on experimental data, is enormous economy of storage keeping such data.

There is also the question of how to discretize: how many bins? bins of equal or not equal width?, etc.

Thank you all i appreciate your help.:)

David Vose

People discretise continuous variables to calculate aggregate distributions (FFT, Panjer, etc) as already mentioned.

Discretising can also be used when you want to build a decision or event tree with a chance node that is a continuous variable.

Anan Ayoub

any software to discretize continuous probability distributions?