I am working on histograms because a histogram is a very parsimonious way of storing a distribution of observed values. To overcome the problem of choosing the bin width, I devised a method where, given the desired number of bins, the domain is partitioned into bins of different widths. OK, nothing new, just a piecewise interpolation of the distribution function. But I intend to compare the procedure against other methods. Kernel density estimation looks like the most competitive alternative (many references state its superiority in terms of consistency and a fast rate of convergence to the "true" density).

However, I ran a test to assess the superiority of KDE over "my histogram". I generated 1 million points from a mixture of two normals. I computed the KDE with the RBF (Gaussian) kernel, storing the estimated distribution function at 500 points, and I estimated "my histogram" with just 16 bins. Then I simulated 10k random queries for the probability of an interval of values. To my great surprise, "my histogram" is more accurate (in MSE) at predicting the interval probability than the KDE.
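To make the construction concrete, here is a minimal MATLAB sketch, under the assumption of equal-frequency (quantile-based) bins and a piecewise-linear interpolation of the empirical distribution function; the toy data and variable names are only illustrative, not my exact rule:

% Toy data (stand-in for the real sample) and one query interval
x = randn(1e5, 1);                   % placeholder sample
a = -0.5;  b = 1.0;                  % query interval [a, b]

% Variable-width histogram with nBins equal-frequency bins:
% edges at empirical quantiles, so each bin holds roughly the same count
nBins = 16;
p     = linspace(0, 1, nBins + 1);   % CDF value stored at each edge
edges = quantile(x, p);              % variable-width bin edges

% Probability of [a, b] by piecewise-linear interpolation of the
% stored empirical CDF (queries clamped to the observed range)
Fa = interp1(edges, p, min(max(a, edges(1)), edges(end)));
Fb = interp1(edges, p, min(max(b, edges(1)), edges(end)));
probAB = Fb - Fa;

Only the nBins+1 (edge, CDF) pairs need to be stored, which is what makes the representation so parsimonious.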

So, my question is: "Is it more correct (and/or useful) to predict density or probability?"

If you are curious, I have implemented the procedures in MATLAB.
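For reference, the comparison could be set up roughly as follows. This is only a sketch: the mixture weights, component parameters, and query-generation scheme are placeholders rather than the exact values I used, and ksdensity (Statistics Toolbox) plays the role of the Gaussian/RBF KDE evaluated on 500 grid points.

% --- Sketch of the comparison (placeholder parameters) ---
rng(1);
n = 1e6;  w = 0.5;                                   % assumed mixture weight
x = [randn(round(w*n), 1); 3 + 0.5*randn(n - round(w*n), 1)];

% KDE with a Gaussian kernel, evaluated on 500 grid points,
% then turned into a CDF by numerical integration
xk = linspace(min(x), max(x), 500);
fk = ksdensity(x, xk);
Fk = cumtrapz(xk, fk);

% Quantile histogram with 16 variable-width bins (see the sketch above)
nBins  = 16;
pEdges = linspace(0, 1, nBins + 1);
edges  = quantile(x, pEdges);

% 10k random interval queries [a, b] inside the observed range
nQ = 1e4;
q  = sort(min(x) + (max(x) - min(x)) .* rand(nQ, 2), 2);

% True, KDE-based, and histogram-based interval probabilities
trueF = @(t) w*normcdf(t, 0, 1) + (1 - w)*normcdf(t, 3, 0.5);
pTrue = trueF(q(:, 2)) - trueF(q(:, 1));
pKDE  = interp1(xk, Fk, q(:, 2)) - interp1(xk, Fk, q(:, 1));
pHist = interp1(edges, pEdges, q(:, 2)) - interp1(edges, pEdges, q(:, 1));

mseKDE  = mean((pKDE  - pTrue).^2);
mseHist = mean((pHist - pTrue).^2);

Note that the KDE has to be integrated numerically to answer interval-probability queries, which is exactly where the two estimators are compared.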
