Does binning the data lead to overfitting and misinterpretation of correlation?

More Mohammad Alem Sultani's questions See All

Binder for ZnO nanoparticles in graphite paste electrode?

I have been using paraffin, but the deposited ZnO still detach from electrode. What is the best binder to modify graphite paste electrode with ZnO nanoparticles?

03 August 2024 4,624 3 View

If the emission of all greenhouse gases is stopped now, will the temperature of the earth continue to rise?

Here's a serious question: If we were to stop emitting all greenhouse gases right now, would the Earth's temperature start to cool down, or would the existing greenhouse gases continue to warm the...

30 July 2024 5,123 4 View

Wrong out put for gmx x2top?

Dear users, I would like to simulate a zeolite structure in gromacs. I got the .cif file from IZA expanded in one direction in Materials Studio and exported a .pdb file. However, when I want to...

24 July 2024 367 4 View

Is there a good macro photography kit for on-site metallurgical investigation?

I am looking to purchase a photography kit for onsite metallurgical investigations, such as capturing photos of worn areas, hot tears, cold tears etc. Could anyone suggest me a good...

22 July 2024 3,490 2 View

The measurement of cellular uptake of nanoparticles?

What protocol can be used to measure the cellular uptake of a nanoparticle? Is there a way to get more of a non-targeted nanoparticle into the cell? Why a fluorescent nanoparticle does not enter...

18 July 2024 7,153 2 View

How to define a zeolite structure with rigid framwork?

Dear Gromacs users, I would like to simulate a zeolite-water system. In different literature, it is suggested to consider a rigid framework of zeolite to decrease the computational cost. I would...

15 July 2024 4,590 1 View

How to classify 3D Bioprinting as a part of Additive Manufacturing?

Hello, I have a problem in classification of "Additive Manufacturing" techniques. There are several classification for Additive Manufacturing e.g. Powder, Extrusion, Resin etc. Actually, I am OK...

15 July 2024 139 3 View

Is there a difference between the curriculum and the style of the study?

A question about scientific methodology

13 July 2024 2,160 8 View

How to calculate Cohen's d from CI 95 and t value from a paired sample t test?

We have conducted a systematic review to investigate the effectiveness of a treatment for a psychological disorder. We aim to report effect sizes and p values of the reviewed studies but one study...

10 July 2024 7,186 4 View

Authorship for data analysis?

I participated in 2 research projects in which I was responsible for part of data analysis and figures production. Also, I spent hours trying a new statistical approach -for me- to implement in...

10 July 2024 1,226 5 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Why does my protein refolded to beta sheet during thermal denaturation analysis?

Hi! So i attempted to understand a novel protein behavior towards heat application by analyzing its secondary structure change. I subjected the protein to a thermal denaturation analysis using...

06 August 2024 1,989 3 View

Blaine Tomkins

Assuming you're talking about Pearson correlation, you should not bin the raw data. One of the assumptions of the Pearson correlation test is the data are measured on a continuous scale (interval or ratio). If you group the data into bins, they are no longer continuous, but discrete.

David L Morgan

One thing that could degrade your correlation is the presence of outliers. I suggest you examine a scatter plot.

Mohammad Alem Sultani

Blaine Tomkins, Thank you for your answer. That is a good point you mentioned. I have seen in many publications that people use grouping data with Pearson's correlation analysis. However, I realized that a very high correlation can be observed in the case of grouping regardless of what type of correlation we apply.

David L Morgan Thank you for your answer. Yes, you are right there are significant outliers in my raw data.

Mohammad Alem Sultani Yes, unfortunately many published studies use Pearson's correlation with non-continuous data. If the data are not continuous (i.e., binned), authors should be using Spearman rank-order correlation since binned data are measured on an ordinal scale.