Statistical tests (t Student or Mann-Whitney tests) with enormous sample size - is it a good practice?

More Mateusz Soliński's questions See All

Do you think can be any Uranium bearing rocks in Eastern part of Iran and western part of Afghanistan?

I want to know more about Uranium ore deposits in world.

11 August 2024 6,720 0 View

Do you think can be any diamond bearing rocks in Eastern part of Iran and western part of Afghanistan?

I want to know more about diamond ore deposits in world.

11 August 2024 2,167 1 View

What is the difference between mathematical R^4 space and physical 4D unit space?

We assume that the difference is huge and that it is not possible to compare the two spaces. The R^4 mathematical space considers time as an external controller and the space itself is immobile in...

10 August 2024 6,678 14 View

If Banks do not provide credit facility, what are the options available for FPOs and impact on producer’s income?

10 August 2024 8,198 5 View

Controlling for pupil light reflex when analyzing pupil size time course?

I used eye tracking to examine how participants from two different populations (A and B) react to an image. Participants in population A exhibit larger pupil sizes over time, but they also have...

10 August 2024 3,229 0 View

What are a “Farmers Producer Organization” (FPO) and its essential features?

10 August 2024 477 5 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

Do interactions between biosphere, carbon cycle, & water cycle impact global warming & interaction between atmosphere & hydrosphere?

How do interactions between the biosphere, the carbon cycle, and the water cycle impact global warming and interaction between the atmosphere and the hydrosphere?

09 August 2024 3,291 2 View

How to get moment output in Abaqus Standart?

I have input a moment load in module load Abaqus, i put my moment load on the node surface (using reference point). I have define moment in history output and make a set for moment too. But the...

08 August 2024 4,831 4 View

How is energy cycled through the Earth's climate system and how do matter cycle and energy flow through the rock cycle?

08 August 2024 8,162 0 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Why do i have a weak plasma and no coating during rf sputtering?

I have been getting a low plasma and no coating while doing rf sputtering of copper doped ZnS using a power of 140 watts and at pressure of 6.5x10-3 Mb.

07 August 2024 4,526 4 View

How to assess the osmotic power of a substance?

Hello, I suspect that a molecule I am using is causing bacteria to experience osmotic shock. What chemical properties should I look for to compare this capability with those of other substances?...

06 August 2024 977 1 View

Why do we equate male and female arousal?

Women, on the other hand, can become physically aroused (increased blood flow in the reproductive organs) without becoming psychologically aroused even in the slightest. (Robert Weiss)

05 August 2024 9,537 2 View

Could dyes amplify the spectrum of light to a specific wavelength?

I am interested to know the behavior of dyes toward light. Specifically, Blue dyes re-emit the spectrum, especially from the green zone (known as principal in LED lamps, and blue dyes are known...

05 August 2024 3,290 1 View

How to report results of Generalised Linear Mixed Models in a journal article?

Hi everyone, If you have written or come across any papers where Generalised Linear Mixed Models are used to examine intervention (e.g., in mental health) efficacy, could you please share the...

04 August 2024 4,130 4 View

Do you make articles from the Journal of Hypertension available (not the American Journal of Hypertension)?

I haven't seen one in a while so I wondered if that journal prohibits that form of redistriibution.

04 August 2024 5,137 3 View

Which distribution type should I use when calculating the average particle size from TEM image? and how to calculate the error ?

average particle size calculation from TEM

04 August 2024 2,921 1 View

How to calculate effect size of AMCE (Average Marginal Component Effect) in Randomized Conjoint Experiment?

I am following Hainmueller, Hopkins, and Yamamoto's (2014) paper for my randomized conjoint experimental data analysis. The link to the paper is provided below. I received a comment from the...

02 August 2024 4,406 0 View

Mananalage Prabath Gayan Vijerathna

@Mateusz Soliński. We can't say much about the significant difference between variables by just looking at two values. We have to consider some other factors. As an example, if we are considering the Z test, it depends on both mean and standard deviation (standard error). So, two central tendency values can be very close, but the test result can be shown as significant. Evan 6.05 and 6.10 can be significantly different from each other due to lower variance.

If consider the two approaches you mentioned, it has to be decided according to your situation. However, the nonparametric test is less powerful than a parametric test. Also, non-parametric tests are better, if there is no way to use parametric tests. On the other hand, for better generalization parametric tests are better. I prefer a parametric test but decide it according to your data.

Another thing is that it is better to represent the results of students' T-test with mean and standard deviation. Then your interpretation will be very easy.

Jochen Wilhelm

You have 1000 replicate measurements from each subjects. These 1000 values are correlated and they should not be analyzed as if they were independent. So your model is wrong and you should identify a more sensible model. Eventually, the test of the difference between your groups should not have more than 98 degrees of freedom (it should have less, since a sensible model will surely include some other parameters than just the tow means). Having 1000 replicate measurements seems an overkill to me if there was no other aspect that should be considered in an analysis (like a change over time, with age, something like that). If there is nothing else that should be considered, the simplest analysis is to average the 1000 values per patient and do a t-test on 2x50 (averaged) values.

If you had a sample of independent thausands of samples per group, estimation would be mor interesting than testing. You should then better interpret the 95% confidence interval of the estimate (biological relevance) rather than the (in this respect silly) fact whether it is just in the positive or in the negative range.

Ronán Michael Conroy

A significant difference is not necessarily an important difference. With a large amount of data, it can amount to what G.K. Chesterton called a "tremendous trifle" (Chesterton himself was on the large side…)

This is where hypothesis testing once again isn't up to the job. You need to measure effect size.

Incidentally, if your hypertensives included isolated systolic hypertension, then by definition they will have a wider pulse pressure.

And beware : distributions of blood pressure in hypertensives are truncated by the very diagnostic process.

Is there an actual clinical question underlying this? What is it, if so? Just out of curiosity.

Sal Mangiafico

If you have many samples per individual, you might consider a mixed-effects (or hierarchical) model to account for this stratification. ... Or, it may be reasonable to just use the average, or median, or some other summation value per subject.
Yes, what test you choose should be based on the hypothesis you wish to test.
Really, don't use a test for normality, like Shapiro-Wilks, to determine what test you use.... It can be helpful to plot the residuals from a model. (Which is simple for a t-test).

Mahdi Khorsand Ghaffari

you could also analyze systolic or diastolic peaks. maybe you find better distribution or even significant differences

David A. Jones

Here you start by saying "My null hypothesis is: that there is no difference in distributions of the measured variable between analysed groups." But it seems that you are only considering differences in location. Some form of probability plot may reveal something important. The apparent presence of outliers on only one side of the distribution is also of some concern.