Hello, everyone

I would like to discuss a hypothesis testing problem with you.

Briefly:

About 350 thousand people had abnormal liver test results (abnormal AST and ALT), so they were tested for hepatitis C,

and only 38 of them actually carry the hepatitis C virus.

Another 850 thousand people were not tested for hepatitis C because their AST and ALT levels were normal

(so we do not know whether they have hepatitis C or not).

In this case, we want to compare two groups:

group C : all people who carry the hepatitis C virus

group D : all people who do not carry the hepatitis C virus

We want to test whether there is a significant difference between the mean BMI of group C and the mean BMI of group D.

(We also want to test, in the same way, whether there is a significant difference between the mean weight of group C and the mean weight of group D,

and so on for other variables.)
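
To make the comparison concrete, here is a minimal sketch of one candidate, Welch's unequal-variance t-test, written with the Python standard library only. It is an illustration under stated assumptions, not a recommended final method: the BMI values are simulated placeholders (not real data), the function name `welch_t` and the group means and spreads are hypothetical, and the sketch compares only the 38 confirmed positives with a stand-in sample of tested negatives. It does nothing about the 850 thousand untested people whose status is unknown.

```python
import math
import random
import statistics


def welch_t(sample_a, sample_b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    n_a, n_b = len(sample_a), len(sample_b)
    mean_a = statistics.fmean(sample_a)
    mean_b = statistics.fmean(sample_b)
    # Squared standard error of each mean: sample variance (n - 1) over n.
    se2_a = statistics.variance(sample_a) / n_a
    se2_b = statistics.variance(sample_b) / n_b
    t_stat = (mean_a - mean_b) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation; no equal-variance assumption.
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t_stat, df


random.seed(0)
# Simulated placeholders, NOT real data (assumed means and spreads).
bmi_c = [random.gauss(24.0, 3.5) for _ in range(38)]    # group C stand-in
bmi_d = [random.gauss(23.5, 3.5) for _ in range(5000)]  # group D stand-in

t_stat, df = welch_t(bmi_c, bmi_d)
print(f"t = {t_stat:.3f}, df = {df:.1f}")
```

With such unequal group sizes the degrees of freedom are driven almost entirely by the small group, so the precision of the comparison is limited by the 38 cases. If SciPy is available, `scipy.stats.ttest_ind(bmi_c, bmi_d, equal_var=False)` performs the same test and also returns a p-value.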

There are two serious problems in this situation:

First, for many people (those who were never tested), we do not know whether they belong to group C or group D.

Second, there is an extreme imbalance between the size of group C and the size of group D (group C is very small).

So I would like to ask:

What is the best strategy or test for this situation?

Thank you all.
