I have two dataframes coming from two populations that I want to compare. Rows are the features I want to compare and columns are individuals. The values are the copy numbers (only positive integer) of the features.

I want to find all features (rows) that are significantly 'different' between the two populations.

Either higher or lower, or more 'variable'. I am having a bit of trouble with this because of how the data looks. Most rows have the same value for all columns so im not sure if there is a formal statistic test applicable.

For example:

Pop 1

feature1 4 4 4 4 4 4 4 4 4 5

feature2 3 5 3 4 3 3 2 3 3 7

Pop 2

feature1 6 6 6 5 6 6 6 6 6 6

feature2 3 3 3 3 3 3 3 3 3 4

* Feature 1 has a higher copy number in Population 2.

* Feature 2 is more 'variable' in Population 1

What are good statistical parameters/tests to do this?

More Mike de Groot's questions See All
Similar questions and discussions