We have a proteomic dataset where many proteins have been identified in samples derived from two classes of samples with three biological replicates each. Each of these samples was analysed (injected into an LCMS instrument) twice, generating a technical duplicate (this means, that we have 12 LCMS results in total, six sets of duplicates). We have used a technique (SWATH) where we have essentially no missing values for any of the proteins.

When playing with the data (e.g. making a x-y graph where two different replicates are compared to each other), it is clear that (as expected) technical noise is much lower than biological noise (differences in protein abundances between different sample sources).

Can anybody advise me how this observation could be quantified? As we only have duplicates for each sample, we cannot work with standard errors when trying to capture technical variability. If we just take the difference of a value between technical duplicates, we neglect that we have duplicates for EACH sample. What does the average of the six differences tell us and how can this be related to the average of three means in each class?

Similar questions and discussions