I have multiplex protein levels from patient plasma and would like to compare expression patterns between groups. I plan to perform the following analyses: (1) hierarchical clustering of correlation coefficients to identify factors with similar expression patterns, (2) Mann-Whitney tests to identify differences in individual factors, and (3) non-metric multidimensional scaling (NMDS) followed by a permutation test to identify differences in multivariate patterns of expression.

I am stuck, however, at the pre-processing step: deciding how to handle values that fall below the limit of detection (LOD) of my assay. These values fall into three categories: (1) below the published LOD of the kit, (2) below the standard curve but extrapolated by the machine, and (3) reported as undetectable. In essence, I am deciding how to define the LOD of my assay given the discrepancy between the published and actual results. My inclination is to use the published LOD, but this eliminates a lot of potentially valid low values. Does anyone have experience with this type of analysis?
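
To make the question concrete, here is a rough Python sketch of the two pre-processing options I am weighing. Everything in it is an assumption for illustration: the function names, the example concentrations, the cut-offs, and the choice to substitute half of a threshold for censored values (one common convention, not necessarily the right one for this data). Undetectable readings are represented as NaN.

```python
import math

def is_missing(value):
    """True when the machine reported no number (hypothetical encoding: None or NaN)."""
    return value is None or math.isnan(value)

def classify(value, published_lod, lowest_standard):
    """Label one reading with the category it falls into."""
    if is_missing(value):
        return "undetectable"
    if value < published_lod:
        return "below_published_lod"
    if value < lowest_standard:
        return "extrapolated"  # below the standard curve, at or above the published LOD
    return "in_range"

def censor_at_published_lod(values, published_lod):
    """Option A: treat everything below the published LOD as censored (LOD/2 is an assumed fill value)."""
    return [published_lod / 2 if is_missing(v) or v < published_lod else v
            for v in values]

def keep_reported_values(values, published_lod):
    """Option B: keep every number the machine reported; fill only undetectable readings."""
    observed = [v for v in values if not is_missing(v)]
    # Assumed fill value: half the smallest reported number, so undetectable
    # readings still rank below every reported value.
    fill = min(observed) / 2 if observed else published_lod / 2
    return [fill if is_missing(v) else v for v in values]

# Made-up readings for one analyte (arbitrary units).
readings = [12.0, 3.1, 0.8, 0.2, float("nan")]
PUBLISHED_LOD = 1.0     # hypothetical kit value
LOWEST_STANDARD = 4.0   # hypothetical lowest point of the standard curve

labels = [classify(v, PUBLISHED_LOD, LOWEST_STANDARD) for v in readings]
option_a = censor_at_published_lod(readings, PUBLISHED_LOD)
option_b = keep_reported_values(readings, PUBLISHED_LOD)

print(labels)
print(option_a)
print(option_b)
```

With option A, every censored reading gets the same value, which creates ties in rank-based tests such as Mann-Whitney; option B preserves the ordering of the low values but trusts numbers the kit does not vouch for.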
Thank you in advance for your insights and advice.