I am planning to do a topomer comfa analysis of series with similar scaffolds of around 80 compounds which have been tested against cancer cells. The activity range is around 3 log units (nM to micromolar) with a fairly even spread. I have a couple of questions about this: a) can comfa analysis be done on whole cell data instead of isolated enzymes? b) some of the data have EC50 values reported as > some value instead of a definite number. How should these values be treated?
Discarding these compounds reduces the number of compounds available and the activity range? I was thinking of entering the biological activity values as the max value (for eg. >10 micromolar to be entered as 10 micromolar). Are there better approaches?
I did not want to do a comfa analysis since we don't know about the biologically active pose of these compounds. Can docking poses be considered to be the bioactive conformation for the purpose of comfa studies?