I've sampled the influent and effluent of six wastewater treatment plants and analyzed for a large number of trace organic chemicals.  There are many instances where the commercial lab conducting the analysis reports the concentration as not-detected (ND). This occurs most often for effluent samples, but occasionally for influent samples as well. 

I am interested in determining the percent removal (i.e. [[Cx,inf - Cx,eff]/Cx,inf]*100) for each compound in each treatment plant, and then calculating measures of central tendency and dispersion for those removals across the six treatment plants. 

Data reported as ND (censored data) obviously hinder the calculation of percent removal.

As a first approach, I replaced all values reported as ND with 0, but I am wondering if there is a better method. 

The analytical lab reports all values below the Method Reporting Limit (MRL) as ND, though the raw data shows that in some cases there was no instrument response (i.e. raw value = 0.0), while in other cases the raw value is equal to some value below the Method Reporting Limit (MRL).  It seems prudent to treat these cases differently when substituting values for censored data in order to calculate percent removals. 

An obvious choice seems to be substituting 0 when there is no instrument response and substituting the MRL when there is a measured value below the MRL. 

The substitution of MRL (vs. substituting 0) tends to yield more conservative percent removal values when NDs occur in the effluent, but can inflate percent removals when NDs occur for influent samples and always yields 0 percent removal when ND occurs in both the influent and effluent for a particular compound in a particular treatment plant.

Can anyone with experience calculating percent removals from censored data offer their advice? 

Similar questions and discussions