based on poisson distribution, probability of each nucleotide will be sequenced is P(Y=y)= (Cy)(e-c)/y!
y=number of times base is sequenced
C= coverage =NL/G = (#reads)(length of read)/genome size
if y=0 it mean that from the equation, I will get probability of no nucleotide sequenced because of 0 time each base sequenced
make probability into percentage is just multiply by 100, Now I got %nucleotide base that is NOT sequenced
I understood that this percentage multiply by genome size equal to total number of nucleotide that is NOT sequenced (total gap length) but I cannot imagine how
"the percentage multiply by number of reads equal to number of gap" WHY
there are more explanation about my question here
http://www.illumina.com/documents/products/technotes/technote_coverage_calculation.pdf
go to page 2, last paragraph
thank you in advance
Chooseel