based on poisson distribution, probability of each nucleotide will be sequenced is P(Y=y)= (Cy)(e-c)/y!

y=number of times base is sequenced

C= coverage =NL/G = (#reads)(length of read)/genome size

if y=0 it mean that from the equation, I will get probability of no nucleotide sequenced because of 0 time each base sequenced

make probability into percentage is just multiply by 100, Now I got %nucleotide base that is NOT sequenced

I understood that this percentage multiply by genome size equal to total number of nucleotide that is NOT sequenced (total gap length) but I cannot imagine how

"the percentage multiply by number of reads equal to number of gap" WHY

there are more explanation about my question here 

http://www.illumina.com/documents/products/technotes/technote_coverage_calculation.pdf

go to page 2, last paragraph

thank you in advance 

Chooseel

More Chooseel Bunsuwansakul's questions See All
Similar questions and discussions