I was given a maize SNP dataset in the HapMap format and I was curious how I can infer the genotype given this particular format (see picture below). I understand that for HapMAp data the second column contains the allele information (i.e. all possible alleles at this locus) and that columns 12 and onward contain the appropriate allele for the sequenced individual. However, I'm unsure of how to derive the genotype of a diploid when only one allele is given. I realise there is a function in R for reading raw HapMap data (see http://svitsrv25.epfl.ch/R-doc/library/snpMatrix/html/read.HapMap.data.html) but I'm not even sure it's still operational. Any information would be a help. Much thanks! 

More James R Adams's questions See All
Similar questions and discussions