01 January 2018 0 8K Report

My raw data is consist of rs IDs of the SNPs for a number of individuals. e.g. one column (G-G, G-G, A-G, A-A, ...). I want to convert it into numerical data. e.g. (0, 1, 2). 0 for the homogeneous allele, 1 for the heterogenous allele and 2 for the minor allele. Also, I want to know what could I do with missing data? Is there any code that could make this task?

Similar questions and discussions