Hi,
I'd like to analyse Genetic-Variation-to-CpG correlations, i.e. methylation QTLs (meQTLs). Obviously the distribution of methylation (beta-value) of any one CpG-probe is not-normal and heavily skewed to the right or the left. On top of that the genetic data is imputed, thus formatted as "dosage" data. I wonder what the best, practical method is in terms of:
I've been working with:
The current 1000G phase 3 dataset holds 40+ million well-imputable variants and the Methylation 450K array holds about 400k+ QC-passed CpGs. Thus the comparisons in any region will be many, on average 3,000-5,000 variants vs. 100-250 CpGs. And this would only be cis-acting methylation QTLs... In other words: your insight is highly appreciated! :-)
Thanks!
Sander
http://fastqtl.sourceforge.net