Hi all.

I have a chromosome segment of roughly 4 Mb that almost does not recombine (barely 1cM overall) but the genetic map was built using another population. The DNA sequence I have from the genome assembly consist of 19 scaffolds (between 3000bp and 1.6 million bp) that are mostly unordered (because of no recombination in the population used for the genetic map).

I have SNPS from RADsequencing of a single population (55 individuals).

I am trying to order somehow the scaffold in that region using a LD approach: I selected the 19 scaffolds and used the algorithm from Zaykin (program MCLD) to calculate r2 values for each SNP within and between scaffold.

My question is: is there a way to summarize the r2 of all the loci within a scaffold in order to get a small number of value per scaffold and use it to do a linear organization of the scaffolds?

For now, I want to suppose that the assembly is good, so I do not want to break the scaffolds. I cannot use any mapping software since it is not a cross, but I still have useful information in there.

I already thought of a Minimum Spanning Tree approach, but the problem resides in the translation of the cluster matrix to one providing a linear organization (i.e. the scaffolds found to be in one group are still not ordered...)

Anyone for an opinion?

Similar questions and discussions