I am studying fungal whole genome sequence specifically (basidiomycetes) secondary metabolite biosynthesis gene clusters. Antismash and SMURF have identified several terpenes, PKS, NRPS clusters but did not provide any putative end products from such clusters. But NapDos specified some end products and one of them (epothilone) was confirmed through analytical studies. When the biosynthesis cluster of epothilone was compared using local blast, it showed almost a 2kb similarity out of the 80 kb cluster for epothilone. However, the 2kb is not a continuous sequence (an average of 50 bp), sparsed among all the PKS clusters identified by ANtismash and SMURF. It may be noted that the biosynthesis gene cluster of epothilone till date is only confined within the bacterial kingdom.
Is sparsed sequence similarity relevant? Am I on the right track in searching sequence similarity among different kingdoms? It may be mentioned that the problem is similar for several other clusters. A valuable suggestion and comments will be highly appreciated.