I have three datasets A,B and C. Each includes tens of thousands of chemical compounds. We know that A[1], the first compound in A, is similar to B[30] and both are similar to C[9999] and we call this compound NEO; A,B &C share a lot of such compounds. I wanna show A, B and C in such a way that each compound is a dot (Properties of the compound such as molecular weight can be used as 2D or 3D coordinates) and I also wanna show that they all share NEO and many more of such shared molecules and many more molecules that are only in A and not others, etc,etc.

 A venn diagram would be the simplest way. But, can I make a venn diagram of the actual data, by showing each fragment as a dot and visualize the shared ones as overlapping? 

Or a plot like the microarray having bins for compounds with certain properties and showing A,B,C with different colors (or A&C, A&B, B&C and A&B&C with colors)?

I also came across hiveplot but I'm not sure if that is a good option or not! 

In reality A,B and C are datasets of compounds in natural products, FDA approved drugs and synthetic chemicals. I wanna visualize the distribution of these compounds, in other words which compounds are shared between these datasets. 

More Misagh Naderi's questions See All
Similar questions and discussions