I'm trying to convert chemical structures to ECFP data.

Buy, I have a problem about folding step.

I understand all of the process about generating ECFP data though D. Rogers and M. Hahn's paper (J. Chem. Inf. Model., Vol. 50, No. 5, 2010)

I used a pinky module in python for computing the ECFP of each molecules.

(https://github.com/ubccr/pinky/blob/master/pinky/fingerprints/ecfp.py)

the output of this function is as follow

{6456320269923861509: 1,

-3040533427843102467: 2,

-7329542376511023568: 1,

-5821485132112031149: 1,

-643847807504931861: 1,

3054809300354049582: 1,

-3679727481768249355: 1,

-2240115528993944325: 1,

5159885938473603439: 1,

1268207003089618622: 1,

267156486644197995: 1,

6401915128722912935: 1,

-8944122298402911035: 1,

-7116035920000285502: 1}

I know what it is and what it means.

but i don't know how to convert this data to binary data form.

In this website(https://docs.chemaxon.com/display/docs/extended-connectivity-fingerprint-ecfp.md), above identifiers are converted to a fixed-length bit string (folding process)

How to convert above atomic identifiers to the fixed-length bit string?

More Hyunwoo Yook's questions See All
Similar questions and discussions