I'm trying to convert chemical structures to ECFP data.
Buy, I have a problem about folding step.
I understand all of the process about generating ECFP data though D. Rogers and M. Hahn's paper (J. Chem. Inf. Model., Vol. 50, No. 5, 2010)
I used a pinky module in python for computing the ECFP of each molecules.
(https://github.com/ubccr/pinky/blob/master/pinky/fingerprints/ecfp.py)
the output of this function is as follow
{6456320269923861509: 1,
-3040533427843102467: 2,
-7329542376511023568: 1,
-5821485132112031149: 1,
-643847807504931861: 1,
3054809300354049582: 1,
-3679727481768249355: 1,
-2240115528993944325: 1,
5159885938473603439: 1,
1268207003089618622: 1,
267156486644197995: 1,
6401915128722912935: 1,
-8944122298402911035: 1,
-7116035920000285502: 1}
I know what it is and what it means.
but i don't know how to convert this data to binary data form.
In this website(https://docs.chemaxon.com/display/docs/extended-connectivity-fingerprint-ecfp.md), above identifiers are converted to a fixed-length bit string (folding process)
How to convert above atomic identifiers to the fixed-length bit string?