18 July 2021 7 5K Report

I've been working on some ML models for quite some time and recently had my curiosities about CNN's, I was wondering if there were ways to feed strings as inputs for a CNN model?

I worked in image classification with CNN, however, I couldn't find a straightforward way for utilizing strings directly or at least converting them to an image/graph intermediate, obviously taking a SS of the string and using it as input would be a silly approach.

The datasets I'm using are of chemical compounds (represented as strings of SMILES).

I would appreciate it if anyone could give some practical advice on how I could use these SMILES strings in a CNN or at least convert them into some form of an image/graph.

Thanks in advance :)

PS: I could also work with something like using X= [a1, a2, a3 ... an] as the input variable (where a's are different descriptors), which is what I did/do with ML models.

More Naeem A.'s questions See All
Similar questions and discussions