Hi all,

I want to write a Python script that will help me select upstream and downstream protein sequences.

I have a blastp results file (.txt) that contains protein IDs in the first column.

I also have a file containing all the protein sequences (ID + sequence).

I want to extract the sequences of the proteins listed in the blastp results, and also the proteins located immediately before and after each of them (a "guilt by association" approach).
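
To make the idea concrete, here is a rough sketch of what I have in mind. It rests on several assumptions that may not hold: the sequence file is in FASTA format, the order of records in that file reflects the order of the genes on the genome, the blastp file is whitespace- or tab-delimited with the ID in the first column, and the IDs match the first token of each FASTA header. The function names and the `flank` parameter are just placeholders I made up, and the sketch does not know about contig boundaries.

```python
def parse_fasta(lines):
    """Return a list of (protein_id, sequence) tuples, in file order.

    `lines` is any iterable of text lines, e.g. an open file handle.
    The ID is taken as the first whitespace-delimited token after '>'.
    """
    records = []
    current_id, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if current_id is not None:
                records.append((current_id, "".join(chunks)))
            parts = line[1:].split()
            current_id = parts[0] if parts else ""
            chunks = []
        elif current_id is not None:
            chunks.append(line)
    if current_id is not None:
        records.append((current_id, "".join(chunks)))
    return records


def parse_hit_ids(lines):
    """Collect the first column of each non-empty, non-comment line."""
    ids = set()
    for line in lines:
        if line.strip() and not line.startswith("#"):
            ids.add(line.split()[0])
    return ids


def select_with_neighbours(records, hit_ids, flank=1):
    """Return the hit records plus `flank` records on each side.

    Assumes (hypothetically) that neighbouring records in the file are
    neighbouring genes. Overlapping windows are merged, so no record
    is returned twice, and the original file order is preserved.
    """
    keep = set()
    for i, (protein_id, _) in enumerate(records):
        if protein_id in hit_ids:
            start = max(0, i - flank)
            stop = min(len(records), i + flank + 1)
            keep.update(range(start, stop))
    return [records[i] for i in sorted(keep)]


# Possible usage (file names are placeholders):
# with open("proteins.fasta") as fasta, open("blastp_results.txt") as blast:
#     records = parse_fasta(fasta)
#     hits = parse_hit_ids(blast)
# for protein_id, seq in select_with_neighbours(records, hits, flank=1):
#     print(f">{protein_id}\n{seq}")
```

If the file order does not follow the genome order, I suppose the neighbours would have to come from an annotation file (for example a GFF) instead, which this sketch does not handle.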

I am relatively new to Python, so I would appreciate advice on how to proceed. If you have any suggestions, please let me know.

Thank you so much in advance for your help!
