Hi all,
I want to write a Python script that will help me select upstream and downstream proteins sequences.
I have a blastp results file (.txt) that contains protein IDs in the first column.
On the other hand, I have a file containing all protein sequences (ID+sequences)
I want to "extract" protein sequences from the blastp results but also extract the proteins located before and after ("guilt by association" approach")
I am releatively new to Python so I just wanted advices on how to proceed? If you have any suggestions, please let me know.
Thank you so much in advance for your help!