Hello everyone!

I am currently working with transcriptomic data and building protein-protein interaction (PPI) networks. After filtering my data by fold-change and p-value, I ended up with quite a lot of disconnected nodes in my PPI network. I would therefore like to expand my current network using a whole-genome network as a template, in order to connect as many of the single nodes as possible. The main assumption is that not all proteins involved in a biological response will show a change in their transcript level, and that up-regulated proteins may interact with partners that are already present and whose levels do not change during the response. My goal, then, is to connect the maximum number of query nodes while adding the minimum number of new nodes.
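
To make the goal concrete, here is a rough sketch of the kind of expansion I have in mind. It is only a naive greedy heuristic, not what STRING does and not necessarily a good strategy. It assumes the whole-genome template is available as a symmetric adjacency dictionary (`background`, a made-up name), and it only considers one-hop connectors, i.e. a single added protein that directly links two or more query nodes. The function name `expand_with_connectors` is also just for illustration.

```python
def expand_with_connectors(background, seeds):
    """Greedily add non-seed nodes that join the most disconnected seed groups.

    background: dict mapping node -> set of neighbours (whole-genome template,
                assumed undirected, so every edge is stored in both directions)
    seeds:      iterable of query nodes (the proteins that passed the filters)
    Returns the list of added connector nodes, in the order they were chosen.
    """
    seeds = set(seeds)
    # Union-find over the seeds tracks which query nodes are already connected.
    parent = {s: s for s in seeds}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    def union(a, b):
        parent[find(a)] = find(b)

    # Query nodes that interact directly are connected from the start.
    for s in seeds:
        for n in background.get(s, ()):
            if n in seeds:
                union(s, n)

    # Candidate connectors: template neighbours of query nodes that are not
    # query nodes themselves.
    candidates = {n for s in seeds for n in background.get(s, ())} - seeds
    added = []
    while True:
        best, best_gain = None, 0
        for c in sorted(candidates):  # sorted only to make ties deterministic
            groups = {find(n) for n in background.get(c, ()) if n in seeds}
            gain = len(groups) - 1  # number of groups merged by adding c
            if gain > best_gain:
                best, best_gain = c, gain
        if best is None:
            break  # no remaining candidate links two separate groups
        touched = [n for n in background.get(best, ()) if n in seeds]
        for n in touched[1:]:
            union(touched[0], n)
        added.append(best)
        candidates.discard(best)
    return added
```

On a toy template where X interacts with A, B and C, and Y interacts with C and D, calling `expand_with_connectors(background, ["A", "B", "C", "D"])` would pick X first and then Y. Query nodes that can only be reached through two or more intermediate proteins stay disconnected with this approach, which is one reason I suspect a better strategy exists.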

The STRING database has an option for "adding more nodes to the current network", but, as far as I can tell, it usually enriches existing clusters rather than connecting single nodes. I also don't know what strategy STRING follows to choose which nodes to add. So, what would be the best network expansion strategy to connect single nodes?

Thanks in advance!
