I wanna study the relationship between images and texts. But the core research objective is texts. So how do I select a suitable theoretical framework? Is it enough to select visual grammar? Do I need to select system functional grammar? Please give me some advice. I will appreciate it. Thanks.