Generally, we prefer images with equal H and W. Many works always reshape the rectangular image firstly.
I wonder that does the rectangular image as the input (kernel size sets the same value for convience) affect the performance of the CNN-based models.