I am about to make a vehicle detection model which is able to detect all types of vehicles like bikes, motorcycles, cars, buses and trucks and return the detection boxes. Unfortunately I did not find good dataset in some categories online, so I decided to make one for myself. My questions:
1- what sizes of images should I include? is it even important? (the images probably are going to be resized to 224*224*3 for model input)
2- how can I gather images that has good coverage over the vehicle type?
3- how can I use saliency maps to improve my understanding of what my dataset lacks in features?