The idea is a bit abstract. A vehicle could be anything, including a two-wheeler, so the shape varies. The complexity increases because the vehicle has to be identified in a single image; otherwise, it would be possible to track moving objects against comparatively static surroundings.
After some initial image enhancement, I would try an edge detection scheme (Canny, etc.) to mark out regions. I can then extract shape/region features from the marked regions to investigate the presence of a vehicle. The system can first be trained on some prototype vehicles.
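To make the pipeline a bit more concrete, here is a rough sketch in plain Python, assuming the image is a grayscale list of rows of integers. It is not a real Canny detector: `gradient_edges` uses a simple thresholded gradient as a stand-in, and the function names, the threshold, and the chosen features (bounding box, aspect ratio, fill ratio) are my own assumptions for illustration.

```python
from collections import deque


def stretch_contrast(img):
    """Simple enhancement: rescale intensities linearly to the range 0..255."""
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    if hi == lo:  # flat image, nothing to stretch
        return [row[:] for row in img]
    return [[(v - lo) * 255 // (hi - lo) for v in row] for row in img]


def gradient_edges(img, thresh):
    """Binary edge map from central-difference gradients.

    A crude stand-in for Canny: no smoothing, no non-maximum
    suppression, no hysteresis. Border pixels are left as non-edges.
    """
    h, w = len(img), len(img[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = img[y][x + 1] - img[y][x - 1]
            gy = img[y + 1][x] - img[y - 1][x]
            if abs(gx) + abs(gy) >= thresh:
                edges[y][x] = 1
    return edges


def region_features(mask):
    """Label 8-connected regions of a binary mask and describe each one."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    feats = []
    for sy in range(h):
        for sx in range(w):
            if not mask[sy][sx] or seen[sy][sx]:
                continue
            # Breadth-first flood fill collects one connected region.
            queue = deque([(sy, sx)])
            seen[sy][sx] = True
            ys, xs = [], []
            while queue:
                y, x = queue.popleft()
                ys.append(y)
                xs.append(x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            bw = max(xs) - min(xs) + 1
            bh = max(ys) - min(ys) + 1
            feats.append({
                "bbox": (min(xs), min(ys), bw, bh),  # x, y, width, height
                "pixels": len(xs),
                "aspect": bw / bh,                   # width / height
                "fill": len(xs) / (bw * bh),         # edge pixels per bbox area
            })
    return feats


if __name__ == "__main__":
    # Synthetic test image: one bright rectangle on a dark background.
    img = [[0] * 16 for _ in range(12)]
    for y in range(3, 8):
        for x in range(4, 12):
            img[y][x] = 200
    for f in region_features(gradient_edges(stretch_contrast(img), 100)):
        print(f)
```

The feature dictionaries are what would be fed to whatever classifier gets trained on the prototype vehicles. In practice I would swap `gradient_edges` for a proper Canny implementation from an image processing library and add richer shape descriptors, since aspect ratio alone will not separate a car from a two-wheeler seen at arbitrary angles.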