Are you asking about mechanical design of such a robot or vision problem? Mechanically, simple employing inward-facing gripper wheels (say, silicone rubber) should do the trick. If you want to use chameleon-like manipulators that's more tougher.
If you're manually controlling the robot you don't really need computer vision, just a video stream.