Formally, a computer program should be able to scan such an image, apply image processing or any other required treatment to it, and produce the following:
1) Information about the multiple geometrical shapes stacked together in the front/side and top views of the image.
2) The correlation between the two views, since some shapes (or parts of a shape) are hidden in one view while their projections are visible in the other.
3) The relationships among shapes, such as the orientation of each shape with respect to the others.
4) Information about the dimensions, normally written as text beside the arrows.
5) Information about the arrows: single-headed, double-headed, straight, slanted, etc.
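
One possible way to hold the five outputs listed above is sketched below as a set of plain Python records. This is only an illustration of the target data, not a detection method: every class and field name (Shape, Arrow, Dimension, ViewCorrelation, Relation, DrawingDescription) is hypothetical, and the code assumes that "straight" means an axis-aligned arrow (horizontal or vertical) while anything else counts as "slanted".

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

Point = Tuple[float, float]  # (x, y) in image pixel coordinates (assumed)


@dataclass
class Shape:
    """Item 1: one geometrical shape found in one view."""
    shape_id: str
    kind: str             # e.g. "rectangle", "circle" (hypothetical labels)
    view: str             # "front", "side" or "top"
    outline: List[Point]
    hidden: bool = False  # True if the shape is hidden in this view


@dataclass
class ViewCorrelation:
    """Item 2: the same physical shape as seen in two different views."""
    shape_id_a: str
    shape_id_b: str


@dataclass
class Relation:
    """Item 3: relationship between two shapes, e.g. relative orientation."""
    shape_id_a: str
    shape_id_b: str
    kind: str                          # e.g. "stacked_on", "rotated"
    angle_deg: Optional[float] = None  # relative orientation, if known


@dataclass
class Arrow:
    """Item 5: an arrow described by its end points and arrowheads."""
    start: Point
    end: Point
    head_at_start: bool
    head_at_end: bool

    @property
    def heads(self) -> str:
        """Return "double", "single" or "none" from the two head flags."""
        if self.head_at_start and self.head_at_end:
            return "double"
        if self.head_at_start or self.head_at_end:
            return "single"
        return "none"

    @property
    def direction(self) -> str:
        """Return "horizontal", "vertical" or "slanted" (assumed convention)."""
        tol = 1e-6
        dx = abs(self.end[0] - self.start[0])
        dy = abs(self.end[1] - self.start[1])
        if dy <= tol:
            return "horizontal"
        if dx <= tol:
            return "vertical"
        return "slanted"


@dataclass
class Dimension:
    """Item 4: dimension text together with the arrow it is written beside."""
    text: str
    arrow: Arrow


@dataclass
class DrawingDescription:
    """Everything the program is expected to extract from one image."""
    shapes: List[Shape] = field(default_factory=list)
    correlations: List[ViewCorrelation] = field(default_factory=list)
    relations: List[Relation] = field(default_factory=list)
    dimensions: List[Dimension] = field(default_factory=list)
    arrows: List[Arrow] = field(default_factory=list)
```

For instance, Arrow((0, 0), (10, 0), True, True) would describe a double-headed horizontal arrow, and wrapping it in Dimension("25 mm", ...) would attach the dimension text to it. Filling these records from a real image (shape detection, text recognition, matching shapes across views) is the hard part and is not addressed by this sketch.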