What is your plan? You want to do coding CNN afresh? If so in C/C++ for ARM or for DSPs? If not, are you planning to use open source CNN for linux on ARM?
We will start with the simplest cnn example, the mnist. We will analyze it from the hardware point of view ( FPGA). Then we will trart to implement each layer and to compare it with the expected results from the python