I tried one simple 4 tap FIR filter implementing on Nexys-4 board. The designed was directly based on the block architecture of a trasposed form 4 tap FIR. Using Vivado i got the simulation correctly but the on the FPGA I am not getting the correct results. What could be the reasons?