I have designed a serial input based matrix multiplier architecture on FPGA. For smaller matrix size[ like 4x4,8x8] the resource utilization is very minimal. But when the matrix size goes beyond 16x16, the resource utilization exceeds way beyond hundred percentage. What I don't understand is, being the architecture stay fixed, how come the resource utilization increases with respect to input size.
Thanking you