Today we ran our first full implementation of an LSTM neural network on the Xilinx Zynq MPSoC ZCU102 platform!
Eight hardware accelerators help the ARM processor perform the successive stages of the network evaluation.
We have achieved a speedup of more than 20x compared with a pure-software implementation.
The project was developed entirely in the SDSoC environment.
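
For readers unfamiliar with what the "successive stages" of an LSTM evaluation look like, here is a minimal plain-C++ sketch of one LSTM time step, using the standard LSTM equations. It is not the project's accelerator code: the split into stages, the row-major weight layout, and all names (GateParams, LstmState, gate_preact, lstm_step) are assumptions made for illustration only.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical parameter layout for a single gate (illustration only):
// W is H x X (input weights), U is H x H (recurrent weights), b has H
// entries. Matrices are stored row-major.
struct GateParams {
    std::vector<float> W;
    std::vector<float> U;
    std::vector<float> b;
};

// Hidden state h and cell state c, each with H entries.
struct LstmState {
    std::vector<float> h;
    std::vector<float> c;
};

static float sigmoidf(float v) { return 1.0f / (1.0f + std::exp(-v)); }

// Stage 1: gate pre-activation z = W*x + U*h + b.
// These matrix-vector products dominate the arithmetic of an LSTM step.
std::vector<float> gate_preact(const GateParams& p,
                               const std::vector<float>& x,
                               const std::vector<float>& h) {
    const std::size_t H = p.b.size();
    const std::size_t X = x.size();
    std::vector<float> z(p.b);  // start from the bias
    for (std::size_t i = 0; i < H; ++i) {
        for (std::size_t j = 0; j < X; ++j) z[i] += p.W[i * X + j] * x[j];
        for (std::size_t j = 0; j < H; ++j) z[i] += p.U[i * H + j] * h[j];
    }
    return z;
}

// One full time step: four gate pre-activations, then the element-wise
// nonlinearities, the cell-state update, and the hidden-state output.
LstmState lstm_step(const GateParams& input_gate,
                    const GateParams& forget_gate,
                    const GateParams& candidate,
                    const GateParams& output_gate,
                    const std::vector<float>& x,
                    const LstmState& prev) {
    const std::vector<float> zi = gate_preact(input_gate, x, prev.h);
    const std::vector<float> zf = gate_preact(forget_gate, x, prev.h);
    const std::vector<float> zg = gate_preact(candidate, x, prev.h);
    const std::vector<float> zo = gate_preact(output_gate, x, prev.h);

    const std::size_t H = prev.h.size();
    LstmState next{std::vector<float>(H), std::vector<float>(H)};
    for (std::size_t k = 0; k < H; ++k) {
        const float i = sigmoidf(zi[k]);   // input gate
        const float f = sigmoidf(zf[k]);   // forget gate
        const float g = std::tanh(zg[k]);  // candidate cell value
        const float o = sigmoidf(zo[k]);   // output gate
        next.c[k] = f * prev.c[k] + i * g; // cell-state update
        next.h[k] = o * std::tanh(next.c[k]);
    }
    return next;
}
```

A sequence is evaluated by calling lstm_step once per input vector and feeding each returned state into the next call. In a hardware/software co-design, stages such as the matrix-vector products and the element-wise activations are the natural candidates for offloading, although the actual partitioning used in this project is not described here.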