The present paper documents the research towards the development of an efficient algorithm to compute the result from a multiple-input-single-output Neural Network using floating-point arithmetic on FPGA. The proposed algorithm focus on optimizing pipeline delays by splitting the "Multiply and accumulate" algorithm into separate steps using partial products. It is a revisit of the classical algorithm for NN computation, able to overcome the main computation bottleneck in FPGA environment. The proposed algorithm can be implemented into an architecture that fully exploits the pipeline performance of the floating-point arithmetic blocks, thus allowing a very fast computation for the neural network. The performance of the proposed architecture is presented using as target a Cyclone II FPGA Device.
An efficient architecture for floating point based MISO neural neworks on FPGA / Laudani A.; Lozito G.M.; Fulginei F.R.; Salvini A.. - ELETTRONICO. - (2014), pp. 12-17. (Intervento presentato al convegno 16th UKSim-AMSS International Conference on Computer Modelling and Simulation, UKSim 2014 tenutosi a Emmanuel College, gbr nel 2014) [10.1109/UKSim.2014.15].
An efficient architecture for floating point based MISO neural neworks on FPGA
Lozito G. M.;
2014
Abstract
The present paper documents the research towards the development of an efficient algorithm to compute the result from a multiple-input-single-output Neural Network using floating-point arithmetic on FPGA. The proposed algorithm focus on optimizing pipeline delays by splitting the "Multiply and accumulate" algorithm into separate steps using partial products. It is a revisit of the classical algorithm for NN computation, able to overcome the main computation bottleneck in FPGA environment. The proposed algorithm can be implemented into an architecture that fully exploits the pipeline performance of the floating-point arithmetic blocks, thus allowing a very fast computation for the neural network. The performance of the proposed architecture is presented using as target a Cyclone II FPGA Device.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.