Low Power HW Accelerator For FP16 Matrix Multiplications For Tight Integration Within RISC-V Cores | Yvan Tortorella, Luca Bertaccini, Davide Rossi, Luca Benini, Francesco Conti

By RISC-V Community News | August 3, 2022 | August 30th, 2022

The technical paper “RedMulE: A Compact FP16 Matrix-Multiplication Accelerator for Adaptive Deep Learning on RISC-V-Based Ultra-Low-Power SoCs” was published by researchers at the University of Bologna and ETH Zurich.

According to their abstract:
“One of the key stumbling stones is the need for parallel floating-point operations, which are considered unaffordable on sub-100 mW extreme-edge SoCs. We tackle this problem with RedMulE (Reduced-precision matrix Multiplication Engine), a parametric low-power hardware accelerator for FP16 matrix multiplications – the main kernel of DL training and inference – conceived for tight integration within a cluster of tiny RISC-V cores based on the PULP (Parallel Ultra-Low-Power) architecture.”

Find the technical paper here. Published April 2022.

