Hardware Implementation Study of Particle Tracking Algorithm on FPGAs
Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt
Standard
Hardware Implementation Study of Particle Tracking Algorithm on FPGAs. / Gabrielli, Alessandro; Alfonsi, Fabrizio; Annovi, Alberto; Camplani, Alessandra; Cerri, Alessandro.
I: Electronics, Bind 10, Nr. 20, 2546, 18.10.2021.Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Hardware Implementation Study of Particle Tracking Algorithm on FPGAs
AU - Gabrielli, Alessandro
AU - Alfonsi, Fabrizio
AU - Annovi, Alberto
AU - Camplani, Alessandra
AU - Cerri, Alessandro
PY - 2021/10/18
Y1 - 2021/10/18
N2 - In recent years, the technological node used to implement FPGA devices has led to very high performance in terms of computational capacity and in some applications these can be much more efficient than CPUs or other programmable devices. The clock managers and the enormous versatility of communication technology through digital transceivers place FPGAs in a prime position for many applications. For example, from real-time medical image analysis to high energy physics particle trajectory recognition, where computation time can be crucial, the benefits of using frontier FPGA capabilities are even more relevant. This paper shows an example of FPGA hardware implementation, via a firmware design, of a complex analytical algorithm: The Hough transform. This is a mathematical spatial transformation used here to facilitate on-the-fly recognition of the trajectories of ionising particles as they pass through the so-called tracker apparatus within high-energy physics detectors. This is a general study to demonstrate that this technique is not only implementable via software-based systems, but can also be exploited using consumer hardware devices. In this context the latter are known as hardware accelerators. In this article in particular, the Xilinx UltraScale+ FPGA is investigated as it belongs to one of the frontier family devices on the market. These FPGAs make it possible to reach high-speed clock frequencies at the expense of acceptable energy consumption thanks to the 14 nm technological node used by the vendor. These devices feature a huge number of gates, high-bandwidth memories, transceivers and other high-performance electronics in a single chip, enabling the design of large, complex and scalable architectures. In particular the Xilinx Alveo U250 has been investigated. A target frequency of 250 MHz and a total latency of 30 clock periods have been achieved using only the 17 & DIVIDE; 53% of LUTs, the 8 & DIVIDE; 12% of DSPs, the 1 & DIVIDE; 3% of Block Rams and a Flip Flop occupancy range of 9 & DIVIDE; 28%.
AB - In recent years, the technological node used to implement FPGA devices has led to very high performance in terms of computational capacity and in some applications these can be much more efficient than CPUs or other programmable devices. The clock managers and the enormous versatility of communication technology through digital transceivers place FPGAs in a prime position for many applications. For example, from real-time medical image analysis to high energy physics particle trajectory recognition, where computation time can be crucial, the benefits of using frontier FPGA capabilities are even more relevant. This paper shows an example of FPGA hardware implementation, via a firmware design, of a complex analytical algorithm: The Hough transform. This is a mathematical spatial transformation used here to facilitate on-the-fly recognition of the trajectories of ionising particles as they pass through the so-called tracker apparatus within high-energy physics detectors. This is a general study to demonstrate that this technique is not only implementable via software-based systems, but can also be exploited using consumer hardware devices. In this context the latter are known as hardware accelerators. In this article in particular, the Xilinx UltraScale+ FPGA is investigated as it belongs to one of the frontier family devices on the market. These FPGAs make it possible to reach high-speed clock frequencies at the expense of acceptable energy consumption thanks to the 14 nm technological node used by the vendor. These devices feature a huge number of gates, high-bandwidth memories, transceivers and other high-performance electronics in a single chip, enabling the design of large, complex and scalable architectures. In particular the Xilinx Alveo U250 has been investigated. A target frequency of 250 MHz and a total latency of 30 clock periods have been achieved using only the 17 & DIVIDE; 53% of LUTs, the 8 & DIVIDE; 12% of DSPs, the 1 & DIVIDE; 3% of Block Rams and a Flip Flop occupancy range of 9 & DIVIDE; 28%.
KW - Hough transform
KW - particle physics
KW - hardware accelerators
KW - particle tracking algorithms
KW - FPGAs
U2 - 10.3390/electronics10202546
DO - 10.3390/electronics10202546
M3 - Journal article
VL - 10
JO - Electronics
JF - Electronics
SN - 1450-5843
IS - 20
M1 - 2546
ER -
ID: 284621733