%0 Journal Article
%A FENG Zhen-Fu
%A GENG Li-Dong
%A GUO Jiang
%A PENG Xin-Long
%T Design and implementation of a multi-tile parallel scanning rasterization accelerator
%D 2024
%R 10.19682/j.cnki.1005-8885.2024.0009
%J Journal of China Universities of Posts and Telecommunications
%P 94-104
%V 31
%N 2
%X
In graphics processing unit (GPU) design, the speed of triangle rasterization is an important factor in overall GPU performance. This paper proposes an architecture for a multi-tile parallel-scan rasterization accelerator. The accelerator uses a bounding box algorithm to improve scanning efficiency. It rasterizes multiple tiles in parallel and scans multiple lines simultaneously within each tile. This highly parallel approach drastically improves rasterization performance. Synthesized with the 65 nm standard cell library of Semiconductor Manufacturing International Corporation (SMIC), the accelerator reaches a maximum clock frequency of 220 MHz. An implementation on the Genesys2 field programmable gate array (FPGA) board fully verifies the functionality of the accelerator. The implementation shows a significant improvement in rendering speed and efficiency, demonstrating its suitability for high-performance rasterization.
%U https://jcupt.bupt.edu.cn/EN/10.19682/j.cnki.1005-8885.2024.0009