参考文献/References:
[1] 李全.面向ARM嵌入式平台的卷积神经网络前向加速研究[D].武汉: 华中科技大学,2019.
[2] 陈朋,陈庆清,王海霞,等.基于改进动态配置的FPGA卷积神经网络加速器的优化方法[J].高技术通讯,2020,30(3):240-247.DOI:10.3772/j.issn.1002-0470.2020.03.004.
[3] NIKOLIC G S,DIMITRIJEVIC B R,NIKOLIC T R,et al.A survey of three types of processing units: CPU, GPU and TPU[C]//57th Iinternational Scientific Conference on Information, Communication and Energy Systems and Technologies.Ohrid: IEEE Press,2022:1-6.DOI:10.1109/ICEST55168.2022.9828625.
[4] 王江波.基于ZYNQ嵌入式平台的CNN图像识别加速器研究与实现[D].沈阳: 中国科学院大学(中国科学院沈阳计算技术研究所),2022.
[5] MA Yufei,CAO Yu,VRUDHULA S,et al.Optimizing loop operation and dataflow in FPGA acceleration of deep convolutional neural networks[C]//Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays.New York: Association for Computing Machinery,2017:45-54.DOI:10.1145/3020078.3021736.
[6] 帅禄玮,张柳欣,叶蕾,等.基于低误差并行计算加速的OFDR实时处理技术[J].中国激光,2024,51(14):233-242.DOI:10.3788/CJL231526.
[7] 高树静,王程龙,董廷坤.基于ZYNQ的优化Adaboost人脸检测[J].计算机工程与应用,2020,56(6):201-206.DOI:10.3778/j.issn.1002-8331.1812-0228.
[8] 嵇达龙,张尤赛,王亚军.基于ZYNQ的行人检测系统的设计与实现[J].计算机工程与设计,2020,41(1):238-245.DOI:10.16208/j.issn1000-7024.2020.01.039.
[9] LU Liqiang,XIE Jiaming,HUANG Ruirui,et al.An efficient hardware accelerator for sparse convolutional neural networks on FPGAs[C]//27th Annual International Symposium on Field-Programmable Custom Computing Machines.San Diego: IEEE Press,2019:17-25.DOI:10.1109/FCCM.2019.00013.
[10] BAI Lin,ZHAO Yiming,HUANG Xinming.A CNN accelerator on FPGA using depthwise separable convolution[J].IEEE Transactions on Circuits and Systems Ⅱ: Express Briefs,2018,65(10):1415-1419.DOI:10.1109/TCSII.2018.2865896.
[11] MILLóN R,FRATI E,RUCCI E.A comparative study between HLS and HDL on SoC for image processing applications[EB/OL].(2020-12-15)[2024-08-20] .https://doi.org/10.48550/arxiv.2012.08320.
[12] GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Columbus: IEEE Press,2014:580-587.DOI:10.1109/CVPR.2014.81.
[13] GIRSHICK R.Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision.Santiago:IEEE Press,2015:1440-1448.DOI:10.1109/ICCV.2015.169.
[14] REN Shaoqing,HE Kaiming,GIRSHICK R,et al.Faster R-CNN: Towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2016,39(6):1137-1149.DOI:10.1109/TPAMI.2016.2577031.
[15] REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas: IEEE Press,2016:779-788.DOI: 10.1109/CVPR.2016.91.
[16] CONG J,XIAO Bingjun.Minimizing computation in convolutional neural networks[C]//International Conference on Artificial Neural Networks.Cham: Springer International Publishing,2014:281-290.DOI:10.1007/978-3-319-11179-7_36.
[17] CIJOV A.Self-driving cars[EB/OL].(2021-12-08)[2024-08-20] .https://www.kaggle.com/datasets/alincijov/self-driving-cars.
[18] CHOI K,SOBELMAN G E.An efficient CNN accelerator for low-cost edge systems[J].ACM Transactions on Embedded Computing Systems,2022,21(4):1-20.DOI:10.1145/3539224.
[19] MAZZIA V,KHALIQ A,SALVETTI F,et al.Real-time apple detection system using embedded systems with hardware accelerators: An edge AI application[J].IEEE Access,2020,8:9102-9114.DOI:10.1109/ACCESS.2020.2964608.
[20] 戴振宇.基于ZYNQ的卷积神经网络加速设计与实现[D].呼和浩特: 内蒙古大学,2021.
[21] 李景阳.基于Zynq的热成像人体目标识别算法研究及硬件加速[D].成都: 电子科技大学,2023.
[22] YU Hao,LI Sizhao.A higher performance accelerator for resource-limited FPGA to deploy deeper object detection networks[C]//16th International Conference on Anti-Counterfeiting, Security, and Identification.Xiamen: IEEE Press,2022:1-5.DOI:10.1109/ASID56930.2022.9995953.
[23] LI Peng,CHE Cheng.Mapping YOLOv4-tiny on FPGA-based DNN accelerator by using dynamic fixed-point method[C]//12th International Symposium on Parallel Architectures, Algorithms and Programming.Xi’an: IEEE Press,2021:125-129.DOI:10.1109/PAAP54281.2021.9720468.
[24] XU Shanyong,ZHOU Yujie,HUANG Yourui,et al.YOLOv4-tiny-based coalgangue image recognition and FPGA implementation[J].Micromachines,2022,13(11):1983.DOI:10.3390/mi13111983.