ZW01a-21 Hardware Acceleration for AI Processing

ECE FYP
January 19, 2022 4:10 pm

Mr. Zheng,

  1. What is the bandwidth of the input buffer in your design?
  2. This is a dataflow design; how is handshaking done between each sub-processing block?
  3. What does your design do when the ReLU-to-RAM bus is busy?
CHENG, Yih
January 19, 2022 4:44 pm
Reply to  ECE FYP
  1. I am not sure, but it should depend on the block RAMs of the FPGA. In addition, the main bottleneck should be on the DRAM side rather than the BRAM side.
  2. To synchronize data between the sub-processing blocks in the current design, the downstream block waits for the previous block to finish all calculations and write to BRAM, and then it reads the results. There are obvious optimizations that can be made here, and they are still being worked on.
  3. As of now, the design waits until all data have finished writing to DRAM, then continues to the next set of data/images, as pipelining isn't implemented yet. The future plan is to pipeline these sub-blocks so that all resources are working on every clock cycle; however, this may trade off additional BRAM.
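The write-then-read synchronization described in point 2 can be illustrated with a minimal software model. This is only a sketch, not the actual RTL: the stage functions and buffer are illustrative stand-ins (a real BRAM handshake would use done/valid signals), and the toy `conv_stage`/`relu_stage` operations are assumptions, not the project's kernels.

```python
def conv_stage(pixels):
    """Toy stand-in for a convolution sub-block (doubles each value)."""
    return [p * 2 for p in pixels]

def relu_stage(pixels):
    """Toy stand-in for the ReLU sub-block (clamps negatives to zero)."""
    return [max(0, p) for p in pixels]

def run_sequential(image, stages):
    """Run the stages back-to-back: each stage reads the shared buffer
    (standing in for BRAM) only after the previous stage has completely
    written it, mirroring the non-pipelined scheme described above."""
    buffer = image                  # shared "BRAM" between stages
    for stage in stages:
        buffer = stage(buffer)      # full write completes before next read
    return buffer

result = run_sequential([-3, 1, 4, -1], [conv_stage, relu_stage])
# → [0, 2, 8, 0]
```

The pipelined version mentioned in point 3 would instead overlap the stages across consecutive images (e.g. with double-buffered BRAM), which is where the extra BRAM cost comes from.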
YIP, Kam Wai
January 19, 2022 4:07 pm

How many images per second can it infer? Can it perform real-time video processing?

CHENG, Yih
January 19, 2022 4:17 pm
Reply to  YIP, Kam Wai

Hello, I haven't tested how many images per second it can infer, as I am still working on some optimizations within the hardware design. As for real-time video processing, I am not sure whether it is possible, but I may try it out. Currently, the resource allocation is based on 20 images from the CIFAR-10 dataset. Thank you!!