Used Verilog to design an entire NPU that supports two different kinds of dataflows to enable data reuse in order to reduce power consumption and resource utilization
-
Updated
Jan 22, 2025 - Verilog
Used Verilog to design an entire NPU that supports two different kinds of dataflows to enable data reuse in order to reduce power consumption and resource utilization
Ultra-low-latency Ternary Neural Network inference engine for HFT alpha generation, optimized for an ultra-budget 1,120-LUT Renesas FPGA. Features a fully unrolled combinatorial popcount tree achieving 2-cycle deterministic execution without hardware multipliers, paired with an ESP32-S3 hardware-software co-design.
Add a description, image, and links to the neural-network-hardware topic page so that developers can more easily learn about it.
To associate your repository with the neural-network-hardware topic, visit your repo's landing page and select "manage topics."