Block Floating Point (BFP) and DSConv with GPU support
This repo implements BFP and DSConv as CUDA kernels for use with PyTorch.
- Python >= 3.6
- PyTorch >= 1.0
- CUB == 1.8
- Download CUB and put it in /home/your_username/libs/ (or the location indicated at NUQ/BlackBox/Quantization/src/setup.py:22).
- cd /path/to/NUQ/BlackBox/Quantization/src/ and then run python setup.py build_ext --inplace.
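
As background on what the BFP kernels compute, here is a minimal pure-Python sketch of the general block floating point idea: the values in a block share one exponent, and each value keeps only a small signed integer mantissa. This is not the repo's CUDA implementation. The function names `bfp_quantize` and `bfp_dequantize`, the `mantissa_bits` parameter, and the rounding and clamping choices are all assumptions made for illustration.

```python
import math


def bfp_quantize(block, mantissa_bits=8):
    """Quantize a block of floats to (shared_exponent, integer mantissas).

    Hypothetical helper for illustration; not part of this repo's API.
    """
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 0, [0] * len(block)
    # frexp returns max_abs = m * 2**e with 0.5 <= m < 1,
    # so every |x| in the block is strictly below 2**e.
    _, shared_exp = math.frexp(max_abs)
    # One bit is reserved for the sign, leaving mantissa_bits - 1 magnitude bits.
    scale = 2.0 ** (mantissa_bits - 1 - shared_exp)
    limit = 2 ** (mantissa_bits - 1) - 1
    # Round to the nearest integer and clamp to the representable range.
    mantissas = [max(-limit, min(limit, round(x * scale))) for x in block]
    return shared_exp, mantissas


def bfp_dequantize(shared_exp, mantissas, mantissa_bits=8):
    """Reconstruct approximate floats from a shared exponent and mantissas."""
    scale = 2.0 ** (shared_exp - (mantissa_bits - 1))
    return [m * scale for m in mantissas]


if __name__ == "__main__":
    exp, mants = bfp_quantize([0.5, -0.25, 0.125, 0.0])
    print(exp, mants)                  # 0 [64, -32, 16, 0]
    print(bfp_dequantize(exp, mants))  # [0.5, -0.25, 0.125, 0.0]
```

Values that are much smaller than the largest element of their block lose precision, because they are all scaled by the same exponent; this is the trade-off that makes the format cheap to store and compute with.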