GitHub - Qcompiler/AccText2Img: Accelerate the text to image by mixed precsion computing

An easy to use pluging for accelerate the SDXL. Just add the two line code before inference:

from mylinear import MixLinear_GEMM
torch.nn.Linear = MixLinear_GEMM

For FP16

srun -N 1 --gres=gpu:4090:1 python test.py

For mixed-int8

srun -N 1 --gres=gpu:4090:1 python test.py --mix_linear

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
demo		demo
.gitignore		.gitignore
README.MD		README.MD
gemm_a8w8.py		gemm_a8w8.py
mylinear.py		mylinear.py
mylinearfp16alg.py		mylinearfp16alg.py
mylinearfp8.py		mylinearfp8.py
mylinearsm90.py		mylinearsm90.py
out.txt		out.txt
redian.cpp		redian.cpp
test.py		test.py
testimg2vd.py		testimg2vd.py
w8a8.py		w8a8.py

Provide feedback