The original CUDA/Windows version has received some patches. Let's analyze them and determine whether we need to port them into our fork:
main...fspecii:HeartMuLa-Studio:main
Let's make sure the changes do NOT cause issues or break our code, since we have applied so many changes for MPS support!
Do not take risks: we'd rather not merge the changes and keep our app stable, possibly reimplementing them for our architecture.