Thank you for adding Flux2 Klein support. However at least for the 9B Version Layer Offloading seems broken. My PC has a 16 GB Nvidia GPU and 78 GB of RAM.
I got Transformer and Text encoder Quantizations at 3 Bit, Layer offloading for the transformer on 100% and Cache Text embeddings enabled. But I still get a OOM during transformer loading and also the RAM (not VRAM) is not getting fuller when loading the transformer.
Hope you can fix this issue.
Thank you for adding Flux2 Klein support. However at least for the 9B Version Layer Offloading seems broken. My PC has a 16 GB Nvidia GPU and 78 GB of RAM.
I got Transformer and Text encoder Quantizations at 3 Bit, Layer offloading for the transformer on 100% and Cache Text embeddings enabled. But I still get a OOM during transformer loading and also the RAM (not VRAM) is not getting fuller when loading the transformer.
Hope you can fix this issue.