Conversation
|
Currently, there are still some issues with the support for flux.2 klein. Padding needs to be applied during tokenization and |
|
So the |
|
It doesn’t work on my side. |
|
I think I’ve correctly fixed the attention_mask issue. |
Not sure about cfg, but they use Also they use (ref is 4b variants on hf) edit: cfg of 5 seems comparatively high for models that take larger llm embedding inputs. edit3: So cfg should be 1 for the distilled model. |
|
@leejet you talk about guidance scale, but your command only shows the cfg scale change. Or did you code the guidance scale? |
|
Oh and have you tried reference image(s) ? This is a clear advantage over eg z-image. |
guidance_scale in diffusers == --cfg-scale in sd.cpp |
Here I’m comparing the performance for T2I. Using a reference image means it’s image editing, which is a different task. Currently, z-image turbo does not support image editing. |
You are right, I did not know that.
Yes, I was asking because you did not show any examples yet. :) |
I’ve updated some examples of image editing. You can take a look. I think the overall quality of the image edits is pretty good. |
|
Hello, what i miss with default steps i get bad images, maybe i miss something? for example Z-Image is working full power . Maybe i miss something? there people already get ok images with 4 steps, but for me 4 steps is only a messy image. |
There are 2 versions. A distilled model and an undistilled model (base) which is what you are using. 4 steps will only give good results with the distilled version. |
Thank you for very useful note, i am new in AI stuff and this is good to know. I started to use Stable-Diffusion.cpp because i hate fat bloated software with passion and for me SDcpp is easier to use and is very portable, light, fast and do not depend on system paths etc. I presume this is general rule about distilled vs undistilled? Undistilled just needs a very high steps? |
This is getting a bit offtopic, so feel free to open a discussion for further questions. (or pm on tox or something). Generally there are different forms of "distillation". In this case here it was a step-distillation AND a cfg-distillation. Both reduce how often the diffusion model has to be run per image. Also generally, every model requires its own set of parameters. Some work better than others. |





Flux.2 klein 4B
Flux.2 klein 9B
Flux.2 klein 4B edit
Flux.2 klein 9B edit