Conversation

Replying to
Stable Diffusion is a great generalist model, but getting a certain style of output is pretty tricky, it usual needs some serious "prompt engineering" (which I am rubbish at). Fine tuning the model itself is an easy approach to focus on just what you want, if you have some data.
1
16
Once you have a fine tuned model, it can't help but generate Pokemon not matter the prompt you give it. So no more painstaking prompting required: "robotic cat with wings"
Image
2
37
This was the most naive approach to fine tuning, and it still worked really well. (Using the EMA weights is important btw!) I'm really excited about the possibilities of specialising this model to new areas in more sophisticated ways!
1
16
P.S. You might want to disable the safety checker, it seems to get very over-excited by the Pokémon style for some reason. And I've never managed to get an actually nsfw image out of it.
1
7
Replying to
The original .ckpt is massive because it has all the weight (ema and not) and the optimiser state in too. The hf diffusers version is a lot slimmer as it's just the ema model.
1
6
Show replies