Segmentation controlnet with an image of the cover template with different parts being different colors as the input. The segmentation controlnet makes Stablediffusion treats different colors as different things, so it keeps each part distinct. If you use colors that aren't in the actual color reference chart, it just sorta improvises, so I used black and white. On one I used the color that corresponds to painting,picture for the area inside the "frame" where the actual art is. Since I was doing it all in one generation with one prompt I just sorta had to accept what it gave me for the colors and textures of the text and borders.
I had fun trying to do this "in camera" without any editing after generation. I ended up using segmentation but after playing around with a lot of different options for what colors to use (trying stone, book, window, painting), I gave up and just made it black and white and still used segmentation.
It definitely depends on the model. But if you want better results you're going to want to use adetailer.
I threw your prompt into dreamshaper 6.31 (because it's the latest version I had on my server). Here's with no adetailer:
Here's with adetailer but using the exact same prompt as the main image (leaving both fields blank but turning adetailer on):
Here's with adetailer but putting only the expression part in the positive prompt. Negative prompt still the same as the main image.
(confused concerned upset questioning (funny fun amused (cringe (raised eyebrow) shy blush ashamed bruh face), <lora:add_detail:1.2>
Now, probably a good idea to have redhead woman in there or something, but my point is how rapidly you can improve things with very minimal effort. If you're willing to mess around with adetailer prompts and denoising strength, you can do a lot with faces.
Because other people care. As soon as you figure out a way to make sure everyone DOESN'T use gender as a proxy for sex, and sex as a way to decide what someone can or can't do, and how to treat them, then gender identity becomes nonsensical. But until then, it's a tool to navigate a world where people base an incredible amount of their concept of you on their perception of your gender.
These are very large (3000x2000), so here's an album link: https://imgur.com/a/LOZcR39
Love at first sight.
Obligatory gas masks and fire.
What the fuck does anything on this vehicle do?
Are you using any hand-specific negative embeddings? adetailer also can do a decent job of fixing hands with the hand segmentation.
Reminds me of the bastion cinematic. To the point where I'm guessing either a still from that, or fanart of that cinematic was in the training set.
I just noticed it doesn't put the literal prompt in the png info. The "dagger (weapon)" needs to be entered as "dagger \(weapon\)" so that the parenthesis around weapon get sent to CLIP
@wgty
@lemmy.world