Are these softwares all free to download/use? Also, how does one start doing this? do i just need the WebUi or do i need extra files to feed into it and stuff?
Yea the webui and stable diffusion models/checkpoints are generally free to use.
It's all free, yes.
For the how: Aside from the webui (or whatever else you're running the stable diffusion code on), you'll need models that instruct it what to create. The builtin models are terrible for porn.
I just figured it out tonight playing around with the links and readmes available above. If you get stuck I can try to answer more specific questions.
Hmm ill probably have mroe questions but for now im curious:
Thanks for your help! :)
Sure,
1: The initial download was pretty small ~10GB. But with the models, lora(s), extensions, ect. I'm up to ~60GB.
2: The guide at the top was pretty easy to follow. Install the dependencies, then install the UI. Launch and run. There is a bit of a learning curve with all of the options but so far it hasn't been too confusing.
3: That's where the extra models/lora(s) come in. Various models are trained in different styles, poses, actions, ect. The lora files are smaller things, like poses. IE: Cowgirl is it's own lora file that tells the model how to use the prompts you give.
how much space was the download(s)?
On my end, it's sitting at ~64GB (with btrfs compression shenanigans), though 60 of those are from all the models I have installed. The download would probably be ~2GB, even less if you disable downloading the "default" models with --no-download-sd-model
and instead pick models off of Civit or wherever manually.
Edit: Should have mentioned. Most full models are between 2-4 GBs each. Some can be 5+ but they tend to be "full" versions intended for merging & such. LoRAs are generally smaller. Depending on how much they're pruned they'll be anywhere between 10-100 MBs each.
how confusing is the software to use?
There's definitely a learning curve, yes. But there's plenty of resources (and more importantly, examples) out there.
what kind of limitations does the software have? can i do multiple people? monsters? futa? etc.
As long as you have the correct models set up it can generate basically anything. At least with anime models, monsters and futa are a given. Your main issue will probably be multiple people, although there are solutions to that. (See the multidiffusion upscaler GitHub repo on the main post)
Thanks! this is very helpful :). Also, and maaybe more importantly, idk how to download things off of github. I 'Go To File' and theres no download button. I'm sure im missing something simple lol
After extracting the WebUI files and doing the git clone, ive tried to double click webui-user.bat but the terminal that opens up says "python not found" but i've got python downloaded..
Not on Windows right now so I can't confirm, but you probably forgot to pick "Add Python to PATH" or whatever the option is in the installer. Try running the Python installer again, maybe it'll let you add it without needing to uninstall & reinstall
Edit: If you're on Nvidia, there seems to be a simpler install method now: https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-NVidia-GPUs (Method 1)
Hmm i uninstalled and reinstaqlled and i'm not seeing an option for "Add Python to PATH" Could there be an alternate name?
It's probably "Add python to environment variables." They must have changed the wording on that at some point.
I recommend following a setup tutorial on YouTube. They usually link to models and stuff youll need. Look for a recent one, this stuff evolves a lot.
Is it just me or do all the AI generated porn images have something in common that makes them immediately recognizable? I am not quite sure what exactly it is but they do all have some quality in common.
Its the uncanny valley effect I think. Your brain recognizes that something is wrong in the image.
That might be part of it but I think there is also a sort of sameness, certain aspects that are always the same in the generated images and more varied in real ones. Possibly related to the averaging of all the input data.
It would be amazing to have a list of remote web hosts that just take a prompt and spit out an image. Some I know:
I remember a quick and uncomplicated site that gave access to some models and took a prompt. It wasn't used too much and pretty fast if you selected a non default model. Sadly, I don't remember the name but I miss it. It didn't have a whole lot of bloated J's framework in the frontend like the ones mentioned above.
I can also recommend https://undressclothes.ai, as it is quite new, has free creds and works fast.
Forgot to add, when experimenting with prompts. Use a fixed seed number so you can see how your prompt changes effect the image between each generation
This is a great guide and was really helpful when I decided to experiment to see how this works.
A couple of things that confused me when trying this out that might save the next person some time:
where to put models etc *.ckpt and *.safetensors files live in stanle-diffusion-webui/models/stable-diffudion These will automatically be loaded when you new start wrbui-usr.bat
how to change models This took me waaay longer to figure out than I'd like to admit. There's a drop down top left of the webui to select the model after you restart
I find the range of models, loras, checkpoints, extensions etc overwhelming. Im still not sure exactly what each of these do and which ones I'd need. Eg: Whats a checkpoint for?
prompt writing is clearly a fine art and can drive you mad. For both 3&4 I found civitai.com/images to be a fantastic resource. Browse through the images for styles or images you like and most of them will have the resources used and generation data there to recreate it. I found this to be a great starting point, particularly for negative prompts.
deformity Deformed faces have mostly gone away for me by changing this webui setting: settings> face restoration > code former weight = 0 Just need to figure out hands and phantom limbs now...
Did anyone manage to find out what model sites like anydream.xyz might be using? I have hard time emulating the style..
Interesting model if you are into titfuck: https://civitai.com/models/7701/realistic-titfuck?modelVersionId=9042
So what settings do y'all use when playing around with prompts, before you generate a few really nice HQ ones?
I use 600x800 (or 800x600) and 50 steps when experimenting with prompts. Then when i get one i like, I lock the seed, maybe do a few .01-.03 variations. Then lock the variation seed and turn on 2x HighRes Fix. That outputs a 1200x1600 that very closely looks like what I expected. (I'm using a 3090 with 24GB vram)
In my experiments, i found that doing quick and smaller tests with a low number of steps, then increasing it for the highres would change the output too much. I settled on this so that theres less options to toggle back and forth.
I tried to install sd webui, Installed python, git. When I run the run.bat Error keeps on coming AttributeError: module 'OS' has no attribute 'statvfs'
Tried re installation of python with environment variable and max limit disable, still run into same attribute error. I am on windows 11, with nvidia Any pointers on what I am doing wrong
I highly recommend using Forge UI. You get the speed of ComfyUi with the simplicity of Automatic1111 and some cool extra features.
I recently came across Krita's AI plugin. It's pretty easy to install (especially if you have nVidia or ComfyUI already set up). And lets you easily fine tune your images.
Anyone else use the OpenPose control net? I'm a little confused on the head portion and which control node is doing what. Every thing else about the editor makes sense and is intuitive. Do the outer most nodes on the head affect head tilt and rotation? And are the inner nodes eye placement? this seems to be the case but I'd rather not end up confused and mad when I'm inevitably wrong and their heads are backwards and eldritch abominations. lol
I've been meaning to figure this out. So the commonly used ones are labeled WebUI, but just how much of the content goes to the web itself?
If I wanted to train on images that I don't want going online, will they? Or will the products that I create end up online, or does all this stay local?
By Web UI it means that the graphical part of it -- where you write your prompt and hit generate -- is running inside your browser and not as a separate window or command line. Everything is kept on your own computer unless you explicitly tell it to open up remote access.
It was probably a dumb question, but I wanted to clarify just the same, thanks. When ya get these kinds of pics under strict confidence, you don't wanna risk breaking that in any way.
It was probably a dumb question
No such thing as a dumb question. but yes the WebUI only exists so you dont have to type into a command prompt. It looks nice and pretty for humans, and then translates all those settings into a command that is then sent to stable diffusion. All of it stays 100% local, unless you go into the WebUI settings and tinker with remote access. But it is off by default
This is a question which bothers me about any AI image generation, but here people are focused on doing humans so I feel it's a good place to ask as with humans it hits harder:
Let's assume I download a recommended setup and I'm a total noob sitting down to generate from a prompt. How many misshapen, badly generated disturbing chaos mutants from beyond reality do I need to see before the AIs return a somewhat satisfying result? Is an "unsuccessful image" just a person with a blurred face or somewhat off fingers, or are we talking about full-blown body horror?
Since no one has answered yet I'll chime in with my experience using the Stablediffusion thing with all the 111's in it a few months ago.
It was super easy to use. I would set a prompt like "25 year old woman nude at the beach. Blonde hair blue eyes, thin small perky breasts, shaved pubic area" and run usually 10 batches of 5 images each. It would take 5-10 minutes usually, I have a 1080 TI, and I'd probably get 5-10 ""good" images and the rest would be trash. Some would have weird extra arms or be snake people with weird torsos. The most common issue I would have would be nipple and vulva placement, it can be weird sometimes. Not uncommon to have extra or no nipples, or extra breasts. Lots of barbie type pubic areas would show up.
I think part of my issue might have been the HD upscaling I was using, as I would see a quick glance of the initial render but by the time it upscaled it went funky.
I honestly just chocked it up to my lack of technical knowledge and possibly bad prompt writing.
However I do feel it was incredibly easy to generate at least some decent stuff for someone that has no coding experience or anything.
Thanks! The thing which bothers me isn't difficulty, rather how much of the painful distorted surrealism do I need to experience before getting something decent. You say you got 5-10 good images out of 50, but are the rest utter body horror or just odd limb or (as you write) awkward details? I can add I do freak out at the sight of the famous AI fingers if they are very bad tangled spaghetti of flesh.
Is anyone here using SD with Blender or for stuff other than NSFW too? Looking to see if a laptop embedded RTX 4050 or 4060 is viable? What is the real limit here? Like can anyone tell me something like "8gbv just can't do (XYZ)" or the iteration time becomes so long the workflow is not practical/sw crashes. Anyone running Linux on a 4050/4060? What kind of impact is there with storage speeds, what size of DDRx is used in practice?
For an excellent guide from beginning to doing advanced check out Olivio Sarikas on youtube. His has videos on controlnet, upscaling, training etc.