zydxt/sd-webui-rpg-diffusionmaster: RPG-DiffusionMaster Extension for A1111
Open link in next tab
GitHub - zydxt/sd-webui-rpg-diffusionmaster: Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
https://github.com/zydxt/sd-webui-rpg-diffusionmaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG) - GitHub - zydxt/sd-webui-rpg-diffusionmaster: Mastering Text-to-Image Diffusion: Recaptioning, ...
An implementation of: Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
You need GPT4-Azure or Gemini Pro to use it. Local LLMs support is still being worked on.