Git commit
3d6064b
Operating System & Version
Windows 10 Pro 22H2
GGML backends
Vulkan
Command-line arguments used
sd-cli.exe -M img_gen -p "person, formal clothes, city park" -n "ugly, bad art" --sampling-method dpm++2s_a --steps 6 -W 832 -H 1024 -b 1 --cfg-scale 1.5 -s -1 --clip-skip 2 --embd-dir F:\sdcppwebui\sd.cpp-webui\models/embeddings/ --lora-model-dir F:\sdcppwebui\sd.cpp-webui\models/loras/ -t 0 --rng cpu --sampler-rng cpu --lora-apply-mode auto -o F:\sdcppwebui\sd.cpp-webui\outputs/txt2img/20260509_184635_078417_6_steps.png --model F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors --vae F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors --type q4_K --scheduler karras --max-vram 3.8 --vae-tile-overlap 0.5 --vae-tile-size 32x32 --offload-to-cpu --vae-tiling --fa --diffusion-conv-direct --vae-conv-direct --mmap --color
Steps to reproduce
Run a standard image generation with --offload-to-cpu enabled.
What you expected to happen
An image would be generated.
What actually happened
After about a second, the generation is terminated.
Logs / error messages / stack trace
sd-cli.exe -M img_gen -p "person, formal clothes, city park" -n "ugly, bad art" --sampling-method dpm++2s_a --steps 6 -W 832 -H 1024 -b 1 --cfg-scale 1.5 -s -1 --clip-skip 2 --embd-dir F:\sdcppwebui\sd.cpp-webui\models/embeddings/ --lora-model-dir F:\sdcppwebui\sd.cpp-webui\models/loras/ -t 0 --rng cpu --sampler-rng cpu --lora-apply-mode auto -o F:\sdcppwebui\sd.cpp-webui\outputs/txt2img/20260509_184635_078417_6_steps.png --model F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors --vae F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors --type q4_K --scheduler karras --max-vram 3.8 --vae-tile-overlap 0.5 --vae-tile-size 32x32 --offload-to-cpu --vae-tiling --fa --diffusion-conv-direct --vae-conv-direct --mmap --color
stable-diffusion.cpp:212 - loading model from 'F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors'
[INFO ] model.cpp:219 - load F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors using safetensors format
[INFO ] stable-diffusion.cpp:280 - loading vae from 'F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors'
[INFO ] model.cpp:222 - load F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors using torch zip format
[INFO ] stable-diffusion.cpp:305 - Version: SDXL
[INFO ] stable-diffusion.cpp:333 - Weight type stat: f16: 2513
[INFO ] stable-diffusion.cpp:334 - Conditioner weight type stat: f16: 585
[INFO ] stable-diffusion.cpp:335 - Diffusion model weight type stat: f16: 1680
[INFO ] stable-diffusion.cpp:336 - VAE weight type stat: f16: 248
[INFO ]
Subprocess terminated.
Additional context / environment details
I downloaded and tested release executables until I found one that could generate with --offload-to-cpu enabled.
I tested different combinations of performance options, and every combination that included --offload-to-cpu terminated.
Without it I can generate images, but only within very limited VRAM.
[master-592-b8079e2] generates, but [master-593-3d6064b] and every release since, up to today, does not.
I am on a rather old, underpowered AMD system: a Ryzen 4500 and a 4 GB RX 480.