Skip to content

[Bug] offload-to-cpu casues the process to terminate #1483

@dm3rd

Description

@dm3rd

Git commit

3d6064b

Operating System & Version

windows 10 pro 22h2

GGML backends

Vulkan

Command-line arguments used

sd-cli.exe -M img_gen -p "person, formal clothes, city park" -n "ugly, bad art" --sampling-method dpm++2s_a --steps 6 -W 832 -H 1024 -b 1 --cfg-scale 1.5 -s -1 --clip-skip 2 --embd-dir F:\sdcppwebui\sd.cpp-webui\models/embeddings/ --lora-model-dir F:\sdcppwebui\sd.cpp-webui\models/loras/ -t 0 --rng cpu --sampler-rng cpu --lora-apply-mode auto -o F:\sdcppwebui\sd.cpp-webui\outputs/txt2img/20260509_184635_078417_6_steps.png --model F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors --vae F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors --type q4_K --scheduler karras --max-vram 3.8 --vae-tile-overlap 0.5 --vae-tile-size 32x32 --offload-to-cpu --vae-tiling --fa --diffusion-conv-direct --vae-conv-direct --mmap --color

Steps to reproduce

run standard gen with offload-to-cpu on

What you expected to happen

an image would generate

What actually happened

after a second the generation gets terminated

Logs / error messages / stack trace

sd-cli.exe -M img_gen -p "person, formal clothes, city park" -n "ugly, bad art" --sampling-method dpm++2s_a --steps 6 -W 832 -H 1024 -b 1 --cfg-scale 1.5 -s -1 --clip-skip 2 --embd-dir F:\sdcppwebui\sd.cpp-webui\models/embeddings/ --lora-model-dir F:\sdcppwebui\sd.cpp-webui\models/loras/ -t 0 --rng cpu --sampler-rng cpu --lora-apply-mode auto -o F:\sdcppwebui\sd.cpp-webui\outputs/txt2img/20260509_184635_078417_6_steps.png --model F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors --vae F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors --type q4_K --scheduler karras --max-vram 3.8 --vae-tile-overlap 0.5 --vae-tile-size 32x32 --offload-to-cpu --vae-tiling --fa --diffusion-conv-direct --vae-conv-direct --mmap --color

stable-diffusion.cpp:212 - loading model from 'F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors'
[INFO ] model.cpp:219 - load F:\sdcppwebui\sd.cpp-webui\models/checkpoints/prefectPonyXL_v50.safetensors using safetensors format
[INFO ] stable-diffusion.cpp:280 - loading vae from 'F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors'
[INFO ] model.cpp:222 - load F:\sdcppwebui\sd.cpp-webui\models/vae/irisXLVAE_luna.safetensors using torch zip format
[INFO ] stable-diffusion.cpp:305 - Version: SDXL
[INFO ] stable-diffusion.cpp:333 - Weight type stat: f16: 2513
[INFO ] stable-diffusion.cpp:334 - Conditioner weight type stat: f16: 585
[INFO ] stable-diffusion.cpp:335 - Diffusion model weight type stat: f16: 1680
[INFO ] stable-diffusion.cpp:336 - VAE weight type stat: f16: 248
[INFO ]
Subprocess terminated.

Additional context / environment details

I downloaded and tested exes untill I could generate with the offload-to-cpu on.
I tested with different combos of performance options and everyone with offload-to-cpu would always terminate.
without it I could generate images but would have very limited vram

[master-592-b8079e2] would generate but [master-593-3d6064b] and all release till today would not

I am on a rather old under powered amd system, ryzen 4500 and a 4gb rx480

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions