-
Notifications
You must be signed in to change notification settings - Fork 0
UPSTREAM PR #1254: feat: add support for SDXS-09 #52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Even though the name "SDXS-09" is similar to "SDXS", it is completely different from SDXS. For this reason that SDXS was renamed to "SDXS (DS)", where DS stands for the DreamShaper edition by IDKiro.
OverviewAnalysis of stable-diffusion.cpp compared 48,089 functions across two versions, finding 83 modified (0.17%), with no new or removed functions. The single commit adds SDXS-09 model variant support. Binaries Analyzed:
Power consumption improved marginally in both binaries despite localized function regressions, indicating stable overall performance. Function AnalysisIntentional Feature Additions (Expected Overhead): Four instances of Two instances of UNet lambda operator Standard Library Variations (No Source Changes): Three STL iterator functions in build.bin.sd-server show consistent regressions:
Performance Improvements:
Other analyzed functions showed negligible changes. Additional FindingsML Operations Impact: No analyzed functions are in the inference hot path. All changes affect initialization and model loading phases. The quantization improvement benefits quantized model workflows, while SDXS-09 additions enable new model variant support without inference overhead. Cross-Function Patterns: STL regressions show consistent absolute increases (~180ns for iterators) only in the server binary, suggesting binary-specific build configuration differences rather than code quality issues. Feature additions show predictable overhead proportional to added logic complexity. Net effect for quantized models is ~30ms improvement (quantization gains offset STL regressions); for non-quantized models, ~4.4ms overhead is negligible compared to typical model loading times (seconds). 🔎 Full breakdown: Loci Inspector. |
OverviewAnalysis of 48,089 functions across two binaries reveals minimal performance impact from SDXS-09 model support addition. Modified functions: 83 (0.17%), with no new or removed functions. Binaries analyzed:
Power consumption improved slightly in both binaries, indicating maintained energy efficiency despite localized regressions. Function AnalysisVersion Detection Functions ( UNet Attention Layer Factory ( Quantization Conversion ( STL Functions: Multiple standard library functions (map/vector iterators, smart pointers) show 40-300% throughput increases with 60-183ns absolute changes. No application source code modifications detected—regressions likely stem from compiler optimization differences (GCC 13, aarch64). These occur primarily during initialization, not inference hot paths. Additional FindingsThe quantization improvement offsets initialization overhead, resulting in net positive power efficiency. All performance-critical inference operations remain unaffected or improved. Cross-binary consistency confirms deterministic implementation of SDXS-09 support with minimal architectural impact. 🔎 Full breakdown: Loci Inspector. |
3ad80c4 to
74d69ae
Compare
Note
Source pull request: leejet/stable-diffusion.cpp#1254
Even though the name "SDXS-09" is similar to "SDXS", it is completely different from SDXS.
For this reason that "SDXS" was renamed to "SDXS (DS)", where DS stands for the DreamShaper edition by IDKiro.