Release 2.2.11: Add CogVideoX1.5-5B I2V Model Configuration #186

benliang99 · 2025-05-02T14:45:06Z

Release 2.2.11 – I2V Expansion & Task-Specific Prompt Generation Fixes

Release Date: May 2, 2025

Version 2.2.11 introduces configuration support for the CogVideoX1.5-5B image-to-video model and fixing image and video prompt generation logic by correctly utilizing task types for annotation enhancement. It adds prompt sanitation to address legal NSFW filtering limitations in the DeepFloyd/IF model.

Updates

Model Changes

New Model Added:
- Added THUDM/CogVideoX1.5-5B-I2V model with full pipeline configuration and support for I2V generation
- Enabled I2V model selection in randomized pipelines

Fixes

Prompt Generation:
- Now derives task type from the model group rather than uninitialized model_name
- Grouped models by task type to reduce prompt generator reload frequency
- Added detailed logging for better runtime visibility
DeepFloyd/IF NSFW Filtering:
- We found that this model automatically censors NSFW content with no configuration toggle due to legal constraints
- Automatic Prompt Sanitization:
  - If NSFW is detected, the prompt is automatically sanitized using the moderation LLM and generation is retried with a safer prompt (up to 3 attempts)

Technical Details

Key changes include:

Introduced CogVideoX1.5-5B-I2V into the I2V_MODELS pipeline
Reworked task-type inference logic in the prompt generator
Reduced overhead from redundant prompt generator loads
Improved logging granularity for synthetic data processes
Implemented DeepFloyd/IF NSFW detection and prompt sanitization loop

Impact

These changes strengthen the robustness of I2V generation by expanding model coverage and resolving incorrect task typing. Prompt generation is now more efficient and adaptable across task types.

Breaking Changes

Updated prompt generation to rely on model group task typing
Internal logic changes may affect custom model configurations relying on model_name-based task detection

Add THUDM/CogVideoX1.5-5B-I2V model configuration with pipeline settings and enable I2V in random model selection.

…neration - Fix task assignment by using model group's task type instead of uninitialized model_name - Group models by task type to minimize prompt generator reloading - Add detailed logging for better monitoring - Remove duplicate random import

… handling - Refactor PromptGenerator to add load_vlm() and load_llm() for separate model loading - Use only load_llm() in DeepFloyd/IF NSFW retry loop to reduce memory usage - Add is_black_image utility to image_utils.py - Ensure LLM is loaded before prompt sanitization and clear GPU after - Minor code cleanup and comments

Release 2.2.11: Add CogVideoX1.5-5B I2V Model Configuration

aliang322

Looks good!

benliang99 and others added 6 commits April 29, 2025 20:37

feat: add CogVideoX1.5-5B I2V model configuration

adeb9de

Add THUDM/CogVideoX1.5-5B-I2V model configuration with pipeline settings and enable I2V in random model selection.

version bump

7338cc8

Merge remote-tracking branch 'origin/testnet' into i2v

17813b8

Merge pull request #185 from BitMind-AI/i2v

fb91000

Release 2.2.11: Add CogVideoX1.5-5B I2V Model Configuration

benliang99 requested a review from aliang322 May 2, 2025 19:01

aliang322 approved these changes May 2, 2025

View reviewed changes

benliang99 merged commit b2b82de into main May 2, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Release 2.2.11: Add CogVideoX1.5-5B I2V Model Configuration #186

Release 2.2.11: Add CogVideoX1.5-5B I2V Model Configuration #186

Uh oh!

benliang99 commented May 2, 2025 •

edited

Loading

Uh oh!

aliang322 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Release 2.2.11: Add CogVideoX1.5-5B I2V Model Configuration #186

Release 2.2.11: Add CogVideoX1.5-5B I2V Model Configuration #186

Uh oh!

Conversation

benliang99 commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Release 2.2.11 – I2V Expansion & Task-Specific Prompt Generation Fixes

Updates

Model Changes

Fixes

Technical Details

Impact

Breaking Changes

Uh oh!

aliang322 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

benliang99 commented May 2, 2025 •

edited

Loading