Add Stable Diffusion lora pass #2296

xiaoyu-work · 2025-12-29T06:25:11Z

This pull request introduces significant improvements to the SD-LoRA data preprocessing pipeline, particularly enhancing support for DreamBooth-style datasets and improving the modularity and usability of related components. The main changes include adding explicit DreamBooth class image handling to both aspect_ratio_bucketing and image_resizing, refactoring dataset loading for HuggingFace datasets, and registering the SDLoRA pass in the configuration.

Enhancements to SD-LoRA data preprocessing and configuration:

DreamBooth support and preprocessing improvements:

Added explicit processing of DreamBooth class images in both aspect_ratio_bucketing and image_resizing, including resizing, bucket assignment, and output path management. This ensures that class images are handled consistently with instance images and that relevant metadata is tracked for downstream tasks. [1] [2]
Refactored the image_resizing function to use a helper for per-image processing, improved crop coordinate calculation, and ensured bucket assignment metadata is stored for all processed images. [1] [2] [3] [4]

Codebase and API improvements:

Updated the olive.data.component.sd_lora.__init__.py to explicitly export key preprocessing modules, making them more discoverable and easier to import elsewhere in the codebase.
Refactored ImageDataContainer to extract HuggingFace-specific parameters earlier in the dataset loading process, ensuring proper conversion and avoiding parameter leakage to downstream loaders.

Configuration updates:

Registered the SDLoRA pass in olive_config.json, specifying its dependencies, supported providers, and dataset requirements, enabling its use in Olive pipelines.

test/passes/diffusers/conftest.py

test/passes/diffusers/test_lora.py

Copilot

Pull request overview

This PR introduces a comprehensive SD-LoRA training pass that enables fine-tuning of Stable Diffusion models (SD 1.5, SDXL, and Flux) using LoRA adapters, with full support for DreamBooth-style training with prior preservation.

Key changes:

New SDLoRA pass with automatic model type detection and support for three diffusion architectures (SD 1.5, SDXL, Flux)
Enhanced data preprocessing pipeline with explicit DreamBooth class image handling in both aspect ratio bucketing and image resizing components
Validation improvements to DiffusersModelHandler with explicit model checking via model_index.json

Reviewed changes

Copilot reviewed 10 out of 12 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
olive/passes/diffusers/lora.py	New SDLoRA pass implementation with training loops for SD/SDXL (UNet-based) and Flux (DiT-based) models, including prior preservation loss for DreamBooth
olive/data/component/sd_lora/aspect_ratio_bucketing.py	Added explicit processing of DreamBooth class images including bucket assignment, resizing, and metadata tracking
olive/data/component/sd_lora/image_resizing.py	Refactored to support class image processing with helper functions for per-image operations and crop coordinate calculation
olive/data/component/sd_lora/init.py	Explicit exports of preprocessing modules for improved discoverability
olive/data/container/image_data_container.py	Refactored HuggingFace dataset loading to extract ImageDataContainer-specific parameters before passing to base loader
olive/model/handler/diffusers.py	Added validation via is_valid_model() checking for model_index.json, plus adapter_path property for LoRA weights
olive/olive_config.json	Registered SDLoRA pass with GPU-only support and sd-lora extra dependencies
test/passes/diffusers/test_lora.py	Comprehensive test suite covering SD 1.5, SDXL, Flux training, LoRA merging, and DreamBooth mode
test/passes/diffusers/conftest.py	Test fixtures for mock models, accelerator, and test image folders
test/model/test_diffusers_model.py	Updated tests to mock is_valid_model for network-free validation
test/passes/diffusers/init.py	Empty init file for test package structure

test/passes/diffusers/test_lora.py

olive/data/container/image_data_container.py

olive/model/handler/diffusers.py

olive/data/component/sd_lora/aspect_ratio_bucketing.py

olive/data/component/sd_lora/__init__.py

xiaoyu-work added 2 commits December 29, 2025 06:21

Add Stable Diffusion lora pass

1820307

Add unit test

0596d56

github-advanced-security bot found potential problems Dec 30, 2025

View reviewed changes

test/passes/diffusers/conftest.py Fixed Show fixed Hide fixed

test/passes/diffusers/test_lora.py Fixed Show fixed Hide fixed

xiaoyu-work added 2 commits December 30, 2025 02:43

Add diffusion model check

579ff00

fix test

8c6c91f

xiaoyu-work requested a review from Copilot December 30, 2025 05:43

Copilot started reviewing on behalf of xiaoyu-work December 30, 2025 05:43 View session

Copilot AI reviewed Dec 30, 2025

View reviewed changes

Update test

a87cfb1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Stable Diffusion lora pass #2296

Add Stable Diffusion lora pass #2296

Uh oh!

xiaoyu-work commented Dec 29, 2025

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add Stable Diffusion lora pass #2296

Are you sure you want to change the base?

Add Stable Diffusion lora pass #2296

Uh oh!

Conversation

xiaoyu-work commented Dec 29, 2025

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants