Skip to content

Implementation of CSD loss #31

@spacewanderx

Description

@spacewanderx

Hi, thank you for this open-sourced project.

I am wondering why is the gradient in the CSD loss defined to be pred_fake_latents - pred_real_latents, not pred_real_latents - pred_fake_latents?

Based on what I understand, in such a VSD-like formulation, you want pred_real_latents to represent the real distribution, which should lead to pred_real_latents - pred_fake_latents as the gradient? (references: SwiftBrush, ProlificDreamer)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions