Any fine-tuning tips? #39

@tsutsui22

Hi, I have recently been trying to fine-tune the NitroGen model.
Out of the box, the model does not seem to know what to do: what counts as an enemy, what to shoot, or where to go.
So I wanted to adapt NitroGen to one specific game situation, turning it into a specialist while leveraging its generalist capability.

I ran into several problems during fine-tuning.
First, the weights provided at https://huggingface.co/nvidia/NitroGen/blob/main/ng.pt have an action dimension of 25, not 21.
How is the 25-dimensional action used? How are the 21 gamepad dimensions mapped to the action dimensions in these weights?
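
To narrow down where the 25 comes from, it can help to list every tensor in the checkpoint that has a 25-sized axis. The sketch below is only an illustration: the parameter names and shapes in `example_shapes` are hypothetical, since the real key layout of `ng.pt` is not known here, and the commented-out loading code assumes the file is a plain state dict or has one nested under a key such as `state_dict`.

```python
from collections.abc import Mapping


def find_tensors_with_dim(shapes: Mapping, dim: int) -> dict:
    """Return the entries whose shape has `dim` on at least one axis."""
    return {
        name: tuple(shape)
        for name, shape in shapes.items()
        if dim in tuple(shape)
    }


# Hypothetical parameter names and shapes, for illustration only.
example_shapes = {
    "action_encoder.weight": (1024, 25),
    "action_decoder.weight": (25, 1024),
    "action_decoder.bias": (25,),
    "blocks.0.attn.qkv.weight": (3072, 1024),
}

matches = find_tensors_with_dim(example_shapes, 25)
for name, shape in matches.items():
    print(name, shape)

# With PyTorch installed, the shapes could be collected from the real file
# roughly like this (assumption: the checkpoint is a dict of tensors, or
# wraps one under "state_dict"):
#   import torch
#   ckpt = torch.load("ng.pt", map_location="cpu")
#   state = ckpt.get("state_dict", ckpt) if isinstance(ckpt, dict) else ckpt
#   shapes = {k: tuple(v.shape) for k, v in state.items() if hasattr(v, "shape")}
#   print(find_tensors_with_dim(shapes, 25))
```

Knowing which layers carry the 25-sized axis at least shows whether the mismatch sits only in the action input/output projections or elsewhere too.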

Because of this mismatch, I have not been able to evaluate NitroGen's true capability, and it is probably also why my fine-tuning failed. The semantic space that the DiT at NitroGen's core expects for its inputs and outputs is misaligned with my setup, so training is chaotic and the model cannot learn anything meaningful.
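
Until the mapping is clarified, one possible stopgap is to keep the pretrained 25-dimensional action head and pad the 21-dimensional gamepad targets up to 25, masking the loss on the padded slots. This is a rough sketch under a strong assumption: that the 4 extra dimensions are the trailing ones, which the checkpoint does not confirm. `pad_action` and `masked_mse` are hypothetical helpers, not part of NitroGen.

```python
def pad_action(action, target_dim=25):
    """Zero-pad an action vector to `target_dim` and return (padded, mask).

    The mask is 1.0 for real gamepad dimensions and 0.0 for padding.
    ASSUMPTION: the extra dimensions are the trailing ones.
    """
    if len(action) > target_dim:
        raise ValueError("action is longer than target_dim")
    n_pad = target_dim - len(action)
    padded = list(action) + [0.0] * n_pad
    mask = [1.0] * len(action) + [0.0] * n_pad
    return padded, mask


def masked_mse(pred, target, mask):
    """Mean squared error over the unmasked dimensions only."""
    total = sum(m * (p - t) ** 2 for p, t, m in zip(pred, target, mask))
    count = sum(mask)
    return total / count if count else 0.0


gamepad_action = [0.5] * 21            # one 21-dim gamepad action
padded, mask = pad_action(gamepad_action)

# A prediction that is exact on the 21 real dims but arbitrary on the 4
# padded dims contributes no loss, because those dims are masked out.
prediction = [0.5] * 21 + [9.0] * 4
loss = masked_mse(prediction, padded, mask)
print(len(padded), loss)
```

If the extra dimensions turn out to be interleaved rather than trailing, the padding positions and the mask would need to follow the actual layout.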

If you have any tips, please share them, especially on how fine-tuning should be conducted.
I'm also curious about the learning rate, scheduler settings, and which parts to freeze.
Thank you!
