Any fine-tuning tip?

Hi, I recently try to fine-tune NitroGen model.
It's because NitroGen model looks it does not know what to do, what's enemy, what to shoot, where to go.
Thus, I aimed NitroGen to be accustomed a specific game situation, making it a specialist leveraging its generalist capability.

On fine-tuning, there were several problems.
First, the provided weight here https://huggingface.co/nvidia/NitroGen/blob/main/ng.pt does not have action dim of 21, but 25.
How does action dim of 25 was used? How does mapping between 21 dim of gamepad and action dim set up on the weight?

Since there exists mismatch, I haven't been able to check NitroGen's true capability. Probably, I have failed to fine-tune because of this mismatch. The semantic space that DiT, NitroGen's core, expects as input and output is misaligned with my setting, causing total chaos and can't learn anything meaningfull. 

If you have any tips, please teach me, especially how should fine-tuning be conducted.
I'm also curious about learning rate, scheduler setting, what to freeze.
thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Any fine-tuning tip? #39

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Any fine-tuning tip? #39

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions