-
Notifications
You must be signed in to change notification settings - Fork 203
Description
Hi, I recently try to fine-tune NitroGen model.
It's because NitroGen model looks it does not know what to do, what's enemy, what to shoot, where to go.
Thus, I aimed NitroGen to be accustomed a specific game situation, making it a specialist leveraging its generalist capability.
On fine-tuning, there were several problems.
First, the provided weight here https://huggingface.co/nvidia/NitroGen/blob/main/ng.pt does not have action dim of 21, but 25.
How does action dim of 25 was used? How does mapping between 21 dim of gamepad and action dim set up on the weight?
Since there exists mismatch, I haven't been able to check NitroGen's true capability. Probably, I have failed to fine-tune because of this mismatch. The semantic space that DiT, NitroGen's core, expects as input and output is misaligned with my setting, causing total chaos and can't learn anything meaningfull.
If you have any tips, please teach me, especially how should fine-tuning be conducted.
I'm also curious about learning rate, scheduler setting, what to freeze.
thank you!