I have loaded the weights into the MineCLIP model for both the attn and avg variants to observe how the generated reward varies over a video of a user playing Minecraft while following one of the provided tasks. The reward stays nearly constant throughout the video regardless of the text prompt. In addition, when I feed the model randomly generated frames or zeroed frames, the output is still very similar.
I am fairly certain that my video loading and weight loading follow the existing documentation, so I am wondering whether the currently uploaded weights for either variant are incorrect.
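For reference, the comparison described above can be sketched roughly as follows. This is only an illustration, not the MineCLIP API: `make_control_clips`, `reward_spread`, `stuck_score`, and `sensitive_score` are hypothetical names, the clip shape and prompts are arbitrary placeholders, and the two toy scorers stand in for whatever function wraps the model and returns one reward per (clip, prompt) pair.

```python
import numpy as np


def make_control_clips(clip, seed=0):
    """Return the real clip plus random-noise and all-zero controls of the same shape."""
    rng = np.random.default_rng(seed)
    noise = rng.integers(0, 256, size=clip.shape, dtype=np.uint8)
    zeros = np.zeros_like(clip)
    return {"real": clip, "random": noise, "zeros": zeros}


def reward_spread(score_fn, clips, prompts):
    """Score every (clip, prompt) pair and summarise how much the reward moves."""
    table = np.array(
        [[float(score_fn(c, p)) for p in prompts] for c in clips.values()]
    )
    return {
        "table": table,  # rows: clips, columns: prompts
        "range_over_clips": table.max(axis=0) - table.min(axis=0),
        "range_over_prompts": table.max(axis=1) - table.min(axis=1),
    }


# Hypothetical stand-ins for the real model wrapper (assumptions, not MineCLIP code).
def stuck_score(clip, prompt):
    """Mimics the symptom: the same reward for any input."""
    return 0.25


def sensitive_score(clip, prompt):
    """Toy scorer that reacts to both the frames and the prompt."""
    return clip.mean() / 255.0 + 0.01 * len(prompt)


# Arbitrary placeholder clip of shape (frames, channels, height, width).
clip = np.random.default_rng(1).integers(0, 256, size=(4, 3, 8, 8), dtype=np.uint8)
clips = make_control_clips(clip)
prompts = ["shear sheep", "milk cow"]  # placeholder prompts

report_stuck = reward_spread(stuck_score, clips, prompts)
report_ok = reward_spread(sensitive_score, clips, prompts)
print("stuck scorer, range over clips:", report_stuck["range_over_clips"])
print("toy scorer, range over clips:  ", report_ok["range_over_clips"])
```

With the real model plugged in as `score_fn`, a range near zero across both clips and prompts would match the behaviour reported here, while a healthy model would be expected to separate real frames from the noise and zero controls.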