-
Notifications
You must be signed in to change notification settings - Fork 6
Open
Description
Hi! I really enjoyed your paper and the code. Do you have some more details regarding the published Italian and Dutch models?
Specifically:
- for how many steps did you train only the embeddings?
- for how many steps did you train the full model?
- at which batch size / learning rate?
Sorry if this is already mentioned somewhere in the Repo or the paper, I wasn't able to find it. I am trying to use some of your results in my work, so this information would help a lot :) Thanks!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels