Details on training of published models

Hi! I really enjoyed your paper and the code. Do you have some more details regarding the published Italian and Dutch models?

Specifically:
- for how many steps did you train only the embeddings?
- for how many steps did you train the full model?
- at which batch size / learning rate?

Sorry if this is already mentioned somewhere in the Repo or the paper, I wasn't able to find it. I am trying to use some of your results in my work, so this information would help a lot :) Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Details on training of published models #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Details on training of published models #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions