Skip to content

Details on training of published models #2

@bminixhofer

Description

@bminixhofer

Hi! I really enjoyed your paper and the code. Do you have some more details regarding the published Italian and Dutch models?

Specifically:

  • for how many steps did you train only the embeddings?
  • for how many steps did you train the full model?
  • at which batch size / learning rate?

Sorry if this is already mentioned somewhere in the Repo or the paper, I wasn't able to find it. I am trying to use some of your results in my work, so this information would help a lot :) Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions