Add pytorch/training/gpu/2.3.1/transformers/4.48.0/py311/Dockerfile #134
alvarobartt merged 15 commits into main
This commit also contains some formatting improvements that make the `Dockerfile` easier to debug, such as indenting continuation lines when a command spans multiple lines, so it is clear they belong to the unindented command above. It also sets bash as the default shell and fixes the `gcloud` CLI installation.
Bump the `transformers` dependency to 4.48.0 to support the ModernBERT architecture, and bump `diffusers` to include new video and image generation pipelines along with other features, improvements, and bug fixes. Additionally, the `Dockerfile` formatting has been fixed.
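As a rough illustration of the formatting conventions these commit messages describe (bash as the default shell, continuation lines indented under the command they belong to), a `Dockerfile` fragment might look like the sketch below. The base image and the package list are placeholders chosen for illustration; they are not taken from this PR.

```dockerfile
# Hypothetical base image; the actual DLC base image is not shown in this conversation.
FROM ubuntu:22.04

# Use bash instead of the default /bin/sh for all subsequent RUN instructions.
SHELL ["/bin/bash", "-c"]

# Continuation lines are indented so it is clear they belong to the RUN above.
RUN apt-get update && \
    apt-get install -y --no-install-recommends \
        ca-certificates \
        curl && \
    rm -rf /var/lib/apt/lists/*
```

With this layout, an unindented line always starts a new instruction, which makes it easier to see which step failed in a build log.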
pytorch/training/gpu/2.3.0/transformers/4.47.0/py311/Dockerfile → pytorch/training/gpu/2.3.0/transformers/4.48.0/py311/Dockerfile
pytorch/training/gpu/2.3.0/transformers/4.48.0/py311/Dockerfile → pytorch/training/gpu/2.3.0/transformers/4.47.1/py311/Dockerfile
philschmid left a comment
What's the time benefit of uv? Not sure if we should add another dependency for a few seconds faster build time of containers. Especially with the alias: e.g. when adding custom dependencies in training jobs, what if they are not supported, etc.
So it's mainly for both the
Ok @philschmid after checking, apparently
Let's remove it and maybe revisit in a few months.
pytorch/training/gpu/2.3.0/transformers/4.47.1/py311/Dockerfile → pytorch/training/gpu/2.3.q/transformers/4.47.1/py311/Dockerfile
pytorch/training/gpu/2.3.q/transformers/4.47.1/py311/Dockerfile → pytorch/training/gpu/2.3.1/transformers/4.47.1/py311/Dockerfile
pytorch/training/gpu/2.3.1/transformers/4.47.1/py311/Dockerfile → pytorch/training/gpu/2.3.1/transformers/4.48.0/py311/Dockerfile
Description
This PR bumps the dependencies to release a new PyTorch DLC for training with improvements, support for newer model architectures, bug fixes and much more.
Additionally, besides the version bumps, this PR also includes the `gcloud` CLI and installs `huggingface_hub` with the `hf-transfer` utility for improvements on download/upload speed to the Hugging Face Hub.
Note
This PR will enable the example on how to fine-tune PaliGemma 2 with TRL to be shipped within #133
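The dependency changes described above could be expressed in a `Dockerfile` roughly as follows. This is a minimal sketch, not the PR's actual contents: the base image is a placeholder, only the `transformers` pin comes from the description, and setting `HF_HUB_ENABLE_HF_TRANSFER` in the image is an assumption about how the faster transfers would be switched on.

```dockerfile
# Hypothetical base image with Python 3.11; the actual base image is not shown here.
FROM python:3.11-slim

# Pin transformers to the version named in the PR and install huggingface_hub
# together with its hf_transfer extra.
RUN pip install --no-cache-dir \
        "transformers==4.48.0" \
        "huggingface_hub[hf_transfer]"

# hf_transfer is opt-in: huggingface_hub only uses it when this variable is set.
# Setting it at image level is an assumption, not something stated in the PR.
ENV HF_HUB_ENABLE_HF_TRANSFER=1
```

Leaving the variable unset would keep the default download path while still shipping the package, which may be preferable if some training jobs hit issues with the accelerated transfer.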