Skip to content

Conversation

@arilotter
Copy link
Collaborator

@arilotter arilotter commented Jan 25, 2026

passes both train test and compiling cudnn stuff on h100

add vllm package and update the world

fixes some cuda lib path problems in vllm
updates mdbook
fixes some clippy lints

later, we should consider splitting the vllm code into a separate set of deps, so the regular psyche binaries don't get vllm included

Original PR was #464

@arilotter arilotter force-pushed the vlllm branch 6 times, most recently from 1d0f34e to 3fcc4c5 Compare January 26, 2026 23:08
nix/python.nix Outdated
# packages that we provide to the venv via nix derivations
topLevelNixPkgs = [
"torch"
"vllm"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know if we want to support vllm for aarch64-darwin but I think this is not compiling on that architecture:

error: Package ‘python3.12-vllm-0.14.0’ in /nix/store/m6ji2a8jp7w4ck7vvrjh4grs4p3n7r26-source/pkgs/development/python-modules/vllm/default.nix:553 is not available on the requested hostPlatform:
         hostPlatform.system = "aarch64-darwin"

Maybe we can move inside the isLinux check

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done - also made the inference node packages not build at all on macos since they won't work without vllm

@IAvecilla
Copy link
Contributor

I tried running nix develop and it's building a lot of Python packages (e.g. python3.12-keras-3.13.1, python3.12-flash-attn-2.8.2). Is that expected now or should I see that only when using nix develop .#python?

@arilotter
Copy link
Collaborator Author

I tried running nix develop and it's building a lot of Python packages (e.g. python3.12-keras-3.13.1, python3.12-flash-attn-2.8.2). Is that expected now or should I see that only when using nix develop .#python?

This is intended - the #python devshell builds the python psyche extension, but the normal devshell still links against python deps. If this change isn't workable, I can drop python otherwise, it just means we have to feature-gate the inference stuff to the python feature.

@IAvecilla
Copy link
Contributor

This is intended - the #python devshell builds the python psyche extension, but the normal devshell still links against python deps. If this change isn't workable, I can drop python otherwise, it just means we have to feature-gate the inference stuff to the python feature.

Oh okay. No worries, if we see that the decentralized tests don’t crash because of that, I think we’re good

@arilotter arilotter force-pushed the vlllm branch 2 times, most recently from a9dc40f to a403d98 Compare January 28, 2026 01:22
- add vllm package and update the world

fixes some cuda lib path problems in vllm
updates mdbook
fixes some clippy lints

later, we should consider splitting the vllm code into a separate set of
deps, so the regular psyche binaries don't get vllm included

Original PR was #464
should drop many minutes from each test since we don't have to re-build
them every time :3
- only build vllm / inference on linux
- fix test build binary copies
- use optionalAttrs et al. everywhere
@arilotter
Copy link
Collaborator Author

oh my god the tests passed im gonna cry :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants