Deploying Inference Nodes on RunPod

Quick Start

1. Use the ready-made RunPod template:

2. Register the node with the Network Node:

curl -X POST http://NETWORK_NODE_HOST:9200/admin/v1/nodes \
     -H "Content-Type: application/json" \
     -d '{
       "id": "node1",
       "host": "NETWORK_NODE_HOST",
       "inference_port": 5000,
       "poc_port": 8080,
       "max_concurrent": 500,
       "models": {
         "Qwen/Qwen3-235B-A22B-Instruct-2507-FP8": {
           "args": [
             "--max-model-len",
             "240000",
             "--tensor-parallel-size",
             "4"
           ]
         }
       }
     }'
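
For scripted deployments, the same registration request can be built programmatically. The following is a minimal Python sketch, not part of the repository: the function names (`build_node_payload`, `build_register_request`, `register_node`) are hypothetical, and the field names, ports, and endpoint path are copied from the curl command above. It assumes the admin endpoint accepts the request without authentication, as the curl example suggests.

```python
import json
import urllib.request


def build_node_payload(node_id, host, model, max_model_len, tensor_parallel_size,
                       inference_port=5000, poc_port=8080, max_concurrent=500):
    """Build the JSON body with the same fields as the curl example."""
    return {
        "id": node_id,
        "host": host,
        "inference_port": inference_port,
        "poc_port": poc_port,
        "max_concurrent": max_concurrent,
        "models": {
            model: {
                # Arguments are passed as strings, exactly as in the curl payload.
                "args": [
                    "--max-model-len", str(max_model_len),
                    "--tensor-parallel-size", str(tensor_parallel_size),
                ]
            }
        },
    }


def build_register_request(network_node_host, payload, admin_port=9200):
    """Prepare (but do not send) the POST request to the admin endpoint."""
    url = f"http://{network_node_host}:{admin_port}/admin/v1/nodes"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def register_node(network_node_host, payload, timeout=30):
    """Send the registration request and return the HTTP status code."""
    request = build_register_request(network_node_host, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status
```

Calling `register_node("NETWORK_NODE_HOST", build_node_payload("node1", "NETWORK_NODE_HOST", "Qwen/Qwen3-235B-A22B-Instruct-2507-FP8", 240000, 4))` would send the same body as the curl command; replace the placeholder host with your own value first.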

Detailed Documentation

For a detailed deployment guide, see README_RU.md (in Russian).
