RLVR experiments

Launch cluster

The following steps launch n EC2 instances (n is set with the -n flag), configured and ready for experiments.

# 0. Clone repo
git clone https://github.com/halflearned/rlvr-experiments.git
cd rlvr-experiments

# 1. Create SSH key (if needed)
# Add --profile <your_profile> if you use a named AWS CLI profile
aws ec2 create-key-pair --key-name rlvr-key --query 'KeyMaterial' --output text --region us-west-2 > ~/.ssh/rlvr-key.pem
chmod 400 ~/.ssh/rlvr-key.pem

# 2. Create cluster (waits for instances)
python infra/launch.py create -n 2

# Or use an existing VPC (if you've hit VPC limits):
# python infra/launch.py create -n 2 --vpc vpc-xxxxx --subnet subnet-xxxxx

# 3. Check status / get IPs
python infra/launch.py status

# 4. SSH and start Ray on head node
ssh -i ~/.ssh/rlvr-key.pem ubuntu@<HEAD_IP>
cd /efs/rlvr-experiments && source .venv/bin/activate
ray start --head
python infra/launch.py set-head $(hostname -I | awk '{print $1}')  # save head IP

# 5. On each worker node (SSH in and activate the venv as in step 4), join Ray; status shows the exact command
ray start --address=<HEAD_PRIVATE_IP>:6379

# 6. Run training
python entrypoints/train_grpo.py configs/qwen3-06B-base.yaml

# 7. Cleanup
python infra/launch.py delete
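
Step 5 has to be repeated on every worker, so with more than a couple of nodes it may be convenient to loop over them from your local machine. The following is a minimal sketch, not part of infra/launch.py: the names remote_join_cmd, join_workers, KEY and DRY_RUN are hypothetical, and it assumes the worker IPs are the ones reported by `python infra/launch.py status` and that the remote login shell is bash. Setting DRY_RUN=1 prints the ssh commands instead of running them.

```shell
# Sketch: join several workers to the Ray head in one loop.
# All function and variable names here are illustrative assumptions.
KEY="${KEY:-$HOME/.ssh/rlvr-key.pem}"

# Build the command each worker runs to join the head on port 6379.
remote_join_cmd() {
  local head_ip="$1"
  printf 'cd /efs/rlvr-experiments && source .venv/bin/activate && ray start --address=%s:6379' "$head_ip"
}

# Usage: join_workers <HEAD_PRIVATE_IP> <WORKER_IP>...
join_workers() {
  local head_ip="$1"; shift
  local ip
  for ip in "$@"; do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      # Dry run: show what would be executed on this worker.
      echo "ssh -i $KEY ubuntu@$ip \"$(remote_join_cmd "$head_ip")\""
    else
      ssh -i "$KEY" "ubuntu@$ip" "$(remote_join_cmd "$head_ip")"
    fi
  done
}
```

For example, `DRY_RUN=1 join_workers <HEAD_PRIVATE_IP> <WORKER_IP_1> <WORKER_IP_2>` prints one ssh command per worker, which can be reviewed before running the same call without DRY_RUN.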

Scale cluster

Scale up:

python infra/launch.py scale -n 4  # set total instances
python infra/launch.py status      # shows head node and ray command

# SSH to new node and join Ray (status shows the exact command)
ssh -i ~/.ssh/rlvr-key.pem ubuntu@<NEW_NODE_IP>
cd /efs/rlvr-experiments && source .venv/bin/activate
ray start --address=<HEAD_PRIVATE_IP>:6379

Scale down:

# Stop Ray on ALL worker nodes first (AWS picks which to terminate)
# On each non-head node:
ray stop

# Then scale from local machine
python infra/launch.py scale -n 2

# On each remaining worker, rejoin Ray
ray start --address=<HEAD_PRIVATE_IP>:6379
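
The ordering in the scale-down steps matters: Ray is stopped on every worker before the instance count is reduced. A rough sketch of a local wrapper that enforces that order is shown below. The names run, scale_down, KEY and DRY_RUN are hypothetical and not part of the repo; the wrapper assumes the worker IPs come from `python infra/launch.py status`. It deliberately leaves the rejoin step manual, since which workers survive is only known after scaling.

```shell
# Sketch: stop Ray on all workers, then scale down. Names are illustrative assumptions.
KEY="${KEY:-$HOME/.ssh/rlvr-key.pem}"
REMOTE_STOP='cd /efs/rlvr-experiments && source .venv/bin/activate && ray stop'

# Run a command, or just print it when DRY_RUN=1.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Usage: scale_down <TARGET_TOTAL_INSTANCES> <WORKER_IP>...
scale_down() {
  local target="$1"; shift
  local ip
  for ip in "$@"; do
    # Abort before scaling if any worker could not be stopped.
    run ssh -i "$KEY" "ubuntu@$ip" "$REMOTE_STOP" || return 1
  done
  run python infra/launch.py scale -n "$target"
}
```

Running `DRY_RUN=1 scale_down 2 <WORKER_IP_1> <WORKER_IP_2> <WORKER_IP_3>` lists the planned commands, with the scale call last. After a real run, check `python infra/launch.py status` and rejoin the remaining workers as described above.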

