[draft] Adds serving to GPT2 #19

rachfop · 2025-12-11T23:03:40Z

Towards DEVA-1068
To start the server, run:

pixi run max serve \
  --model openai-community/gpt2 \
  --custom-architectures gpt2_module_v3 \
  --port 8888

Then, in another terminal, run:

curl http://localhost:8888/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai-community/gpt2",
    "prompt": "The future of AI",
    "max_tokens": 30
  }' | jq .

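Note: `--port` is optional. You can pass a different port number or omit the flag; either way, the port in the curl URL must match the port the server is listening on.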
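
For anyone who would rather test from a script than from curl, here is a minimal Python sketch of the same request using only the standard library. It assumes the server is running on port 8888 as started above, and that the response follows the OpenAI-style completions shape (`choices[0].text`), which this PR does not confirm. The helper names `build_request`, `extract_text`, and `complete` are hypothetical and are not part of this change.

```python
import json
import urllib.request

# Values copied from the commands above. Change the port here if the
# server was started with a different --port.
BASE_URL = "http://localhost:8888"
MODEL = "openai-community/gpt2"


def build_request(prompt: str, max_tokens: int = 30,
                  base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the same POST request that the curl command sends."""
    payload = {"model": MODEL, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_text(response_body: dict) -> str:
    """Return the first completion.

    Assumption: the body has the OpenAI-style shape
    {"choices": [{"text": ...}, ...]}.
    """
    return response_body["choices"][0]["text"]


def complete(prompt: str, max_tokens: int = 30) -> str:
    """Send the request to the running server and return the generated text."""
    request = build_request(prompt, max_tokens)
    with urllib.request.urlopen(request, timeout=60) as response:
        return extract_text(json.load(response))


if __name__ == "__main__":
    # Requires the server from the first command to be up.
    print(complete("The future of AI"))
```

Running the file directly prints the generated continuation. If the response shape turns out to differ, only `extract_text` should need adjusting.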

[draft] Adds serving to GPT2

6730b25
