Multi-model registry and adapter layer for realtime-0.5b, tts-1.5b, tts-7b by Copilot · Pull Request #7 · groxaxo/vibevoice-realtimeFASTAPI

Copilot · 2026-03-13T02:02:38Z

Refactors the hardcoded single-model runner into a multi-backend TTS system. The model field in OpenAI-compatible requests is now resolved via a registry instead of ignored. Longform models (1.5B/7B) are registered with API plumbing but fail gracefully with 501 until a real backend is wired in. Existing realtime-0.5b behavior is fully preserved.

New `runner/` package

model_registry.py — ModelProfile dataclass, 3 model profiles, alias map (tts-1 → realtime-0.5b, etc.), resolve_model_key(), get_model_profile()
adapters/base.py — EngineAdapter ABC with is_available(), capabilities(), synthesize(), stream(), health()
adapters/realtime_demo.py — RealtimeDemoAdapter wrapping existing subprocess demo. Extracted detect_device(), apply_overrides(), build_realtime_demo_cmd() from run_realtime_demo.py
adapters/longform_native.py — LongformNativeAdapter scaffold. is_available() returns False; synthesize() raises BackendUnavailableError. No fake streaming
adapter_factory.py — make_adapter(model_key) dispatches by family/loader_mode
types.py — SpeechRequest/SpeakerTurn pydantic models with per-family validation
errors.py — UnknownModelError, CapabilityError, BackendUnavailableError, InvalidRequestForModelError

Modified files

overrides/app.py — /v1/audio/speech resolves model, returns 400 (invalid combo), 404 (unknown model), 501 (missing backend). /stream rejects non-realtime models. /config and /health expose model registry info
scripts/download_model.py — Accepts --model with registry resolution. Default unchanged
scripts/run_realtime_demo.py — Now a thin shim delegating to runner.adapters.realtime_demo
scripts/run_server.py — New generic launcher: --model realtime-0.5b|tts-1.5b|tts-7b

Backward compatibility

All existing entry points work unchanged:

uv run python scripts/download_model.py           # still downloads realtime-0.5b
uv run python scripts/run_realtime_demo.py --port 8000  # still launches realtime demo
curl -d '{"model":"tts-1","input":"Hello"}' /v1/audio/speech  # still maps to realtime-0.5b

Remaining TODOs

longform_native.py:_load_backend() — replace placeholder ImportError with real backend import
longform_native.py:synthesize() — wire actual inference
run_server.py — add longform serving path

Tests

46 new tests covering registry, alias resolution, per-family validation, adapter factory, graceful degradation, and error classes. All run without GPU/model weights.

📱 Kick off Copilot coding agent tasks wherever you are with GitHub Mobile, available on iOS and Android.

…l scripts Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

…entation Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

…or message clarity Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

Initial plan

cf4c176

Copilot AI assigned Copilot and groxaxo Mar 13, 2026

Copilot started work on behalf of groxaxo March 13, 2026 02:02 View session

groxaxo marked this pull request as ready for review March 13, 2026 02:03

Copilot AI and others added 3 commits March 13, 2026 02:12

Add runner package with model registry, adapter layer, and multi-mode…

569aee6

…l scripts Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

Add tests for runner package and update README with multi-model docum…

4c450a3

…entation Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

Address code review: add public getter for backend error, improve err…

0e2c54d

…or message clarity Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add multi-model compatibility for existing repo~~ Multi-model registry and adapter layer for realtime-0.5b, tts-1.5b, tts-7b Mar 13, 2026

Copilot AI requested a review from groxaxo March 13, 2026 02:19

Copilot finished work on behalf of groxaxo March 13, 2026 02:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-model registry and adapter layer for realtime-0.5b, tts-1.5b, tts-7b#7

Multi-model registry and adapter layer for realtime-0.5b, tts-1.5b, tts-7b#7
Copilot wants to merge 4 commits intomainfrom
copilot/implement-multi-model-compatibility

Copilot AI commented Mar 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

New runner/ package

Modified files

Backward compatibility

Remaining TODOs

Tests

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Mar 13, 2026 •

edited

Loading

New `runner/` package