ai-cache

Here is 1 public repository matching this topic...

peva3 / SmarterRouter

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

docker self-hosted model-serving gpu-monitoring fastapi llm openai-proxy semantic-cache local-llm ollama llm-proxy ollama-api ai-gateway llm-router self-hosted-ai ai-cache

Updated Feb 22, 2026
Python

Improve this page

Add a description, image, and links to the ai-cache topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-cache topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-cache

Here is 1 public repository matching this topic...

peva3 / SmarterRouter

Improve this page

Add this topic to your repo