An offline‑first e‑commerce assistant that pairs local vector search over JSON data with an LLM‑based router and response generator. It is designed for fast, deterministic retrieval while still producing natural responses, with optional speech input and speech output. For deeper architecture details, see ARCHITECTURE.md.
```mermaid
flowchart TD
    User[User] --> IO["main.py I/O"]
    User --> VoiceIn["Voice input (STT, utils/stt)"]
    VoiceIn --> IO
    IO --> VoiceOut["Voice output (TTS, utils/tts)"]
    IO --> Agent["Agent: ChatGlem (core/chat_engine)"]
    Agent --> Router["IntentClassifier (core/intent)"]
    Router --> LLM["GlemEngine (core/glem)"]
    LLM --> Agent
    Agent --> Tools["KnowledgeBaseTools (core/tools)"]
    Tools --> Retrieval["FAISS vector search"]
    Tools --> Actions["Cancel / return actions"]
    Retrieval --> Data["JSON data and indexes"]
    Tools --> Agent
```
- Python 3: Primary runtime for the assistant and tools.
- Groq LLM API: Routing and response generation via `GlemEngine`.
- FAISS: Vector similarity search over local indexes.
- Sentence Transformers: Embeddings for catalog, FAQ, policy, and orders.
- JSON data stores: Product catalog, FAQs, policies, and orders.
- ElevenLabs (optional): Text-to-speech via `utils/tts.py`.
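
To show how these pieces fit together, here is a minimal sketch of the retrieval pattern (Sentence Transformers embeddings queried against a FAISS index built from JSON records). It is not the project's actual `utils/search_utils.py` API; the model name, index filename, and catalog JSON layout are assumptions.

```python
# Minimal sketch: embed a query and search a FAISS index built from JSON records.
# The file names and the embedding model are illustrative, not the project's API.
import json

import faiss
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")       # assumed embedding model
index = faiss.read_index("data/indexes/products.faiss")
with open("data/products.json", encoding="utf-8") as f:
    products = json.load(f)                           # assumed: same order as index rows

def search(query: str, k: int = 3):
    # Encode the query, then take the k nearest catalog entries by distance.
    vec = model.encode([query], convert_to_numpy=True).astype("float32")
    distances, ids = index.search(vec, k)
    return [(float(d), products[i]) for d, i in zip(distances[0], ids[0]) if i != -1]

for score, item in search("wireless headphones under $100"):
    print(score, item.get("name"))
```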
```bash
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

Create a `.env` file in the project root or export variables in your shell.
Required:

- `API_KEYS`: Comma-separated Groq API keys used by `GlemEngine`.

Optional:

- `USE_STT`: `1`, `true`, or `yes` to enable speech-to-text.
- `USE_TTS`: `1`, `true`, or `yes` to enable text-to-speech.
- `TTS_VOICE_ID`: ElevenLabs voice id.
- `TTS_MODEL_ID`: ElevenLabs model id. Default `eleven_multilingual_v2` in `main.py`.
- `TTS_OUTPUT_FORMAT`: e.g. `mp3_44100_128`.
- `TTS_RATE`: Optional int for speech rate if your TTS backend supports it.
- `ELEVENLABS_API_KEY` or `ELEVENLABS_API_KEYS`: Used by `utils/tts.py`.
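
For reference, a `.env` along these lines covers the variables above; all key values are placeholders, only the variable names come from the list.

```
# .env (placeholder values)
API_KEYS=gsk_key_one,gsk_key_two
USE_STT=0
USE_TTS=1
TTS_VOICE_ID=your_voice_id
TTS_MODEL_ID=eleven_multilingual_v2
TTS_OUTPUT_FORMAT=mp3_44100_128
ELEVENLABS_API_KEY=your_elevenlabs_key
```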
The assistant expects FAISS indexes under `data/indexes/`.

```bash
python scripts/build_faiss_indexes.py --data-dir data --out-dir data/indexes
```

Run the assistant:

```bash
python main.py
```

- `main.py` wires the system prompt, tools, intent classifier, and agent.
- `core/chat_engine.py` runs the loop, calls tools, and crafts the final response.
- `core/intent.py` uses `GlemEngine` with a JSON schema to decide tool vs chat routing, as sketched below.
- `core/tools.py` handles retrieval, order actions, and tool execution.
- `utils/search_utils.py` provides embeddings, FAISS index access, and query parsing.
- `utils/stt.py` and `utils/tts.py` provide optional audio input/output.
- `scripts/build_faiss_indexes.py` builds the indexes from JSON data.

See ARCHITECTURE.md for a full system diagram and component breakdown.
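
The routing step constrains the LLM's reply to a JSON shape before the agent acts on it. The snippet below is a sketch of that schema-checked pattern only; the field names, enum values, and the `jsonschema` validation step are illustrative assumptions, not the actual schema in `core/intent.py`.

```python
# Sketch of schema-constrained routing: ask the LLM for JSON, validate it,
# and fall back to plain chat if the reply cannot be trusted.
import json

from jsonschema import ValidationError, validate

ROUTING_SCHEMA = {
    "type": "object",
    "properties": {
        "route": {"type": "string", "enum": ["tool", "chat"]},
        "tool_name": {"type": "string"},
        "reason": {"type": "string"},
    },
    "required": ["route"],
}

def parse_routing_decision(raw_llm_output: str) -> dict:
    """Parse and validate the router's JSON reply; fall back to chat on errors."""
    try:
        decision = json.loads(raw_llm_output)
        validate(instance=decision, schema=ROUTING_SCHEMA)
        return decision
    except (json.JSONDecodeError, ValidationError):
        return {"route": "chat", "reason": "unparseable routing output"}

print(parse_routing_decision('{"route": "tool", "tool_name": "search_products"}'))
```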
- Data lives in `data/` and is treated as the source of truth.
- Indexes in `data/indexes/` are built before runtime.
- A single customer context is active per run via `CUSTOMER_ID` in `main.py`.
- The configured model string `openai/gpt-oss-20b` is available in Groq.
- The assistant only acts on orders belonging to the active customer id.
- No live data updates. You must rebuild indexes if JSON data changes.
- Order actions do not mutate the order database; they only write to `data/action_log.jsonl` (see the sketch after this list).
- Tool calls and routing depend on the LLM; incorrect routing is possible.
- Token budgeting uses a rough heuristic in `build_sliding_window`.
- No streaming responses or concurrency controls.
- Audio features require a working local audio device and the related packages.
- Errors are surfaced as plain text; there is no structured error handling layer.
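
The action log is a JSON Lines file: one record appended per order action instead of a database update. A minimal sketch of that append-only pattern follows; the record fields are hypothetical, only the `data/action_log.jsonl` path comes from the project description.

```python
# Sketch: append an order action to a JSON Lines log instead of mutating orders.
import json
from datetime import datetime, timezone

def log_action(action: str, order_id: str, customer_id: str,
               path: str = "data/action_log.jsonl") -> None:
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "action": action,          # e.g. "cancel" or "return"
        "order_id": order_id,
        "customer_id": customer_id,
    }
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record) + "\n")   # one JSON object per line

log_action("cancel", order_id="ORD-1042", customer_id="CUST-001")
```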
- Missing indexes: run `scripts/build_faiss_indexes.py`.
- API errors: confirm `API_KEYS` is set and valid.
- STT/TTS issues: disable with `USE_STT=0` or `USE_TTS=0` and verify dependencies.
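
For example, to rule out audio problems you can force a text-only session for a single run (assuming the flags are read from the environment at startup):

```bash
USE_STT=0 USE_TTS=0 python main.py
```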
