
# 🚀 Deep-Dive Video Note Taker

LLM + RAG

🎥 Local Video → Structured Notes + Timestamps + Action Items + RAG Q&A

CPU-only • Privacy-First • Offline-Capable • LLM-Powered

Convert long YouTube videos, lectures, and meetings into structured knowledge, entirely locally.



## 🔎 What Is This?

Deep-Dive Video Note Taker (Lite) is a local AI system that converts long videos into:

- 📌 Structured notes
- ⏱️ Key timestamps
- ✅ Action items
- 🧠 RAG-based Q&A with citations

No cloud upload required. Everything runs locally using:

- whisper.cpp
- sentence-transformers
- ChromaDB
- Ollama (LLM)

## 🧠 Architecture Overview

```mermaid
flowchart LR
    A[Video Input] --> B[Audio Extraction]
    B --> C[Speech-to-Text]
    C --> D[Chunk + Embed]
    D --> E[Vector DB]
    E --> F[LLM Notes Generator]
    E --> G[RAG Q&A]
```
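The stages in the diagram can be sketched as plain functions. This is a hypothetical, dependency-free outline of the data flow only; the real stages call whisper.cpp, sentence-transformers, ChromaDB, and Ollama:

```python
# Hypothetical sketch of the pipeline stages: transcribe -> chunk.
# Stub data stands in for real whisper.cpp output.

def transcribe(audio_path: str) -> list[dict]:
    """Speech-to-text stub: returns timestamped segments."""
    return [
        {"start": 0.0, "end": 4.2, "text": "Welcome to the lecture."},
        {"start": 4.2, "end": 9.8, "text": "Today we cover linear regression."},
    ]

def chunk(segments: list[dict], max_chars: int = 40) -> list[dict]:
    """Merge consecutive segments into retrieval-sized chunks, keeping timestamps."""
    chunks, current = [], None
    for seg in segments:
        if current and len(current["text"]) + len(seg["text"]) <= max_chars:
            current["text"] += " " + seg["text"]
            current["end"] = seg["end"]
        else:
            current = dict(seg)
            chunks.append(current)
    return chunks

chunks = chunk(transcribe("lecture.wav"))
```

Each chunk keeps its `start`/`end` times, which is what later makes timestamped highlights and citations possible.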

## ✨ Core Features

### 🎥 Input

- YouTube URL
- Local video file
- Batch processing

### 📝 Output

- Structured summary
- Multi-level notes
- Timestamped highlights
- Action item extraction
- Export to Markdown / JSON / Obsidian / Notion

### 🧠 Intelligence Layer

- Semantic chunking
- Embedding-based retrieval
- RAG pipeline
- Citation-backed answers

## ⚡ Quick Start

### 1️⃣ Install Dependencies

```shell
pip install poetry
poetry install
```

### 2️⃣ Install the Ollama Model

```shell
ollama pull llama3.1:8b
```

### 3️⃣ Process a Video

```shell
poetry run notetaker process "https://www.youtube.com/watch?v=VIDEO_ID"
```

### 4️⃣ Ask Questions (RAG)

```shell
poetry run notetaker query VIDEO_ID "What were the main insights?"
```

## 🌐 Web UI + REST API

Start the server:

```shell
poetry run notetaker serve
```

Then open http://localhost:8000 in your browser.
### API Endpoints

```
POST   /api/process
POST   /api/process/upload
GET    /api/status/{job_id}
GET    /api/notes/{video_id}
GET    /api/transcript/{video_id}
POST   /api/query/{video_id}
GET    /api/library
GET    /api/export/{video_id}?format=json|markdown|obsidian|notion
DELETE /api/video/{video_id}
```
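For example, a query request against the local server could be built like this. This is request construction only (no network call is made); the endpoint path comes from the list above, but the JSON field name `question` is an assumption, not the project's documented schema:

```python
import json
import urllib.request

BASE = "http://localhost:8000"

def build_query_request(video_id: str, question: str) -> urllib.request.Request:
    """Build a POST /api/query/{video_id} request; the `question` field is assumed."""
    payload = json.dumps({"question": question}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE}/api/query/{video_id}",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_query_request("abc123", "What were the main insights?")
# Sending it would be: urllib.request.urlopen(req)  (requires the server running)
```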

## 📦 Tech Stack

| Component | Technology |
| --- | --- |
| Speech-to-Text | whisper.cpp |
| Embeddings | sentence-transformers |
| Vector DB | ChromaDB |
| LLM | Ollama (llama3.1:8b) |

βš™οΈ Configuration

User config:

~/.notetaker/config.yaml

Environment variables:

NOTETAKER_OLLAMA_BASE_URL
NOTETAKER_OLLAMA_MODEL
NOTETAKER_WHISPER_MODEL
NOTETAKER_DATA_DIR
NOTETAKER_NOTION_API_KEY
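Environment variables typically override file-based settings. A minimal resolution sketch, assuming that precedence; the default values below are placeholders, not the project's actual defaults:

```python
import os

# Placeholder defaults; the real defaults would come from ~/.notetaker/config.yaml.
DEFAULTS = {
    "NOTETAKER_OLLAMA_BASE_URL": "http://localhost:11434",
    "NOTETAKER_OLLAMA_MODEL": "llama3.1:8b",
    "NOTETAKER_WHISPER_MODEL": "base",
}

def setting(name: str) -> str:
    """Environment variable wins; otherwise fall back to the default."""
    return os.environ.get(name, DEFAULTS[name])

os.environ["NOTETAKER_OLLAMA_MODEL"] = "llama3.1:70b"  # user override
model = setting("NOTETAKER_OLLAMA_MODEL")
```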

## 📤 Exports

- JSON
- Markdown
- Obsidian (YAML + callouts)
- Notion blocks JSON

## 🐳 Docker

```shell
docker compose up --build
```

- App → http://localhost:8000
- Ollama → http://localhost:11434


## ❓ FAQ

**Does it upload my videos?**

No. Everything runs locally.

**Can I use it offline?**

Yes: fully offline once Ollama and the models are installed.

**Can I search my entire video library?**

Yes: semantic retrieval via ChromaDB.

### Example Queries

- "Summarize the lecture in 5 bullet points"
- "List action items from 00:20–00:40"
- "Where was regression discussed?"

## 🧪 Development

Run tests:

```shell
poetry run pytest -v
```

Lint:

```shell
poetry run ruff check .
```

## 🤝 Contributing

PRs welcome.

1. Fork the repo
2. Create a branch
3. Add tests
4. Open a PR

## 📄 License

MIT
