OpenWhale

The AI that doesn't just talk. It works. 🐋

Deploys agent swarms, clears your inbox, sends emails, manages your calendar, browses the web, writes code, and runs your life.
All from WhatsApp, Telegram, Discord, or any chat app you already use.

🤖 Multi-Agent Swarms: Fan out tasks to parallel agents with shared memory. Not one bot, an army
🧠 Self-Extensible: Creates its own tools, skills, and automations on the fly
🔒 Runs on Your Machine: Your data never leaves your computer. Private by default
🌐 Website: viralcode.github.io/openwhale
☁️ Managed Hosting: Coming soon


What is OpenWhale?

Other AI assistants chat. OpenWhale does. It deploys autonomous agent swarms that fan out complex work to multiple AI agents running in parallel, coordinate them with shared memory and file locks, and collect results automatically. Connect it to 8 AI providers, let it talk on your behalf through WhatsApp/Telegram/Discord/Slack/Twitter/iMessage, browse the web with real browser automation, execute code, manage your calendar, send emails, and basically run your digital life.

An AI workforce, not just an AI assistant. One agent is nice. A coordinated swarm is unstoppable.


⚡ Quick Start

git clone https://github.com/viralcode/openwhale.git
cd openwhale
pnpm install
pnpm approve-builds
pnpm run dev

Open http://localhost:7777/dashboard → add your API keys → start chatting.

📖 Full Getting Started Guide →

🪟 On Windows? Run the one-line installer in PowerShell, which installs everything automatically:

irm https://raw.githubusercontent.com/viralcode/openwhale/main/install.ps1 | iex

Or see the Windows Installation Guide → for manual setup.


✨ Core Features

🤖 8 AI Providers

Switch between any major AI model on the fly. One assistant, unlimited model choices.

| Provider | Top Models |
|---|---|
| Anthropic | Opus 4.5, Sonnet 4.5 |
| OpenAI | GPT-5.2, GPT-5, GPT-4o, o4-mini |
| Google Gemini | Gemini 3 Pro, Gemini 2.5 Pro |
| Qwen | Qwen3-Max, QwQ-Plus, Qwen3-Coder |
| DeepSeek | DeepSeek Chat, Coder, Reasoner |
| Groq | Llama 3.3 70B, Mixtral 8x7B |
| Together AI | Any open-source model |
| Ollama | Local models, no API key! |

Automatic failover between providers. If one fails, the next takes over.
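
As a rough illustration of what failover means here, the sketch below tries a list of providers in order and returns the first successful reply. The `Provider` interface, `completeWithFailover`, and the two stub providers are hypothetical names made up for this example; OpenWhale's actual provider code may look quite different.

```typescript
// Hypothetical provider shape; OpenWhale's real interface may differ.
interface Provider {
  name: string;
  complete(prompt: string): Promise<string>;
}

// Try each provider in order and return the first successful reply.
async function completeWithFailover(
  providers: Provider[],
  prompt: string,
): Promise<{ provider: string; text: string }> {
  const errors: string[] = [];
  for (const p of providers) {
    try {
      const text = await p.complete(prompt);
      return { provider: p.name, text };
    } catch (err) {
      // Record the failure and fall through to the next provider.
      const reason = err instanceof Error ? err.message : String(err);
      errors.push(`${p.name}: ${reason}`);
    }
  }
  throw new Error(`All providers failed: ${errors.join("; ")}`);
}

// Stub providers for illustration only; no network calls are made.
const flaky: Provider = {
  name: "primary",
  complete: async () => {
    throw new Error("rate limited");
  },
};
const backup: Provider = {
  name: "backup",
  complete: async (prompt) => `echo: ${prompt}`,
};
```

Calling `completeWithFailover([flaky, backup], "hi")` resolves with the reply from `backup`, because the first provider throws. If every provider fails, the rejection message lists each failure in order.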

📖 Providers Documentation →


🤖 Multi-Agent Coordination

Deploy multiple AI agents in parallel to tackle complex tasks faster. Automatic task decomposition, shared context, advisory file locks, and inter-agent messaging.

"Research quantum computing and write a Python sorting algorithm"
     → Orchestrator fans out to Research Agent + Coder Agent
     → Results synthesized automatically

  • Fan-out / Fan-in: Split work across specialized parallel agents
  • Shared Context: Agents share data via a namespaced key-value store
  • Active Locks: Advisory file locks prevent concurrent write conflicts
  • Inter-Agent Messaging: Agents can talk to each other during execution
  • A2A Protocol: Google Agent2Agent protocol for interop with LangGraph, CrewAI, AutoGen
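
To make the fan-out / fan-in idea concrete, here is a minimal sketch: sub-tasks run in parallel, each agent writes to a namespaced key in a shared store, and the results are collected in order. The `Agent` and `SharedContext` types, `fanOut`, and both stub agents are assumptions for illustration, not OpenWhale's actual orchestrator API.

```typescript
// Hypothetical types for illustration; not OpenWhale's actual API.
type SharedContext = Map<string, string>;
type Agent = (task: string, ctx: SharedContext) => Promise<string>;

// Fan out: start every sub-task in parallel.
// Fan in: Promise.all collects the results in the original job order.
async function fanOut(
  jobs: { agent: Agent; task: string }[],
  ctx: SharedContext,
): Promise<string[]> {
  return Promise.all(jobs.map(({ agent, task }) => agent(task, ctx)));
}

// Stub agents that write to namespaced keys in the shared store.
const researchAgent: Agent = async (task, ctx) => {
  ctx.set("research:summary", `notes on ${task}`);
  return `researched ${task}`;
};
const coderAgent: Agent = async (task, ctx) => {
  ctx.set("coder:output", `code for ${task}`);
  return `implemented ${task}`;
};

const ctx: SharedContext = new Map();
const results = fanOut(
  [
    { agent: researchAgent, task: "quantum computing" },
    { agent: coderAgent, task: "a sorting algorithm" },
  ],
  ctx,
);
```

The key prefixes (`research:`, `coder:`) stand in for the namespacing mentioned above, so parallel agents do not overwrite each other's entries.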

📖 Multi-Agent Documentation →
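
The advisory file locks mentioned in this section can be pictured with a small in-memory lock table. This is only a sketch under assumptions: the `AdvisoryLocks` class and its `acquire` / `release` methods are invented for the example, and the real implementation may persist locks or add timeouts.

```typescript
// Minimal in-memory advisory lock table (a sketch, not OpenWhale's code).
// "Advisory" means agents must check the lock voluntarily; nothing here
// stops an agent that ignores it from writing anyway.
class AdvisoryLocks {
  private owners = new Map<string, string>(); // path -> agent id

  // Returns true if the agent now holds the lock for this path.
  acquire(path: string, agentId: string): boolean {
    const holder = this.owners.get(path);
    if (holder !== undefined && holder !== agentId) return false;
    this.owners.set(path, agentId);
    return true;
  }

  // Only the current holder may release; returns whether it did.
  release(path: string, agentId: string): boolean {
    if (this.owners.get(path) !== agentId) return false;
    this.owners.delete(path);
    return true;
  }
}

const locks = new AdvisoryLocks();
const first = locks.acquire("report.md", "agent-a"); // true
const second = locks.acquire("report.md", "agent-b"); // false: held by agent-a
locks.release("report.md", "agent-a");
const third = locks.acquire("report.md", "agent-b"); // true
```

An agent that fails to acquire the lock would typically wait or pick another file instead of writing.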


💬 6 Messaging Channels

Your AI responds across all platforms: unified inbox, one brain.

| Channel | How It Works |
|---|---|
| WhatsApp | Scan QR code, uses your personal account |
| Telegram | Create a bot with @BotFather |
| Discord | Add bot to your server |
| Slack | Slack app integration |
| Twitter/X | Auto-reply to mentions via bird CLI (cookie auth, no API!) |
| iMessage | macOS native: read and send iMessages |

📖 Channels Setup Guide →


🛠️ 40+ Built-in Tools

The AI can actually DO things, not just talk about them.

| Category | Tools |
|---|---|
| System | Shell exec, code execution, system info, clipboard, Apple Shortcuts |
| Files | Read/write files, git, zip, codebase search, database queries |
| Documents | PDF creation, PowerPoint slides, Excel spreadsheets |
| Web | Browser automation (Playwright + BrowserOS), web fetch |
| Media | Screenshots, camera, canvas, image analysis, TTS, screen recording, QR codes |
| Communication | Email, iMessage, calendar events |
| Infrastructure | Docker, SSH |
| Memory | Persistent memory, knowledge graphs, multi-step planning |
| Automation | Cron scheduling, self-extensions, skill creator |
| Coordination | Agent listing, session management, shared context, file locks |

📖 Full Tools Reference →


🔌 API Skills (External Integrations)

Connect to your favorite services. The AI handles the API calls.

| Skill | What It Does |
|---|---|
| GitHub | Manage repos, issues, PRs, commits |
| Notion | Search, create, update pages and databases |
| Google Calendar | View and create events |
| Gmail | Read, send, and search emails |
| Google Drive | Upload, download, list files |
| Google Tasks | Manage your to-do lists |
| Weather | Current conditions and forecasts |
| 1Password | Securely fetch passwords and secrets |
| Apple Notes/Reminders | Native macOS integration |
| Twitter/X | Post, search, timeline, mentions, follow |
| Spotify | Playback control, search, playlists |
| Trello | Boards, lists, and cards |
| ElevenLabs | AI text-to-speech |
| Twilio | SMS and voice calls |

Plus 50+ Markdown Skills from the OpenClaw community.

📖 Skills Documentation →


🌐 Browser Automation

Two backends to choose from:

| Backend | Best For |
|---|---|
| Playwright | Simple scraping, headless automation, zero setup |
| BrowserOS | Real browser with your extensions, cookies, and logins |

BrowserOS lets the AI control YOUR actual browser, including logged-in sessions, AdBlock, 1Password, and everything else.

📖 Browser Automation Guide →


⚡ Self-Extension System

The AI can create its own automations. Just ask.

  • "Create an extension that checks Bitcoin price every hour and sends it to WhatsApp"
  • "Make an extension that reminds me to drink water"
  • "Create a daily standup summary extension"

Extensions support cron scheduling, multi-channel output, and persist across restarts.
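
For a sense of how cron scheduling decides when an extension runs, here is a small matcher for five-field cron expressions. It is a simplified sketch: it handles only `*`, `*/n`, and plain numbers, works in UTC, and requires every field to match (real cron treats day-of-month and day-of-week specially). The `bitcoinHourly` descriptor and its field names are hypothetical, not OpenWhale's extension format.

```typescript
// Match one cron field ("*", "*/n", or a plain number) against a value.
function fieldMatches(field: string, value: number): boolean {
  if (field === "*") return true;
  if (field.startsWith("*/")) {
    const step = Number(field.slice(2));
    return Number.isInteger(step) && step > 0 && value % step === 0;
  }
  return Number(field) === value;
}

// Check a five-field expression: minute hour day-of-month month day-of-week.
function cronMatches(expr: string, date: Date): boolean {
  const fields = expr.trim().split(/\s+/);
  if (fields.length !== 5) {
    throw new Error(`expected 5 fields, got ${fields.length}`);
  }
  const values = [
    date.getUTCMinutes(),
    date.getUTCHours(),
    date.getUTCDate(),
    date.getUTCMonth() + 1, // cron months are 1-12
    date.getUTCDay(), // 0 = Sunday
  ];
  return fields.every((f, i) => fieldMatches(f, values[i]));
}

// Hypothetical extension descriptor: run at minute 0 of every hour.
const bitcoinHourly = {
  title: "bitcoin-price",
  schedule: "0 * * * *",
  channel: "whatsapp",
};
const onTheHour = new Date(Date.UTC(2025, 0, 1, 9, 0));
const midHour = new Date(Date.UTC(2025, 0, 1, 9, 30));
console.log(cronMatches(bitcoinHourly.schedule, onTheHour)); // true
console.log(cronMatches(bitcoinHourly.schedule, midHour)); // false
```

A scheduler built on this would call `cronMatches` once a minute and run every extension whose schedule matches.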

📖 Extensions Documentation →


🧠 Persistent Memory

Remembers everything, even after restarts: long-term facts, daily notes, vector search (local embeddings, no API needed!), and session persistence.
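
Vector search over stored memories generally comes down to ranking items by similarity to a query embedding. The sketch below uses cosine similarity over toy three-dimensional vectors; `MemoryItem`, `cosine`, `searchMemory`, and the embedding values are all made up for the example and say nothing about which embedding model or index OpenWhale uses.

```typescript
// Hypothetical memory record: some text plus its embedding vector.
interface MemoryItem {
  text: string;
  embedding: number[];
}

// Cosine similarity between two equal-length vectors (0 if either is all zeros).
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Return the k items most similar to the query, best match first.
function searchMemory(items: MemoryItem[], query: number[], k: number): MemoryItem[] {
  return [...items]
    .sort((x, y) => cosine(y.embedding, query) - cosine(x.embedding, query))
    .slice(0, k);
}

// Toy 3-dimensional embeddings, invented for this example.
const memories: MemoryItem[] = [
  { text: "User prefers dark mode", embedding: [0.9, 0.1, 0.0] },
  { text: "Standup is at 9am", embedding: [0.0, 0.2, 0.9] },
  { text: "User likes terse replies", embedding: [0.8, 0.3, 0.1] },
];
const bestMatches = searchMemory(memories, [1, 0, 0], 2);
console.log(bestMatches.map((m) => m.text));
```

A linear scan like this is fine for a few thousand memories; larger stores usually switch to an approximate nearest-neighbour index.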

📖 Memory System →


💓 Heartbeat (Proactive Agent)

The AI wakes up periodically to check on things (configurations, inboxes, tasks) without you having to ask.
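
One way to picture the heartbeat is as a timer that runs a list of checks and only speaks up when something needs attention. The sketch below assumes that shape; `Check`, `heartbeatTick`, `startHeartbeat`, and the stub checks are hypothetical names, not the project's real scheduler.

```typescript
// A check inspects something and resolves to a message when it needs
// attention, or null when there is nothing to report.
type Check = { label: string; run: () => Promise<string | null> };

// Run every check once and collect the messages worth reporting.
async function heartbeatTick(checks: Check[]): Promise<string[]> {
  const alerts: string[] = [];
  for (const check of checks) {
    try {
      const message = await check.run();
      if (message !== null) alerts.push(`${check.label}: ${message}`);
    } catch (err) {
      // A failing check is itself worth reporting.
      const reason = err instanceof Error ? err.message : String(err);
      alerts.push(`${check.label}: check failed (${reason})`);
    }
  }
  return alerts;
}

// Start the periodic loop; the returned function stops it.
function startHeartbeat(
  checks: Check[],
  intervalMs: number,
  notify: (alerts: string[]) => void,
): () => void {
  const timer = setInterval(async () => {
    const alerts = await heartbeatTick(checks);
    if (alerts.length > 0) notify(alerts);
  }, intervalMs);
  return () => clearInterval(timer);
}

// Stub checks for illustration; real ones would query an inbox or task list.
const sampleChecks: Check[] = [
  { label: "inbox", run: async () => "3 unread messages" },
  { label: "tasks", run: async () => null },
];
```

With these stubs, one tick reports only the inbox. Calling `startHeartbeat(sampleChecks, 60_000, console.log)` would repeat that every minute until the returned stop function is called.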

📖 Heartbeat Documentation →


🔒 Enterprise Security

JWT authentication, rate limiting, audit logs, sandboxed code execution, command filtering, multi-user support.
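
Of the features listed, rate limiting is the easiest to show in a few lines. Below is a sliding-window limiter sketch that allows a fixed number of requests per key within a time window. The `RateLimiter` class, its parameters, and the limits used are assumptions for illustration; the actual limiter and its configuration may differ.

```typescript
// Sliding-window rate limiter: at most `limit` requests per key
// within any `windowMs` milliseconds. A sketch, not OpenWhale's code.
class RateLimiter {
  private hits = new Map<string, number[]>(); // key -> recent request timestamps (ms)
  private limit: number;
  private windowMs: number;

  constructor(limit: number, windowMs: number) {
    this.limit = limit;
    this.windowMs = windowMs;
  }

  // Returns true if the request is allowed, and records it when so.
  // `now` is injectable so the behaviour can be checked without waiting.
  allow(key: string, now: number = Date.now()): boolean {
    const recent = (this.hits.get(key) ?? []).filter((t) => now - t < this.windowMs);
    if (recent.length >= this.limit) {
      this.hits.set(key, recent);
      return false;
    }
    recent.push(now);
    this.hits.set(key, recent);
    return true;
  }
}

const limiter = new RateLimiter(2, 1000); // 2 requests per second per key
const r1 = limiter.allow("user-1", 0); // true
const r2 = limiter.allow("user-1", 100); // true
const r3 = limiter.allow("user-1", 200); // false: limit reached
const r4 = limiter.allow("user-1", 1100); // true: earlier hits have expired
```

Keys are tracked independently, so one noisy user or API key does not block the others.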

📖 Security Documentation →


🐋 macOS Native App

Native SwiftUI menu bar app with quick chat, status monitoring, and channel controls. No browser needed.

📖 macOS App Documentation →


Why OpenWhale?

| Feature | OpenWhale | ChatGPT | Claude | Typical Chatbot |
|---|---|---|---|---|
| Multi-agent orchestration | ✅ Fan-out/fan-in with shared memory | ❌ No | ❌ No | ❌ No |
| Multi-model support | ✅ 8 providers | ❌ GPT only | ❌ Claude only | ❌ Single model |
| Run shell commands | ✅ Full access | ❌ No | ❌ No | ❌ No |
| Browser automation | ✅ Playwright + BrowserOS | ❌ Limited | ✅ Limited | ❌ No |
| WhatsApp/Telegram/Discord | ✅ All channels | ❌ No | ❌ No | ❌ Maybe one |
| iMessage (macOS) | ✅ Native | ❌ No | ❌ No | ❌ No |
| Twitter integration | ✅ Full API | ❌ No | ❌ No | ❌ No |
| Self-extensible | ✅ Creates own tools | ❌ No | ❌ No | ❌ No |
| Runs locally | ✅ Your machine | ❌ Cloud only | ❌ Cloud only | ❌ Usually cloud |
| Open source | ✅ MIT license | ❌ Closed | ❌ Closed | ❌ Varies |
| Persistent memory | ✅ Vector search | ✅ Limited | ✅ Limited | ❌ Usually no |
| Proactive heartbeat | ✅ Configurable | ❌ No | ❌ No | ❌ No |

📚 Documentation

| Document | Description |
|---|---|
| Getting Started | Installation, prerequisites, quick start, Docker |
| Providers | AI providers, models, failover configuration |
| Multi-Agent | Fan-out/fan-in, shared context, A2A protocol |
| Channels | WhatsApp, Telegram, Discord, Slack, Twitter, iMessage |
| Tools | Complete reference for all 40+ built-in tools |
| Skills | API skills and Markdown community skills |
| Browser Automation | Playwright vs BrowserOS |
| Memory | Long-term memory, vector search, session persistence |
| Heartbeat | Proactive agent configuration |
| Extensions | Self-extension system and scheduling |
| Dashboard | Web dashboard and setup wizard |
| macOS App | Native menu bar app |
| API | REST API reference and OpenAI-compatible endpoints |
| Security | Authentication, rate limiting, audit, sandboxing |
| Windows Setup | Installing & running on Windows 10/11 |
| Configuration | Full .env reference |
| CLI | CLI commands and npm scripts |
| Troubleshooting | Common issues and fixes |

Project Structure

src/
├── agents/      # Multi-agent orchestration, inter-agent comms, A2A protocol
├── auth/        # JWT, API keys, sessions
├── channels/    # WhatsApp, Telegram, Discord, Slack, Twitter, iMessage adapters
├── cli.ts       # Interactive terminal interface
├── daemon/      # Background service (launchd on macOS)
├── dashboard/   # Web admin panel + coordination dashboard
├── db/          # SQLite/PostgreSQL with Drizzle ORM
├── gateway/     # Hono-based HTTP API routes
├── heartbeat/   # Proactive agent scheduler
├── integrations/ # Google APIs (Calendar, Gmail, Drive, Tasks)
├── memory/      # Persistent memory, embeddings, vector search
├── providers/   # Anthropic, OpenAI, Google, Groq, Qwen, DeepSeek, Together, Ollama
├── security/    # Rate limiting, audit logs, sandboxing, command filtering
├── sessions/    # Persistent conversations, fan-out auto-detection, compaction
├── skills/      # GitHub, Notion, Spotify, Weather, Apple, Twitter, etc.
├── tools/       # 40+ tools: files, browser, code exec, screenshots, coordination
└── voice/       # Voice interaction support

Contributing

PRs welcome! If you find a bug or have an idea, open an issue. Keep it friendly.

License

MIT: do whatever you want with it.


Built with ❤️ by Jijo John