The AI that doesn't just talk. It works. π
Deploys agent swarms, clears your inbox, sends emails, manages your calendar, browses the web, writes code, and runs your life.
All from WhatsApp, Telegram, Discord, or any chat app you already use.
π€ Multi-Agent Swarms β Fan-out tasks to parallel agents with shared memory β not one bot, an army
π§ Self-Extensible β Creates its own tools, skills, and automations on the fly
π Runs on Your Machine β Your data never leaves your computer. Private by default
π Website β viralcode.github.io/openwhale
βοΈ Managed Hosting β Coming soon
Other AI assistants chat. OpenWhale does. It deploys autonomous agent swarms that fan-out complex work to multiple AI agents running in parallel, coordinate them with shared memory and file locks, and collect results automatically. Connect it to 8 AI providers, let it talk on your behalf through WhatsApp/Telegram/Discord/Twitter/iMessage, browse the web with real browser automation, execute code, manage your calendar, send emails β and basically run your digital life.
An AI workforce, not just an AI assistant. One agent is nice. A coordinated swarm is unstoppable.
git clone https://github.com/viralcode/openwhale.git
cd openwhale
pnpm install
pnpm approve-builds
pnpm run devOpen http://localhost:7777/dashboard β add your API keys β start chatting.
π Full Getting Started Guide β
πͺ On Windows? Run the one-line installer in PowerShell β installs everything automatically:
irm https://raw.githubusercontent.com/viralcode/openwhale/main/install.ps1 | iexOr see the Windows Installation Guide β for manual setup.
Switch between any major AI model on the fly. One assistant, unlimited model choices.
| Provider | Top Models |
|---|---|
| Anthropic | Opus 4.5, Sonnet 4.5 |
| OpenAI | GPT-5.2, GPT-5, GPT-4o, o4-mini |
| Google Gemini | Gemini 3 Pro, Gemini 2.5 Pro |
| Qwen | Qwen3-Max, QwQ-Plus, Qwen3-Coder |
| DeepSeek | DeepSeek Chat, Coder, Reasoner |
| Groq | Llama 3.3 70B, Mixtral 8x7B |
| Together AI | Any open-source model |
| Ollama | Local models β no API key! |
Automatic failover between providers. If one fails, the next takes over.
Deploy multiple AI agents in parallel to tackle complex tasks faster. Automatic task decomposition, shared context, advisory file locks, and inter-agent messaging.
"Research quantum computing and write a Python sorting algorithm"
β Orchestrator fans out to Research Agent + Coder Agent
β Results synthesized automatically
- Fan-out / Fan-in β Split work across specialized parallel agents
- Shared Context β Agents share data via a namespaced key-value store
- Active Locks β Advisory file locks prevent concurrent write conflicts
- Inter-Agent Messaging β Agents can talk to each other during execution
- A2A Protocol β Google Agent2Agent protocol for interop with LangGraph, CrewAI, AutoGen
Your AI responds across all platforms β unified inbox, one brain.
| Channel | How It Works |
|---|---|
| Scan QR code, uses your personal account | |
| Telegram | Create a bot with @BotFather |
| Discord | Add bot to your server |
| Slack | Slack app integration |
| Twitter/X | Auto-reply to mentions via bird CLI (cookie auth, no API!) |
| iMessage | macOS native β read and send iMessages |
The AI can actually DO things, not just talk about them.
| Category | Tools |
|---|---|
| System | Shell exec, code execution, system info, clipboard, Apple Shortcuts |
| Files | Read/write files, git, zip, codebase search, database queries |
| Documents | PDF creation, PowerPoint slides, Excel spreadsheets |
| Web | Browser automation (Playwright + BrowserOS), web fetch |
| Media | Screenshots, camera, canvas, image analysis, TTS, screen recording, QR codes |
| Communication | Email, iMessage, calendar events |
| Infrastructure | Docker, SSH |
| Memory | Persistent memory, knowledge graphs, multi-step planning |
| Automation | Cron scheduling, self-extensions, skill creator |
| Coordination | Agent listing, session management, shared context, file locks |
Connect to your favorite services β the AI handles the API calls.
| Skill | What It Does |
|---|---|
| GitHub | Manage repos, issues, PRs, commits |
| Notion | Search, create, update pages and databases |
| Google Calendar | View and create events |
| Gmail | Read, send, and search emails |
| Google Drive | Upload, download, list files |
| Google Tasks | Manage your to-do lists |
| Weather | Current conditions and forecasts |
| 1Password | Securely fetch passwords and secrets |
| Apple Notes/Reminders | Native macOS integration |
| Twitter/X | Post, search, timeline, mentions, follow |
| Spotify | Playback control, search, playlists |
| Trello | Boards, lists, and cards |
| ElevenLabs | AI text-to-speech |
| Twilio | SMS and voice calls |
Plus 50+ Markdown Skills from the OpenClaw community.
Two backends to choose from:
| Backend | Best For |
|---|---|
| Playwright | Simple scraping, headless automation, zero setup |
| BrowserOS | Real browser with your extensions, cookies, and logins |
BrowserOS lets the AI control YOUR actual browser β including logged-in sessions, AdBlock, 1Password, and everything else.
The AI can create its own automations. Just ask.
- "Create an extension that checks Bitcoin price every hour and sends it to WhatsApp"
- "Make an extension that reminds me to drink water"
- "Create a daily standup summary extension"
Extensions support cron scheduling, multi-channel output, and persist across restarts.
Remembers everything, even after restarts: long-term facts, daily notes, vector search (local embeddings, no API needed!), and session persistence.
π Memory System β
The AI wakes up periodically to check on things β configurations, inboxes, tasks β without you having to ask.
JWT authentication, rate limiting, audit logs, sandboxed code execution, command filtering, multi-user support.
Native SwiftUI menu bar app with quick chat, status monitoring, and channel controls β no browser needed.
| Feature | OpenWhale | ChatGPT | Claude | Typical Chatbot |
|---|---|---|---|---|
| Multi-agent orchestration | β Fan-out/fan-in with shared memory | β No | β No | β No |
| Multi-model support | β 8 providers | β GPT only | β Claude only | β Single model |
| Run shell commands | β Full access | β No | β No | β No |
| Browser automation | β Playwright + BrowserOS | β Limited | β Limited | β No |
| WhatsApp/Telegram/Discord | β All channels | β No | β No | β Maybe one |
| iMessage (macOS) | β Native | β No | β No | β No |
| Twitter integration | β Full API | β No | β No | β No |
| Self-extensible | β Creates own tools | β No | β No | β No |
| Runs locally | β Your machine | β Cloud only | β Cloud only | β Usually cloud |
| Open source | β MIT license | β Closed | β Closed | β Varies |
| Persistent memory | β Vector search | β Limited | β Limited | β Usually no |
| Proactive heartbeat | β Configurable | β No | β No | β No |
| Document | Description |
|---|---|
| Getting Started | Installation, prerequisites, quick start, Docker |
| Providers | AI providers, models, failover configuration |
| Multi-Agent | Fan-out/fan-in, shared context, A2A protocol |
| Channels | WhatsApp, Telegram, Discord, Slack, Twitter, iMessage |
| Tools | Complete reference for all 40+ built-in tools |
| Skills | API skills and Markdown community skills |
| Browser Automation | Playwright vs BrowserOS |
| Memory | Long-term memory, vector search, session persistence |
| Heartbeat | Proactive agent configuration |
| Extensions | Self-extension system and scheduling |
| Dashboard | Web dashboard and setup wizard |
| macOS App | Native menu bar app |
| API | REST API reference and OpenAI-compatible endpoints |
| Security | Authentication, rate limiting, audit, sandboxing |
| Windows Setup | Installing & running on Windows 10/11 |
| Configuration | Full .env reference |
| CLI | CLI commands and npm scripts |
| Troubleshooting | Common issues and fixes |
src/
βββ agents/ # Multi-agent orchestration, inter-agent comms, A2A protocol
βββ auth/ # JWT, API keys, sessions
βββ channels/ # WhatsApp, Telegram, Discord, Slack, Twitter, iMessage adapters
βββ cli.ts # Interactive terminal interface
βββ daemon/ # Background service (launchd on macOS)
βββ dashboard/ # Web admin panel + coordination dashboard
βββ db/ # SQLite/PostgreSQL with Drizzle ORM
βββ gateway/ # Hono-based HTTP API routes
βββ heartbeat/ # Proactive agent scheduler
βββ integrations/# Google APIs (Calendar, Gmail, Drive, Tasks)
βββ memory/ # Persistent memory, embeddings, vector search
βββ providers/ # Anthropic, OpenAI, Google, Groq, Qwen, DeepSeek, Together, Ollama
βββ security/ # Rate limiting, audit logs, sandboxing, command filtering
βββ sessions/ # Persistent conversations, fan-out auto-detection, compaction
βββ skills/ # GitHub, Notion, Spotify, Weather, Apple, Twitter, etc.
βββ tools/ # 40+ tools β files, browser, code exec, screenshots, coordination
βββ voice/ # Voice interaction support
PRs welcome! If you find a bug or have an idea, open an issue. Keep it friendly.
MIT β do whatever you want with it.
Built with β€οΈ by Jijo John





