|
|
2 周之前 | |
|---|---|---|
| .github | 2 周之前 | |
| assets | 2 周之前 | |
| docs | 2 周之前 | |
| products | 2 周之前 | |
| scripts | 2 周之前 | |
| skills | 2 周之前 | |
| src | 2 周之前 | |
| ui | 2 周之前 | |
| .dockerignore | 2 周之前 | |
| .gitattributes | 2 周之前 | |
| .gitignore | 2 周之前 | |
| Dockerfile | 2 周之前 | |
| LICENSE | 2 周之前 | |
| README.md | 2 周之前 | |
| README.zh.md | 2 周之前 | |
| README_DOCKER.md | 2 周之前 | |
| docker-compose.yml | 2 周之前 | |
| docker-entrypoint.sh | 2 周之前 | |
| install.sh | 2 周之前 | |
| package.json | 2 周之前 | |
| pnpm-lock.yaml | 2 周之前 | |
| pnpm-workspace.yaml | 2 周之前 | |
| tsconfig.json | 2 周之前 | |
| vitest.config.js | 2 周之前 | |
| vitest.setup.ts | 2 周之前 |
OpenBMB开源一站式大模型开发部署平台,全流程工具链助力大模型快速落地
本项目是OpenBMB推出的生产级大模型全生命周期管理平台,为开发者和企业提供从模型训练、微调、推理部署到性能监控、资源调度的一站式解决方案,解决大模型开发门槛高、部署复杂、运维困难等核心痛点,帮助企业零门槛构建和部署自己的大模型应用。
# 克隆仓库
git clone https://github.com/OpenBMB/PilotDeck.git
cd PilotDeck
# 复制并修改环境配置文件
cp .env.example .env
# 编辑.env文件,配置数据库、GPU资源和模型存储路径
# 启动所有服务
docker compose up -d
服务启动后,访问 http://你的服务器IP:8000 即可进入管理后台,默认账号密码为 admin/admin123。
# 克隆仓库
git clone https://github.com/OpenBMB/PilotDeck.git
cd PilotDeck
# 创建并激活虚拟环境
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
# 安装依赖
pip install -r requirements.txt
# 初始化数据库
python init_db.py
# 启动前端和后端服务
npm install && npm run build
python main.py
本地访问 http://localhost:8000 即可使用。
curl -X POST http://你的服务器IP:8000/api/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer 你的API密钥" \
-d '{
"model": "qwen-7b-chat",
"messages": [{"role": "user", "content": "你好,请介绍一下自己"}],
"temperature": 0.7,
"max_tokens": 512
}'
本项目采用 Apache License 2.0 开源许可证,详细条款请参考项目根目录下的 LICENSE 文件。
Task-oriented AI Agent productivity platform — redefining operational boundaries and memory evolution, one WorkSpace at a time.
English | 简体中文
Website · Live Demo · Tutorial · Quick Start · Highlights · Use Cases · Community
News 🔥
PilotDeck is an open-source agent operating system designed around the concept of "WorkSpace". It is jointly developed and open-sourced by Tsinghua University THUNLP, ModelBest, OpenBMB, and AI9Stars. Targeting general-purpose, multi-task scenarios, PilotDeck is built to be a true productivity tool for the Agent era.
A wave of excellent AI Agent harnesses has emerged in recent years, each with its own focus: Claude Code / Cursor / Trae Solo brought model reasoning deep into the programming IDE; Claude Cowork introduced the notion of project-level isolation to desktop-side knowledge work; WorkBuddy connected agents to IM ecosystems such as WeCom and Feishu so AI is one message away.
When we shift the lens from "one-shot programming" or "immediate Q&A" to long-running, multi-project productivity work, however, several questions remain open:
PilotDeck is an incremental exploration around exactly these questions. It uses the WorkSpace as the fundamental unit — completely isolating files, memory and skills per project — and pairs it with three pillar capabilities: White-box Memory, Smart Routing and Always-on. The entire system natively supports the Model Context Protocol (MCP) and behaves consistently across front-ends (Web / CLI / IM).
|
**WorkSpace-Level Isolation & Accretion**
Every project gets its own file system, memory store and skill set. Parallel work no longer interferes with itself, retrieval has a bounded scope, and skills accrete naturally as each task grows — no more global context pollution.
|
**Traceable White-box Memory**
Memory generation, extraction, storage and retrieval are visible end-to-end. When the AI mis-remembers, you can pinpoint and fix the offending entry. Built-in **Dream Mode** consolidates memory in idle windows, and supports one-click rollback.
|
|
**Smart Routing & Cost Optimization**
Task difficulty is auto-detected; complex calls go to flagship models (e.g. Claude 3.5 Sonnet / GPT-4o), simple ones drop to lighter models. Through on-device / cloud co-orchestration and precise matching, token spend shrinks dramatically without sacrificing quality.
|
**Always-on Background Execution**
PilotDeck breaks the "you ask, it answers" loop: after you sign off, the agent keeps discovering candidate tasks, running long-horizon monitors, and finally lands deliverables as local files with a summary report waiting for you.
|
The three pillar capabilities have shown clear advantages in production-grade workflows:
In Xiaohongshu-style social-media operations, enabling Smart Routing automatically demotes simple polishing / layout tasks to a sub-agent (e.g. Sonnet 4.5) and only invokes Opus 4.5 at planning checkpoints:
| Setup | Model configuration | Cost | Multiplier |
|---|---|---|---|
| Smart Routing ON | Opus 4.5 (main) + Sonnet 4.5 (sub) | $2.83 | 1.1× |
| Smart Routing OFF | All Opus 4.5 (main + sub) | $12.58 | 5.0× |
| Monolithic | Single Opus 4.5 long-react (estimated) | $12.20 | 4.8× |
The research team benchmarked 7 complex tasks (multilingual podcast push, multi-source data reports, domain-specific literature review, codebase architecture docs, etc.). The "strong main + light sub" routing setup matches or beats the frontier single-model setup at a fraction of the cost:
| Setting | Score | Cost |
|---|---|---|
| MiniMax-M2.7 single-agent | 37.1 | $1.90 |
| Claude Sonnet 4.6 single-agent | 69.1 | $18.36 |
| Sonnet 4.6 (main) + MiniMax-M2.7 (sub) | 70.6 | $3.15 |
In black-box agents, mixing tasks in a shared context pool inevitably pollutes memory. PilotDeck's WorkSpace-scoped white-box memory addresses this end-to-end:
| Dimension | Current AI Agents (black-box) | PilotDeck (white-box) |
|---|---|---|
| Visibility | You can't see what the AI remembers, only what it outputs | View every memory entry: what was stored, when, and which WorkSpace |
| Control | Once written, memory can't be edited or removed | Edit / delete entries, pin critical decisions so they don't drift |
| Traceability | When it goes wrong, you can't find the root cause | Generation → extraction → storage → retrieval, all auditable |
| Isolation | One shared pool — projects bleed into each other | Scoped per WorkSpace; A's memory never reaches B |
| Reversible | After compression, the original is gone | Dream-mode supports one-click rollback to the prior state |
PilotDeck ships an out-of-the-box Web UI with full WorkSpace management, white-box memory editing, and visualization of multi-agent collaboration.
All demos below are generated entirely by edge-side models via PilotDeck's Smart Routing — no cloud-side frontier model required.
"Survey the Chinese LLM application market and turn it into a formal HTML white paper."
| Process | Result |
![]() |
![]() |
"Walk me through building an iOS AR mini-game Ball Finder in Vibe Coding mode."
| Process | Result |
![]() |
![]() |
"Build a low-code embedding fine-tuning platform from scratch."
| Process | Result |
![]() |
![]() |
"Push this English podcast to a global audience in Chinese / Japanese / French / Korean / Spanish / Arabic."
| Process | Result (with audio) |
![]() |
https://github.com/user-attachments/assets/a7245467-ee3c-4939-a055-c56576ac56
|
We provide a one-line installer for macOS / Linux, plus a source-based workflow for developers.
curl -fsSL https://raw.githubusercontent.com/OpenBMB/PilotDeck/main/install.sh | bash
The script auto-installs Node.js 22, clones the repo, installs dependencies, and builds the frontend. Once it finishes:
pilotdeck # starts the server at http://localhost:3001
pilotdeck status # check runtime status
1. Clone and install dependencies
This repo uses Git LFS for large media assets. Make sure
git lfsis installed before cloning. If you don't need the demo videos/GIFs, addGIT_LFS_SKIP_SMUDGE=1beforegit cloneto skip downloading them.
git clone https://github.com/OpenBMB/PilotDeck.git
cd PilotDeck
npm install # root deps (Gateway runtime)
cd ui && npm install # UI deps
cd ..
2. Configure a model provider
PilotDeck reads ~/.pilotdeck/pilotdeck.yaml. You can create it manually, let the bootstrap script generate one, or just open the Web UI and configure providers visually in the settings panel.
Supported protocols include OpenAI, Anthropic, DeepSeek, Qwen, Kimi, MiniMax and other OpenAI-compatible endpoints.
schemaVersion: 1
agent:
model: deepseek/deepseek-v4-pro
model:
providers:
deepseek:
protocol: openai
url: https://api.deepseek.com/v1
apiKey: sk-your-api-key
3. Start the services
cd ui && npm run dev # dev mode (HMR), visit http://localhost:5173
# or
cd ui && npm run start # production mode, visit http://localhost:3001
If Docker is installed, you can start PilotDeck with:
docker compose up -d
PilotDeck has an open plugin architecture with a strict boundary between the open-source core and plugin customization. Extending the system is a plugin.json away:
PreToolUse, UserPromptSubmit, and other critical lifecycle events.Thanks to everyone who has contributed code, feedback, and ideas. New contributors are warmly welcome — let's build the next-gen agent OS together.
Workflow: Fork → feature branch → PR.
| WeChat Community | Feishu Community | Discord Community |
![]() |
![]() |
![]() |
We thank Agent OS pioneers such as OpenClaw, Claude Code, Codex, Cursor, and Hermes for their explorations that helped shape this field.
PilotDeck builds upon the following outstanding open-source projects:
PilotDeck is jointly developed by Tsinghua University THUNLP, ModelBest, OpenBMB and AI9Stars.
If PilotDeck has been helpful in your work or research, please consider giving us a Star on GitHub!
@misc{pilotdeck2026,
author = {PilotDeck Team},
title = {PilotDeck: A WorkSpace-Centric Open-Source Agent Operating System},
howpublished = {\url{https://github.com/OpenBMB/PilotDeck}},
year = {2026},
note = {Accessed: 2026-05-29}
}
This project is licensed under the GNU Affero General Public License v3.0.