Master Plan v15 (Web Summary)
This page is a human-readable snapshot of the Monkey‑Head‑Project / HueyOS master plan as it exists today. It’s written to be:
- readable by humans,
- linkable from the website,
- and stable enough to survive “site refactors” without becoming a blank page.
Canonical references: the HueyOS / Monkey‑Head‑Project repositories on GitHub.
1) What Huey is
Huey is an offline‑first lab partner: a modular robotics shell with a local compute core that can run, reason, and log decisions without relying on cloud services.
The project’s public-facing software stack and documentation is referred to as HueyOS.
2) System architecture (high level)
Huey is designed as a single “core node” orchestrating four GPU‑based districts:
- CPU = Overseer / Orchestrator
- 4× GPU districts (Spark / Volt / Zap / Watt) = primary parallel compute
Districts are intentionally treated as semi-independent “regions” to support governance separation, auditability, and modular scaling.
3) Canonical hardware baseline (current target)
The baseline is intentionally uniform across nodes so a build can be replicated:
- CPU: Intel Core i9‑14900K
- Motherboard: ASUS TUF Gaming Z790‑PLUS WiFi
- Memory: DDR5 (DDR4 minimum supported for non‑core lab tech)
- GPUs: 4 total (1 per district), 2019+ required, 2020+ preferred
- recommended minimum VRAM: 16 GB per GPU (64 GB total)
- absolute minimum VRAM: 12 GB per GPU (not recommended)
- Storage: “Honeycomb” approach built around RAID 10 NVMe
- Power: minimum 2 PSUs, 3–4 preferred, and an eventual “per‑GPU PSU” model for the prototype
Cooling is treated as a first‑class subsystem (see below).
4) Cooling + physical shell
The Robotics V3 shell is layered and salvage‑driven:
- base platform on casters
- lower housing for the core compute
- mid structure + upper housing for the GPU districts
- top section: animatronic monkey head + microphone
Cooling direction:
- CPU uses a custom liquid loop (reservoir + pump)
- GPU liquid cooling planned (full or augmented), integrated into overall loop
- target coolant volume: ~4L once fully online
5) HueyPulse (always‑on subcontroller)
HueyPulse is the “heartbeat” subsystem that remains online even when the main node is down.
Responsibilities:
- thermal monitoring
- pump control
- sensor polling / logging
- basic actuation safety
HueyPulse is intended to be backed by a UPS to maintain continuity during power loss.
6) Governance model (Cloud Pyramid)
Huey’s internal decision structure is modeled as a distributed government:
- Parliament
- Presidency
- Supreme Court
- 4 GPU districts (Spark / Volt / Zap / Watt)
Each district hosts a population of AI “citizens”, with quota/token budgeting per cycle. Decisions are tracked for auditability (append-only logs + indexing).
7) Software stack snapshot
- Debian 14 “Forky” (migration from Debian 13 “Trixie”)
- Python 3.12–3.14 range (current work often on 3.13)
- custom low-latency Huey kernel (6.17+ lineage)
- orchestration: PyGPT‑net, Ollama backend, Whisper for STT (where applicable)
- memory + governance logs: append-only JSON + SQLite indexes
8) Roadmap philosophy
This project is intentionally a roadmap as much as it is a build:
- open-source
- modular + replicable
- heavy emphasis on documentation that explains not just what, but why
Status note
This is a living plan. The “final” layout decisions (PSU placement, exact cooling routing, final GPU SKU selection) are intentionally deferred until the acquisition phase.