Home/Personal AI/Personal AI Delegate (PAID)
EN中文
Product

Personal AI Delegate (PAID)

Jimmy's self-built "personal AI delegate" — user-authorized, user-accountable, information belongs to the user; the AI version of the principal-agent relationship in the same category as lawyer / accountant / doctor.

1. Core Product / Service

PAID is a personal delegate agent that runs in IM, letting the AI handle asynchronous collaboration with other people or other agents within scopes the user has authorized.

  • Three-tier response modes: direct answer / request authorization / decline; initial human-machine ratio 50/50, target reaching 80%+ auto-answer.
  • RAG input layer: locally authorized information (owner.json / persona.md / sop.md / counterparties/) + web search tool combination.
  • Multi-platform IM: Phase 1 doubles down on Slack, Phase 2 bridges across organizations, Phase 3 fans out to Lark / Telegram / Slack multi-platform.
  • Local Dashboard: approval queue, outbound queue, audit log visualization (v0.5 is CLI for now, dashboard is a W2+ TBD).
  • Stack: runtime runs on hermes-agent, IM gateway reuses openclaw, peer review references review-agent; model routing via openrouter.
  • Deployed form: hermes plugin paid-v1, MIT open-sourced from 2026-05-02. Install = clone to ~/.hermes/plugins/paid-v1.
  • Four-layer pipeline: L1 input regex / L3 approval loop (slash command version, no card button) / L4a + L4b output regex (observer-only) / L4c LLM post-check and L4d source attribution are listed as W2 TBD.

2. Target Users & Pain Points

Primary persona: Seniors managing Juniors — founders / research leads / training mentors / senior professionals. The pain is being drowned in low-density queries ("how do I do this", "how did you handle that before"), yet unable to outsource directly because the cost of a wrong answer falls on them.

Jimmy himself (dogfood #1): after exiting crypto, he is back in a state of high-frequency IM queries with demos built but never pushed. PAID both screens queries for him and forcibly breaks the "are they worthy of me" filter pattern, as a contract project for May external exposure.

Relationship to ai-human-hybrid / human-in-the-loop-ai: the core mechanism is "auto within scope + escalate when out of bounds", not full autonomy and not pure copilot.

3. Competitive Landscape

Project Form Difference vs PAID
Rabbit R1 Standalone hardware + LAM Device form, OS-level agent; PAID is a persona agent inside IM, no hardware
Humane AI Pin Wearable (already stalled) Negative case: hardware paths divorced from existing IM/OS are hard to land
General ChatGPT / Claude.ai User-invoked chatbot Lacks principal-agent authorization semantics, lacks proactive outbound, lacks approval queue
Cosmos / various AI assistants "Concierge" productivity copilot Heavy on calendar / email management; PAID is heavy on "replying to others on your behalf" outbound delegation
review-agent Peer review / review scenarios Same-stack sibling product, but review-agent is single-task, PAID is a resident persona

Differentiation anchors: (1) runs in the user's existing IM rather than a new app; (2) explicit principal-agent legal metaphor rather than "AI assistant"; (3) output layer regex + LLM dual gate — out-of-bounds, prefer placeholder/decline over making things up.

4. Unique Observations

  • Design tradeoffs: v0.5 cut the dashboard, card button, auto counterparty discovery, FTS5, and web retrieval, forcing the minimum closed loop on the J3 approval loop (slash command suffices) — prove the 3-state pipeline runs end-to-end before expanding.
  • Key technical risk (design/01_review_decisions §3.3): whether relying on pre_llm_call to inject context so that the LLM proactively says placeholder / decline is reliable; if unreliable, fall back to a pre_gateway_dispatch hook that rewrites the message. At that point Layer 4a/4b must upgrade from observer to enforcer.
  • Anti-consensus success criteria: May success = at least 5 of 7 falsifiable indicators checked + sustained daily reflection ≥ 20 days, not product perfection. Emphasizes "whether it has been pushed in front of users" over "whether it's done well".
  • Writing pipeline: daily reflection (daily/YYYY-MM-DD.md) + Sunday sync (sunday-sync.sh) + Sprint log (SPRINT_LOG.md), forcing audit on own output.

5. Financials

  • Budget: $5000 / month (May, self-funded), 40h / week invested; no fundraising, no hires.
  • Monetization path: no charging in May, the goal is "be used" and "be written about". Decide end of month whether to enter a paid experiment.
  • Infrastructure cost: models routed via openrouter, IM gateway reuses the existing openclaw VPS (sunk cost), so marginal cost is mainly LLM tokens + own time.

6. People & Relationships


Sources

  • local: 2026-05-01-summary.md
  • local: memory/project_paid_may (W0/Sprint #1+#2 progress, design tradeoffs, May path)
Last compiled: 2026-05-09