Grok & agent systems — May 2026¶
Strategic snapshot: Grok product posture + Crusty (Hermes) local agent stance + wiki maintenance workflow for https://wiki.darcymenard.com. Cross-links: projects/crusty, Grok-Brief, projects/personal-projects-wiki, projects/openclaw-autonomy-org.
Capability bullets below reflect xAI Grok circa mid-May 2026 (Skills ship, Grok Build beta, tiers). Re-check vendor docs before wiring production automations.
Latest Grok capabilities (mid-May 2026)¶
- Skills system (shipped ~May 12–13): reusable, modular workflows and persistent expertise. Invoke via
/skill-nameor/skill-creator. Supports document generation (docx, pdf, pptx, xlsx), file operations in sandbox, custom automations, and treats Grok as a persistent workspace layer — prebuilt and fully custom skills; integrates with agents. - Grok 4.3 + computer use: native ability to write/run code, install dependencies, and produce real output files (documents, presentations, spreadsheets, etc.) inside a sandboxed computer environment. Rolling out to SuperGrok / Premium+ users.
- Grok Build (early beta, May 14): terminal-based CLI coding agent for professional software engineering — plan/review/approve workflow, parallel sub-agents, integrates with Skills, AGENTS.md, plugins, MCP. Tier-limited (SuperGrok Heavy, ~$300/mo at time of writing). Strong fit for complex codebases and heavy automations.
- Multi-agent architecture (Grok 4.20 /
grok-4.20-multi-agent): multiple specialized agents collaborate in parallel on a shared backbone for deeper research, reasoning, and synthesis. Agent count and effort are configurable. - Connectors: native OAuth integrations (Gmail, Google Calendar, Drive, GitHub, Notion, Slack, etc.) for direct read/write inside Grok — scoped and revocable. Caution: prompt-injection risk on sensitive accounts; treat connected inboxes/docs as hostile input surfaces.
- Tiers: X Premium+ and SuperGrok cover Skills, computer use, and core models solidly for most workflows. SuperGrok Heavy unlocks Grok Build CLI and higher limits / early access.
Crusty + local agent strategy (Hermes + Grok OAuth)¶
Crusty (dedicated local hardware — see projects/crusty) is the home for persistent agents tuned for always-on work: email/calendar summarization, reminders, content pipelines, publishing gates, etc.
Recommended harness: Hermes Agent (Nous Research — open-source, self-hosted, framed as self-improving) with native xAI Grok OAuth (browser login; uses existing SuperGrok / Premium+ subscription — no separate XAI_API_KEY for covered Grok paths). Hermes supports a Telegram gateway, custom skills/plugins, MCP, memory, and tool use; it can operate on local filesystem, Git, and browser actions where configured.
Why Hermes on Crusty (vs cloud-only Grok)¶
- Isolation for sensitive access (e.g. Gmail/Calendar scopes) under your machine and policy boundaries.
- Always-on without relying on cloud round-trips for routine automation.
- Self-improving skills loop aligns with iterated local agent ops.
- Continuity with the existing projects/crusty / projects/openclaw-autonomy-org philosophy (delegated autonomy, Telegram gates, audited execution).
Hybrid model: local Crusty + cloud Grok¶
| Layer | Role |
|---|---|
| Local (Crusty + Hermes) | Always-on schedules, Telegram remote control, sensitive-account integrations where posture demands it, routine wiki/repo maintenance intent, sandboxed execution, Git operations after approval. |
| Cloud (Grok grok.com / subscription Grok) | Heavy multi-agent reasoning, deep synthesis, high-quality documents/presentations (Skills + computer use), rapid prototyping — use when frontier tooling or parallel agents beat local economics. |
Handoff pattern: use cloud Grok for hard conversations or research; emit structured update payloads (sections, bullets, filenames) for the local agent to apply, diff, gate, commit, push.
Wiki maintenance workflow (wiki.darcymenard.com)¶
Goal: Keep the public wiki (projects/personal-projects-wiki) a living, agent-maintained knowledge base driven from conversations — without moving public hosting onto Crusty.
Architecture¶
- Source of truth: Git repo on Crusty — Markdown under
wiki/. - Agent loop: edits locally → commits with clear messages → pushes.
- Hosting: Vercel (or equivalent) continues auto-deploy on push (see MkDocs
site_url: https://wiki.darcymenard.com/; other deploy notes on tools-and-repos). - Remote control: Telegram bot attached to Hermes on Crusty (same gate philosophy as projects/crusty dailies).
Update flow (target)¶
- Chat in cloud Grok for synthesis — or explicitly ask for a wiki update payload (structured deltas).
- Or message the Telegram bot with natural-language instructions.
- Hermes (via a dedicated WikiMaintainer skill) extracts relevant facts, proposes structured edits, surfaces diff + summary for approval.
- On approval: patch Markdown → commit → push.
- Vercel deploys → confirmation + published link back to human.
WikiMaintainer skill (Hermes — to build)¶
| Property | Choice |
|---|---|
| Scope | Wiki directory only (path allowlist); no unrelated filesystem. |
| Git | pull, edit, diff, commit, push — push only behind explicit approval. |
| Outputs | Consistent headings: projects tables, glossary touchpoints, dates, cross-links ([wiki/...](<wiki/...>)). |
| Governance | Mandatory approval before git push; session logging / monitoring where Hermes supports it. |
| Evolution | Treat prompts + checklists as a living skill — refine from each merge. |
Security (minimal bar)¶
- Deploy key / token: repo-scoped; minimum permissions for push to this wiki repo only.
- Filesystem sandbox within Hermes/agent config.
- Approval gate before any publish push (non-negotiable).
- Logging adequate to audit what changed and why.
- Public wiki keeps blast radius lower than automating private financial systems — still no excuse for careless credentials.
Why this approach¶
- Turns scattered chats into durable structured context.
- Enables mobile upkeep via Telegram.
- Keeps sovereignty for always-on, sensitive-ish surfaces while reserving cloud for frontier reasoning and doc generation.
- Low incremental risk: site is already public.
- Extends naturally to reports, glossary maintenance, status pages — same Git + approve + push backbone.
Next actions (execution checklist)¶
- Wire Hermes + Telegram gateway on Crusty end-to-end.
- Implement WikiMaintainer skill (scoped Git/files + structured proposal + approval).
- Configure repo credential posture + approval UX.
- Dry run: one small wiki section update (
git pull→ edit → show diff → approve → push → verify deploy).
Canonical cold-start routing for blank Grok threads remains Grok-Brief — this page is architecture/strategy, not the paste-pack.