I Gave My Hermes + OpenClaw Agents a Subconscious, and Now It Dreams 24/7 | Full Guide

Friday, April 3, 2026 AI

Scraped Article

You built your Hermes and Openclaw agents to run your workflows, but the more workflows you create, the less time you have to improve them. That is what the subconscious agent is for. Much like a human's subconscious, it continuously problem-solves to help your agents self-improve. It brainstorms, debates, refines, and writes the results back into the system so the next run starts smarter than the last. This is the difference between agents that guess improvements and agents that actually compound. That means less: background anxiety manual ops work repeated debugging mental overhead And more: shipping experimenting compounding Personally, I built the loop inside my Hermes agent workspace so it could stay close to my main workflows, stay configurable, and stay honest about what it knows. This guide is meant to be a template for all agents, so you can point your Hermes, OpenClaw, or other agent here and have it build a similar system. Why This Matters Most agent systems break in the same boring ways: They need babysitting they drift They burn tokens on vague exploration. They produce output, but not momentum. That is the trap. You spend more energy managing the system than using it. The subconscious agent flips that. Instead of asking, “What should this agent do right now?” the system keeps asking: What did we learn? What failed? What should we try next? What needs guardrails? What should be frozen until it earns trust? That is how you get a system that saves time, saves energy, saves tokens, and makes the whole stack feel a little bit like magic. The Architecture Think of it as a small but relentless loop: The system gathers evidence from its latest run. It generates candidate ideas. It debates those ideas against hard objections with a smarter agent. It synthesizes one recommendation that can be accepted or rejected. It writes the result into the state. The next run starts from that updated state instead of starting from zero. That last part is the real unlock. Most systems “remember” in a loose, fuzzy way. This one remembers by keeping the winning direction, the rejected paths, and the next improvement in a durable workspace. So the machine does not just answer. It learns how to answer better. What You Actually Need To Build It This is the part people usually skip. A self-improving agent system needs more than prompting. At a minimum, you need: 1. A runner Something has to coordinate the whole cycle: load the brief fetch current state run ideation run critique run synthesis write artifacts hand off the result In my setup, that is the runner. It is the control plane for the loop. 2. Persistent state The system needs memory that survives process restarts. That usually means: JSON for current summaries and governance JSONL for append-only history markdown for human-readable outputs a stable directory structure so later runs can pick up where the last one ended Without a durable state, the system cannot improve. It just reenacts the same conversation every time. 3. A scheduler or trigger source You need a way to decide when the loop runs: on a cron schedule after new metrics arrive after a live signal changes after a manual review request Please be realistic about how many times you want the subconscious to run on a given workflow each day. Too many runs can lead to excessive divergence from the original principles. 4. A transport or delivery layer The loop is useless if nobody sees the result. So you need a way to send the output somewhere useful: Discord (my setup) Telegram a file path a dashboard a task queue The transport should be separate from the reasoning layer. That keeps the model from becoming tightly coupled to one output channel. 5. A model router Different phases should use different models. For example: cheap/local model for ideation stronger model for challenge and synthesis execution model for artifact generation or final writes That split is what keeps costs sane and quality high. 6. A review and approval gate If the system can ship directly without a human check, you are no longer building an assistant loop. You are building an autopilot. This pattern keeps approval at the end, so the system can be smart without becoming reckless. Of course, it's up to you to decide which tasks and workflows you want to auto-approve, but it's best to test with human approval at the end. 7. Artifact writers The loop needs to write back into the filesystem in a predictable way. For example: `ideas/ideas-internal.jsonl` `debate/debate-log.jsonl.` `winning-concept.md` `improvement-backlog.md` `run-summary.json` These are the system's memories. What it actually does in an example Here is an example of the subconscious trying to think through content to create today. The subconscious layer creates a loop where: 1. `ideas/` stores candidate directions The subconscious starts by thinking in options, not conclusions. It looks at the evidence and generates a few possible directions. In this run, the names