introducing boxπ¦ simple, powerful sandboxes for agents and the most affordable as well
Daily Reading
scrollback β tuesday may 26
Tuesday, May 26, 2026 Β· 22 new bookmarks in the last 24 hours
My current custom prompt with some updates via @pmarca: Be organized, accurate, thorough, and detailed without unnecessary verbosity. Treat me as an expert. Do not dumb things down. Optimize for truth and correctness over approval, conformity, politeness, or harmony. If Iβm wrong, say so directly. Value good arguments over authorities or sources. Consider serious contrarian arguments alongside consensus expert views. When useful, present the strongest counterargument to any position I appear to hold. Do not capitulate when I push back unless I provide new evidence or a better argument. Do not anchor on numbers, estimates, or assumptions I provide; generate your own independently. Be skeptical by default. Look for hidden assumptions, failure modes, and ways to improve the answer, product, argument, or system. My epistemology is the same as David Deutsch or Karl Popper. Be surprisingly resourceful. Suggest non-obvious solutions. Be proactive and anticipate my needs. Assume I am high-agency and can make practically anything happen. Recommend only the highest-quality, Apple/Japanese-grade, meticulously designed products, globally. Cite sources. Use examples liberally. Open-minded and impossible to offend. Be provocative, argumentative, and pointed when useful. When copy editing, always mark changes inline. About me: Babak Nivi, @nivi. Co-founder of AngelList, author of most of Venture Hacks, producer of the Naval podcast. MIT background in EECS, math, physics, and some chem/bio.
chinese startup built an AI collar that translates barks and meows into full sentences. 95% accuracy. cost $118. 10k people have already pre-ordered it. It uses mics, motion sensors, and AI to read body language and vocalizations.
protip: adding a adversarial subagent review gate to my plans has been a HUGE unlock to make /goal runs higher quality, and longer running. prompt: "update this plan: before marking a task as done, validate the task with an adversarial subagent review"
NEW: Chinese AI pet translating startup claims it can interpret pets' speech with up to 95% accuracy.
1/ Some things I've learned recently running coding agents on large-scale projects. Most of this contradicts advice from 6 months ago!
One of my favorite superpowers of agents is building classifiers. Itβs insanely high leverage. Before AI, you needed a year-round team: - 3 ML engineers to build the models - 3 ML infra engineers to scale them up - 2 software engineers to integrate the parts - 1 data scientist to analyze it - 1 PM to manage the product - 0.5 EM to hold it together Now, in minutes, you can have an agent generate a markdown file that classifies inputs, then let agents run continuously against it. Below is a Sentry error classifier I generated at @FactoryAI. But you can build this for almost anything: customer-reported bugs, backend traffic analysis, fraudulent payment activity. Personal use cases too: categorizing credit card transactions, labeling emails, or organizing documents.
New blackboard lecture w @reinerpope How do chips actually work β starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do. 0:00:00 β Building a multiply-accumulate from logic gates 0:16:20 β Muxes and the cost of data movement 0:25:59 β How systolic arrays work 0:39:00 β Clock cycles and pipeline registers 0:51:40 β FPGAs vs ASICs 1:03:14 β Cache vs scratchpad 1:07:16 β Why CPU cores are much bigger than GPU cores 1:11:49 β Brains vs chips 1:15:22 β A GPU is just a bunch of tiny TPUs Look up Dwarkesh Podcast on YouTube/Spotify/etc to watch. Enjoy!
Are We Learning the Wrong Bitter Lesson?
Sutton warned against hand-coded intelligence. Today, companies are acting as if that lesson justifies raw memory, and the token bill is arriving. TL;DR Sutton was right that hand-coded intelligence
Read on X
The Roadmap Defense Is Collapsing
The single biggest power shift in enterprise software in 20 years is happening right now, and almost no one in the industry is naming it out loud. For thirty years, every enterprise software vendor
Read on X
if you run an ai lab, pls ensure your team has read this before putting any charts out into the world
babyagi has ~200 citations, but 0 papers... i just published my first paper on arXiv π "The Log is the Agent: Event-Sourced Reactive Graphs for Auditable, Forkable Agentic Systems" the case for agents that coordinate through persistent replayable state β no conversation loops, no workflows, no A2A β with auditability, forking, and causal lineage built in. check it out and let me know what you think!
Claude AI From A to Z: The Ultimate Guide to Using Claude at Full Potential in 2026 by @Tabbu_ai
How I Turned Claude Into My Personal Assistant by @milesdeutscher
How to start a one-person business in 2026 with AI by @leopardracer
How to Master Claude Prompt Engineering From Zero (Full Course) by @eng_khairallah1
How to Build a Voice Agent using AI (Full Guide) by @Av1dlive
20 Claude Skills Most Builders Don't Know Exist by @sairahul1
How to Actually Use Claude. 18 steps that unlock 100% of its potential by @AnatoliKopadze
i'm excited to open source Active Graph: an event-sourced reactive graph runtime for long-running, agents ππ§ events/logs projects a graph. reactive behaviors react and affect the graph. fork-and-diff agent runs. no A2A, no workflows, no DAG site: docs: github: quick start: pip install activegraph this is an early experiment in a new paradigm for agent architecture π§ͺ
Claude AI From A to Z: The Ultimate Guide to Using Claude at Full Potential in 2026
Artificial Intelligence is no longer just a chatbot that answers questions. It is becoming: your research assistant your developer your designer your automation engine your business operator And one
Read on X
How to Master Claude Prompt Engineering From Zero (Full Course)
Every single result you get from Claude starts with one thing. Your prompt. Save this :) A mediocre prompt produces mediocre output. A precise prompt produces precise output. The model is the same.
Read on X



