Daily Reading

scrollback β€” tuesday may 26

Tuesday, May 26, 2026 Β· 22 new bookmarks in the last 24 hours

introducing boxπŸ“¦ simple, powerful sandboxes for agents and the most affordable as well

My current custom prompt with some updates via @pmarca: Be organized, accurate, thorough, and detailed without unnecessary verbosity. Treat me as an expert. Do not dumb things down. Optimize for truth and correctness over approval, conformity, politeness, or harmony. If I’m wrong, say so directly. Value good arguments over authorities or sources. Consider serious contrarian arguments alongside consensus expert views. When useful, present the strongest counterargument to any position I appear to hold. Do not capitulate when I push back unless I provide new evidence or a better argument. Do not anchor on numbers, estimates, or assumptions I provide; generate your own independently. Be skeptical by default. Look for hidden assumptions, failure modes, and ways to improve the answer, product, argument, or system. My epistemology is the same as David Deutsch or Karl Popper. Be surprisingly resourceful. Suggest non-obvious solutions. Be proactive and anticipate my needs. Assume I am high-agency and can make practically anything happen. Recommend only the highest-quality, Apple/Japanese-grade, meticulously designed products, globally. Cite sources. Use examples liberally. Open-minded and impossible to offend. Be provocative, argumentative, and pointed when useful. When copy editing, always mark changes inline. About me: Babak Nivi, @nivi. Co-founder of AngelList, author of most of Venture Hacks, producer of the Naval podcast. MIT background in EECS, math, physics, and some chem/bio.

chinese startup built an AI collar that translates barks and meows into full sentences. 95% accuracy. cost $118. 10k people have already pre-ordered it. It uses mics, motion sensors, and AI to read body language and vocalizations.

protip: adding a adversarial subagent review gate to my plans has been a HUGE unlock to make /goal runs higher quality, and longer running. prompt: "update this plan: before marking a task as done, validate the task with an adversarial subagent review"

NEW: Chinese AI pet translating startup claims it can interpret pets' speech with up to 95% accuracy.

1/ Some things I've learned recently running coding agents on large-scale projects. Most of this contradicts advice from 6 months ago!

One of my favorite superpowers of agents is building classifiers. It’s insanely high leverage. Before AI, you needed a year-round team: - 3 ML engineers to build the models - 3 ML infra engineers to scale them up - 2 software engineers to integrate the parts - 1 data scientist to analyze it - 1 PM to manage the product - 0.5 EM to hold it together Now, in minutes, you can have an agent generate a markdown file that classifies inputs, then let agents run continuously against it. Below is a Sentry error classifier I generated at @FactoryAI. But you can build this for almost anything: customer-reported bugs, backend traffic analysis, fraudulent payment activity. Personal use cases too: categorizing credit card transactions, labeling emails, or organizing documents.

New blackboard lecture w @reinerpope How do chips actually work – starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do. 0:00:00 – Building a multiply-accumulate from logic gates 0:16:20 – Muxes and the cost of data movement 0:25:59 – How systolic arrays work 0:39:00 – Clock cycles and pipeline registers 0:51:40 – FPGAs vs ASICs 1:03:14 – Cache vs scratchpad 1:07:16 – Why CPU cores are much bigger than GPU cores 1:11:49 – Brains vs chips 1:15:22 – A GPU is just a bunch of tiny TPUs Look up Dwarkesh Podcast on YouTube/Spotify/etc to watch. Enjoy!

Article x.com

Are We Learning the Wrong Bitter Lesson?

Sutton warned against hand-coded intelligence. Today, companies are acting as if that lesson justifies raw memory, and the token bill is arriving. TL;DR Sutton was right that hand-coded intelligence

Read on X
Article x.com

The Roadmap Defense Is Collapsing

The single biggest power shift in enterprise software in 20 years is happening right now, and almost no one in the industry is naming it out loud. For thirty years, every enterprise software vendor

Read on X

babyagi has ~200 citations, but 0 papers... i just published my first paper on arXiv πŸ˜† "The Log is the Agent: Event-Sourced Reactive Graphs for Auditable, Forkable Agentic Systems" the case for agents that coordinate through persistent replayable state β€” no conversation loops, no workflows, no A2A β€” with auditability, forking, and causal lineage built in. check it out and let me know what you think!

i'm excited to open source Active Graph: an event-sourced reactive graph runtime for long-running, agents πŸ”„πŸ§  events/logs projects a graph. reactive behaviors react and affect the graph. fork-and-diff agent runs. no A2A, no workflows, no DAG site: docs: github: quick start: pip install activegraph this is an early experiment in a new paradigm for agent architecture πŸ§ͺ

Article x.com

Claude AI From A to Z: The Ultimate Guide to Using Claude at Full Potential in 2026

Artificial Intelligence is no longer just a chatbot that answers questions. It is becoming: your research assistant your developer your designer your automation engine your business operator And one

Read on X
Article x.com

How to Master Claude Prompt Engineering From Zero (Full Course)

Every single result you get from Claude starts with one thing. Your prompt. Save this :) A mediocre prompt produces mediocre output. A precise prompt produces precise output. The model is the same.

Read on X