Muhammad Farooq · Writing

Essays on retrieval systems, large language models, agents, and the engineering around them.
https://engineerprompt.ai/

How agent harnesses manage context: cap, slice, search, store
https://engineerprompt.ai/writing/agent-context-management/
What occupies an agent's context window and the four moves harnesses use when content does not fit: cap it, slice it, search it, or store it elsewhere.
Thu, 11 Jun 2026 00:00:00 GMT

Compaction is the hardest problem in agent engineering
https://engineerprompt.ai/writing/compaction/
Why agent harnesses summarize old history, what a careless summary destroys, the failure modes that follow, and the patterns that make compaction safe.
Thu, 11 Jun 2026 00:00:00 GMT

DeepSeek visual primitives: teaching models to reason with a cursor
https://engineerprompt.ai/writing/deepseek-visual-primitives/
Notes on DeepSeek's briefly public paper Thinking with Visual Primitives: boxes, points, and paths placed inside the reasoning trace, and its honest limits.
Thu, 11 Jun 2026 00:00:00 GMT

DiffusionGemma: what Google's open text diffusion model actually changes
https://engineerprompt.ai/writing/diffusion-gemma/
Notes on DiffusionGemma, Google's first open-weight text diffusion model: how block diffusion refines a 256-token canvas in parallel, the official speed and benchmark numbers, and what it takes to run locally.
Thu, 11 Jun 2026 00:00:00 GMT

DwarfStar 4: how a 284B model runs on a MacBook
https://engineerprompt.ai/writing/dwarfstar-4/
A 284B parameter model needs 568 GB stored normally. DwarfStar runs it on 128 GB machines at usable speeds. The quantization recipe, SSD streaming, and the numbers.
Thu, 11 Jun 2026 00:00:00 GMT

How to evaluate an agent harness
https://engineerprompt.ai/writing/evaluating-harnesses/
Harness configurations cluster at 74-76% resolve rate while cost varies fourteen times. A five-step method for judging harnesses on accuracy and cost.
Thu, 11 Jun 2026 00:00:00 GMT

Loop engineering: what it is, when to use it, and when to stay away
https://engineerprompt.ai/writing/loop-engineering/
Loop engineering means designing systems that prompt your agents instead of prompting them yourself. What a loop is, what a serious one needs, and the caveats that matter.
Thu, 11 Jun 2026 00:00:00 GMT

RAG beyond similarity search: how a modern retrieval pipeline works
https://engineerprompt.ai/writing/rag-beyond-similarity-search/
Traditional RAG embeds chunks and hopes similarity search finds the right ones. What replaced it: hybrid retrieval, reranking, enrichment, verification, with localGPT as a working example.
Thu, 11 Jun 2026 00:00:00 GMT

Sub-agents: when one context window is not enough
https://engineerprompt.ai/writing/sub-agents/
Why single-context agents hit a wall, how harnesses isolate work in child agents with the spawn, restrict, collect pattern, and when delegation backfires.
Thu, 11 Jun 2026 00:00:00 GMT

Tools vs skills vs MCP: how agents acquire capabilities
https://engineerprompt.ai/writing/tools-skills-mcp/
Tools are primitives. Skills are knowledge. MCP is neither: a protocol that connects external tool servers to any harness. How the three fit together.
Thu, 11 Jun 2026 00:00:00 GMT

Harness engineering: why agent performance now lives outside the model
https://engineerprompt.ai/writing/harness-engineering/
Same model, same benchmark, six times the performance difference. Two March 2026 papers show the code around the model now matters more than the model. Here is what they found.
Wed, 10 Jun 2026 00:00:00 GMT

What is an agent harness? The nine components of a great one
https://engineerprompt.ai/writing/what-is-an-agent-harness/
A harness is the fixed architecture that turns a model into an agent. What it is, how it differs from a framework, and the nine components every modern harness needs.
Wed, 10 Jun 2026 00:00:00 GMT